Data Science Math Skills

本文发布于：2020年5月25日

字数：805字

时长：5分钟

Data Science Math Skills course provided by DUKE UNIVERSITY is designed to teach the vocabulary, notation, concepts, and algebra rules that all data scientists must know before moving on to more advanced material. The following are the notes I took during this course.

1. Sets

A set is made up of elements.

The cardinality (size) of a set is the number of elements in it.

$|A| = 4$ (there are 4 elements in A, so the cardinality is 4)

Intersections

The intersection is defined as elements that are in both sets. Symbol $∩$: “intersects” (and)

$A ∩ B = {x : x ∈ A and x ∈ B}$

If there are no elements in common, the answer is the empty set ∅. The cardinality of the empty set $|∅| = 0$

The union is defined as elements that are in either set. Symbol ∪: “union” (or)

$A ∪ B = {x ∈ A or x ∈ B}$

Visualizing sets: Venn diagrams

Inclusion-exclusion formula: $|A ∪ B| = |A| + |B| − |A ∩ B|$

2. Numbers

Integers and rational numbers: Some real numbers terminate, and some do not. The number π = 3.14159… is irrational, it does not repeat after the decimal point.

Absolute value: The absolute value of a number x, |x|, is the distance from x to 0.

Intervals and Interval Notation

Closed intervals $[2, 3.1]$
Open intervals $(5, 8)$
Half-open intervals $(2, 3], [20, 20.3)$

3. Sigma notation (Σ)

distributive property: $a(b + c) = ab + ac$

commutative property: $a + b = b + a$

4. Cartesian Plane

Axes and quadrants:

X−axis
Y−axis
first quadrant
second quadrant
third quadrant
fourth quadrant

Pythagorean theorem

Derivation using point-slope form

Slope-intercept form: If L has slope m, and hits the y-axis at (0, b), then y = mx + b is an equation for L, where m is the slope and b is the y-intercept.

Point-Slope Formula for Lines:

$y-y{0}=m\left( x-x{0}\right)$ Point-slope form
$y=mx+b$ Slope-intercept form

5. Functions & Tangent Lines

f : A → B

The Slope of a Graph at a Point: The slope of the tangent line gives the instantaneous rate of change. This is also called the derivative of the function at that point, or f(a).

The Derivative Function (Derivative formula): $\lim _{n\rightarrow 0}\dfrac{f\left( a+h\right) -f\left( a\right) }{h}$

6. Fast Growth, Slow Growth

Integer Exponents:

Multiplication rule: $x^{n}x^{m}=x^{m+n}$
Power to a power: $x^{n^{m}}=x^{nm}$
Product to a power: $\left( x\cdot y\right) ^{n}=x^{n}\cdot y^{n}$
Fraction to a power: $\left( \dfrac{x}{y}\right) ^{n}=\dfrac{x^{n}}{y^{n}}$
Division and negative powers

How Logarithms and Exponents Are Related:

$b^{x}=N$ “exponential form”
$x=\log _{b}N$ “logarithmic form”

Product rule: $\log(xy) = \log(x) + \log(y)$
Quotient rule: $\log ( \dfrac{x}{y}) = \log(x) − \log(y)$
Power and root rule: $\log \left( x^{n}\right) =nlog\left( x\right)$

7. Basic Probability Definitions

probability—the degree of belief in the truth or falsity of a statement

Range of uncertainty from 0 to 1

P(x) probability of x

∼x negation of statement x

joint probability—probability that two separate events with separate probability distributions are both true.

P(A and B) is written P(A, B), and read “the joint probability of A and B” or “the probability that A is true and B is true.”

8. Problem Solving Methods

Permutations and Combinations:

permutation—order matters, $\dfrac{n!}{\left( n-m\right) !}$
combination—order does not matter, n! / (m! * (n-m)!)

Using Factorial and “M Choose N”: $(m n) = m! / ((m − n)! · n!)$

The Sum Rule, Conditional Probability, and the Product Rule:

P(A) = P(A, B1) + P(A, B2) + … + P(A, Bn)
P(A | B) = (relevant outcomes) / (total outcomes remaining in universe, when B is true)
P(A | B) = P(A, B) / P(B)

9. Bayes’ Theorem

P(A | B) = P(B | A) * P(A) / P(B)

Technical vocabulary of Bayesian inverse probability: posterior probability = likelihood * prior probability / marginal probability

posterior probability—probability after new data is observed
prior probability—probability before any data is observed or before new data is observed
likelihood—standard forward probability of data given parameters
marginal probability—probability of the data

10. The Binomial Theorem and Bayes’ Theorem

Binomial theorem used when there are two possible outcomes—a success or a non-success, for example, flipping a coin—heads are a success, binary outcome.

Not limited to fair coins, where the probability of success is 0.5. Probability can be any value > 0 and < 1.

Probability of s successes in n trials, when probability of 1 success is p: $(n s) - p^s - (1 − p)^(n−s)$. where n is the number of independent trials (with replacement), s is the number of successes,and p is the probability of one success.

Bezhuang

使用 Python 程序画画

turtle 库是 Python 的标准库之一，属于入门级的图形绘制函数库，其原理是让一只海龟在画布上游走，走过的轨迹形成了绘制的图形，海龟由程序控制，可以自由改变颜色、方向宽度等。我们也可以...

Python for everybody

The Python for Everybody Specialization provided by UNIVERSITY OF MICHIGAN introduces fundamenta...