Week 4: Statistics and the Stories We Tell Ourselves
September 23, 2019noun
The practice or science of collecting and analysing numerical data in large quantities, especially for the purpose of inferring proportions in a whole from those in a representative sample.
SourceA mathematical function that provides the probabilities of occurrence of different possible outcomes in an experiment.
Sourcedescribes the likelihood for k successes over n trials with p probability of success where:
[1, 1, 1, 1, 6, 2, 4, 2, 9]
Mean
Median
Mode
Range
Min, Max
Variance, Standard Deviation
Percentiles
How to describe the relationship between two distributions?
Covariance
Correlation
Law of Large Numbers
The average of the results obtained from a large number of trials should be close to the expected value, and will tend to become closer as more trials are performed.
Central Limit Theorem
When independent random variables are added, their properly normalized sum tends toward a normal distribution (informally a "bell curve") even if the original variables themselves are not normally distributed.
where m is the sample maximum and k is the sample size
Reference: Data Science from Scratch
DataCamp's Statistical Thinking in Python (Part 2)