๐Ÿ“Š
๐ŸŽฒ

Statistics Literacy Basics

Core concepts for reading and reasoning about data.

๐Ÿ“ˆ
๐Ÿšซ

Correlation is not causation

Front

Two variables moving together does not prove one causes the other.

Back
๐Ÿ‘ฅ
๐Ÿ“

Sample size

Front

More data reduces random noise, but it does not fix biased sampling or bad measurement.

Back
๐Ÿ“‰
๐ŸŽฏ

Regression to the mean

Front

Extreme results tend to move closer to average on the next measurement.

Back
โ†”๏ธ
๐Ÿ“

Median

Front

The middle value; less affected by outliers than the mean.

Back
โš–๏ธ
๐Ÿงฑ

Confidence interval

Front

A range that would contain the true value in ~X% of repeated samples (given assumptions).

Back
๐Ÿงฎ
โš–๏ธ

Mean

Front

The average value; sensitive to outliers.

Back
โ†”๏ธ
๐Ÿ’จ

Variance

Front

Average squared distance from the mean; a measure of spread.

Back
๐Ÿ“
ใ€ฐ๏ธ

Standard deviation

Front

Square root of variance; typical distance from the mean.

Back
๐Ÿ‘ฝ
๐Ÿ“

Outlier

Front

A value far from others; can strongly affect averages and models.

Back
๐Ÿงบ
โŒ

Selection bias

Front

Sampling that is not representative.

Fix: check who is missing and why.

Back
๐Ÿ•ต๏ธ
๐Ÿ”—

Confounder

Front

A hidden variable that influences both cause and effect, creating a false relationship.

Back
๐ŸŽฒ
๐Ÿ”€

Randomization

Front

Assigning by chance helps balance confounders across groups.

Back
๐Ÿ›ก๏ธ
๐Ÿงช

Control group

Front

A comparison group that does not receive the treatment.

Back
๐ŸŽฒ
โ“

P-value

Front

Probability of data at least this extreme under the null hypothesis (not the probability the hypothesis is true).

Back
๐ŸŒŸ
๐Ÿ“Š

Statistical significance

Front

A threshold decision about evidence against null; not the same as importance.

Back
๐Ÿ˜
๐Ÿ“

Effect size

Front

How big the difference/relationship is (magnitude matters).

Back
๐ŸŒ
โœ…

Practical significance

Front

Whether an effect is large enough to matter in real life.

Back
๐Ÿšจ
๐Ÿคก

False positive

Front

Detecting an effect that is not real (Type I error).

Back
๐Ÿ“‰
๐Ÿง 

Base rate fallacy

Front

Ignoring prior probability when interpreting new evidence.

Back
๐Ÿงฉ
๐Ÿ”„

Simpson's paradox

Front

A trend appears in groups but reverses when groups are combined.

Back
๐Ÿ”
โ˜ฃ๏ธ

Relative vs absolute risk

Front

Relative change can mislead; always look at absolute difference too.

Back
๐ŸŒŠ
โš–๏ธ

Law of large numbers

Front

As sample size grows, averages tend to stabilize near the expected value.

Back
โ›๏ธ
๐ŸŽฃ

Data dredging (p-hacking)

Front

Trying many analyses until something is significant.

Fix: pre-register or correct for multiple tests.

Back
๐Ÿ›ค๏ธ
๐Ÿค”

Causal counterfactual

Front

Causation asks: what would happen if the same case were different in one factor?

Back