Standard Deviation and Variance
Why spread matters as much as the average, and how to measure it.
The problem with averages alone
Imagine two classes take the same test. Both have an average score of 70. But in Class A, every student scored between 65 and 75. In Class B, half scored 40 and half scored 100.
Same average. Completely different story.
This is why spread matters. Variance and standard deviation are the tools we use to measure it.
What variance actually measures
Variance captures how far values typically sit from the mean — but it measures that distance in a specific way: as the average of the squared deviations, not the raw distances.
For a set of values, you:
- Find the mean
- Subtract the mean from each value (the deviation)
- Square each deviation (so negatives don't cancel out)
- Average those squared deviations
Standard deviation: back to the original units
Variance is in squared units — hard to interpret. Taking the square root brings you back to the original scale.
If test scores have a standard deviation of 15 points, it tells you the typical distance from the mean is about 15 points. For a roughly bell-shaped distribution, that also means most scores fall within 15 points of the mean — but how much "most" is depends on the shape (see the 68–95–99.7 rule below).
Why this matters
- Finance: volatility of a stock is its standard deviation of returns
- Quality control: tight manufacturing tolerances mean low standard deviation
- Science: high standard deviation in measurements suggests noisy data
A low standard deviation doesn't mean good. It means consistent. Sometimes you want high variance — think brainstorming, not precision manufacturing.
Key takeaways
- Variance = average squared distance from the mean
- Standard deviation = square root of variance (same units as data)
- High std dev → values are spread out; low std dev → values cluster around the mean
- Always look at spread and center, never just the average
The Normal Distribution
Why so many things in nature cluster around a middle value — and how to read the bell curve.
The Central Limit Theorem
Why averages of random samples tend toward a normal distribution — and how this single fact makes all of classical statistics possible.
Confidence Intervals
What '95% confident' actually means — and why the most common interpretation is precisely backwards.
Enjoying this? Get notified when new concepts and articles launch.