Normal Distribution, Standard Deviation, and the Empirical Rule
Lumi — a curious silver fox in a data-analyst's vest — stands at a large whiteboard covered in a symmetric bell-shaped curve drawn in chalk, holding a ruler against the curve's center line and tapping each labeled band with an enthusiastic grin, surrounded by stacks of data cards labeled with measurements like heights and test scores.
- Explain what a normal distribution is and identify its key visual features on a bell curve.
- Describe how the mean and standard deviation together define the center and spread of a normal distribution.
- Apply the Empirical Rule to predict what percentage of data falls within one, two, and three standard deviations of the mean.
- Compare two normally distributed data sets using their means and standard deviations to draw conclusions about spread and consistency.
Key terms
- Normal distribution
- A symmetric bell-shaped distribution where data clusters around the mean and tapers toward the tails.
- Mean (μ)
- The central balance point of a distribution located at the peak of the bell.
- Standard deviation (σ)
- A measure of spread describing the typical distance of values from the mean.
- Empirical Rule
- The guideline that 68, 95, and 99.7 percent of data lie within one, two, and three sigma.
- Z-score
- The number of standard deviations a value lies above or below the mean.
Mean and Standard Deviation Together
Two parameters describe a normal distribution completely: the mean fixes its center and the standard deviation fixes its spread. Shifting the mean slides the whole bell left or right without changing its shape, while changing the standard deviation stretches or squeezes the bell around that center. A small sigma concentrates the data into a tall, narrow peak, signaling high consistency; a large sigma flattens and widens the curve, signaling more variability. Because the two parameters are independent, two distributions can share a mean yet differ greatly in spread, or share a spread yet sit at different centers.
Reading the Empirical Rule
The Empirical Rule gives the proportion of data captured by symmetric bands around the mean: about 68% within one standard deviation, about 95% within two, and about 99.7% within three. Because the curve is symmetric, each band splits evenly, so roughly 34% lies between the mean and one sigma above it. Subtracting nested bands reveals the slivers: about 13.5% sits between one and two sigma on each side, and only about 0.15% lies beyond three sigma in each tail. These figures let you estimate percentages without integrating the curve.
Counting Standard Deviations With Z-Scores
To judge how unusual a value is, convert it to a z-score by subtracting the mean and dividing by the standard deviation: z = (x − μ)/σ. A z-score of 0 sits exactly at the mean, a z-score of −1 is one sigma below, and a z-score of 3 is three sigma above. Counting standard deviations this way is more reliable than eyeballing distance on a graph, where unequal axis scaling can mislead. Once you know the z-score, the Empirical Rule tells you roughly what fraction of the data is more extreme.
Worked examples
Test scores are normal with μ = 500 and σ = 100; what percentage scores between 400 and 600?
- Find the z-scores: (400 − 500)/100 = −1 and (600 − 500)/100 = +1.
- The interval spans from one sigma below to one sigma above the mean.
- By the Empirical Rule, about 68% of data lies within one sigma of the mean.
Answer: About 68% of students score between 400 and 600.
For μ = 50 and σ = 5, classify how unusual the value 35 is.
- Compute the z-score: (35 − 50)/5 = −15/5 = −3.
- A z-score of −3 places the value three standard deviations below the mean.
- The Empirical Rule says fewer than about 0.15% of values fall below the −3σ mark.
Answer: 35 is three sigma below the mean, an extreme value in the rare lower tail.
Activity
Drag each measurement card to the correct band on the bell curve — within 1σ, between 1σ and 2σ, or beyond 2σ — given μ = 100 and σ = 15.
Practice
Heights are normal with μ = 170 cm and σ = 6 cm; what percent fall between 158 cm and 182 cm?
For μ = 80 and σ = 4, find the z-score of the value 90 and describe its rarity.
Common mistakes to avoid
- Standard deviation measures the averageStandard deviation measures spread around the mean, not the center itself, so two data sets can share a mean while differing in spread.
- The Empirical Rule needs the data countThe 68-95-99.7 percentages apply to any normal distribution regardless of how many data points it contains, since they describe proportions.
Check your understanding
The heights of students at a school are approximately normally distributed with a mean of 165 cm and a standard deviation of 8 cm. According to the Empirical Rule, approximately what percentage of students are between 157 cm and 173 cm tall?
Two factories produce bolts. Factory A's bolt diameters have σ = 0.5 mm; Factory B's have σ = 2.0 mm. Both factories set the same target mean diameter. Which factory produces more consistent bolts, and why?
A data set is normally distributed with μ = 50 and σ = 5. A value of 35 appears in the data. Which statement BEST describes this value using the Empirical Rule?
Recap
A normal distribution is a symmetric bell defined by its mean for center and standard deviation for spread; the Empirical Rule places about 68, 95, and 99.7 percent of data within one, two, and three standard deviations, and converting a value to a z-score reveals how rare it is.
Reflect
What measurements in your world might cluster into a bell curve around a typical value?