Variance and standard deviation — calculate for discrete frequency distribution

medium CBSE-11JEE-MAIN NCERT Class 11 3 min read

Question

The marks of 30 students in a test are given as a frequency distribution:

Marks (xix_i)1020304050
Frequency (fif_i)461073

Calculate the variance and standard deviation.

(NCERT Class 11, Statistics)


Solution — Step by Step

xˉ=fixifi\bar{x} = \frac{\sum f_i x_i}{\sum f_i}
xix_ifif_ifixif_i x_i
10440
206120
3010300
407280
503150
Total30890
xˉ=89030=29.67\bar{x} = \frac{890}{30} = 29.67
xix_ifif_idi=xixˉd_i = x_i - \bar{x}di2d_i^2fidi2f_i d_i^2
10419.67-19.67386.911547.64
2069.67-9.6793.51561.06
30100.330.330.111.10
40710.3310.33106.71746.97
50320.3320.33413.311239.93
Total304096.70
σ2=fidi2fi=4096.7030=136.56\sigma^2 = \frac{\sum f_i d_i^2}{\sum f_i} = \frac{4096.70}{30} = \mathbf{136.56} σ=σ2=136.56=11.69\sigma = \sqrt{\sigma^2} = \sqrt{136.56} = \mathbf{11.69}

The standard deviation tells us that marks are spread about 11.69 units around the mean of 29.67.


Why This Works

Variance measures the average of squared deviations from the mean. We square the deviations because positive and negative deviations would cancel out if we simply averaged them. Squaring ensures all terms are positive.

Standard deviation is the square root of variance — it brings the measure back to the original units (marks, in this case). A large standard deviation means data is widely spread; a small one means data is tightly clustered around the mean.

The frequency fif_i acts as a weight — values that appear more often contribute more to the variance. This is why we multiply each di2d_i^2 by fif_i before summing.


Alternative Method

Use the shortcut formula to avoid calculating deviations:

σ2=fixi2fi(fixifi)2=fixi2Nxˉ2\sigma^2 = \frac{\sum f_i x_i^2}{\sum f_i} - \left(\frac{\sum f_i x_i}{\sum f_i}\right)^2 = \frac{\sum f_i x_i^2}{N} - \bar{x}^2

Compute fixi2=4(100)+6(400)+10(900)+7(1600)+3(2500)=400+2400+9000+11200+7500=30500\sum f_i x_i^2 = 4(100) + 6(400) + 10(900) + 7(1600) + 3(2500) = 400 + 2400 + 9000 + 11200 + 7500 = 30500

σ2=3050030(29.67)2=1016.67880.31=136.36\sigma^2 = \frac{30500}{30} - (29.67)^2 = 1016.67 - 880.31 = 136.36

(Small difference due to rounding in xˉ\bar{x}.)

The shortcut formula σ2=x2xˉ2\sigma^2 = \overline{x^2} - \bar{x}^2 (mean of squares minus square of mean) is faster for calculations. For JEE, also know the step deviation method where you use ui=(xiA)/hu_i = (x_i - A)/h with assumed mean AA and class width hh to simplify the arithmetic further.


Common Mistake

Students often forget to divide by NN (total frequency) at the end and instead divide by the number of distinct values (5 in this case). The divisor for variance is always N=fiN = \sum f_i (total number of observations), not the number of classes. With 30 students, you divide by 30, not by 5.

Want to master this topic?

Read the complete guide with more examples and exam tips.

Go to full topic guide →

Try These Next