## Variance

Variance is a statistical measure that represents the dispersion or spread of a set of data points. In simpler terms, it quantifies how far each data point in the set is from the mean (average) and is calculated as the average of the squared differences from the mean. A low variance indicates that the data points tend to be close to the mean and to each other, while a high variance indicates that the data points are spread out over a wider range.

The formula to calculate the variance (Ïƒ2Ïƒ2) for a population of NN data points x1,x2,…,xNx1â€‹,x2â€‹,…,xNâ€‹ with mean Î¼Î¼ is:

Ïƒ2 = âˆ‘^{N}_{i=1} (x_{i} âˆ’ Î¼)^{2} / N

For a sample of nn data points, the formula for sample variance (s2s2) is slightly different:

s2 = âˆ‘^{n}_{i=1} (x_{i} âˆ’ xË‰)^{2} / n âˆ’ 1

Note that in the sample variance formula, we divide by nâˆ’1nâˆ’1 instead of nn. This is known as Bessel’s correction and it corrects the bias in the estimation of the population variance from a sample.

## Example of Variance

Let’s consider an example using exam scores to demonstrate how to calculate variance.

**Data set**

Imagine a class of 5 students who received the following scores on a math exam:

{70,80,85,90,100}

**Step 1: Calculate the Mean (Average)**

To calculate the mean, sum up all the scores and divide by the number of students:

Mean (xË‰)= 70 + 80 + 85 + 90 + 100 / 5 = 425 / 5 = 85

**Step 2: Calculate the Squared Differences from the Mean**

Next, find the squared difference of each score from the mean:

- (70 âˆ’ 85)
^{2}= (âˆ’15)^{2}= 225 - (80 âˆ’ 85)
^{2}= (âˆ’5)^{2}= 25 - (85 âˆ’ 85)
^{2}= 0^{2}= 0 - (90 âˆ’ 85)
^{2}= 5^{2}= 25 - (100 âˆ’ 85)
^{2}= 15^{2}= 225

**Step 3: Calculate the Variance**

Finally, sum up these squared differences and divide by nâˆ’1nâˆ’1, where nn is the number of data points (in this case, n=5n=5):

SampleÂ Variance(s^{2}) = 225 + 25 + 0 + 25 + 225 / 5 âˆ’ 1 = 500 / 4 = 125

So, the variance of the exam scores for these 5 students is 125.

**Interpretation**

A variance of 125 means that the scores are dispersed around the mean to that degree. In this example, the higher variance suggests that there is a larger spread among the students’ exam scores. This could be indicative of different levels of preparation or understanding of the material among the students.

This example should give you a good understanding of how to calculate and interpret variance.