Variability in Statistics: Definition & Measures
Variability is a measure of the spread of a data set. Learn more about the different measures of variability including the range, variance, and standard deviation, and the way in which they are used in the field of psychology.
What is a Variability?
Imagine that you are teaching a psychology course and you want to examine your students’ performance on the midterm and final exams. The grades of your students are as follows:
student data
Notice that there are thirteen students in the class and both the midterm and final grades are listed for each student. Notice that there is only one student who received the same grade for both the midterm and the final. You now want to know if the students’ scores on each exam are similar to each other, or if the scores are spread out. This is called variability.
Variability refers to how spread out a group of data is. In other words, variability measures how much your scores differ from each other. Variability is also referred to as dispersion or spread. Data sets with similar values are said to have little variability, while data sets that have values that are spread out have high variability.
Data set B is wider and more spread out than data set A. This indicates that data set B has more variability.
variability example
Measures of Variability
The range is the simplest measure of variability. You take the smallest number and subtract it from the largest number to calculate the range. This shows the spread of our data. The range is sensitive to outliers, or values that are significantly higher or lower than the rest of the data set, and should not be used when outliers are present.
Let’s calculate the range for midterm exam grades. The midterm grades listed in numerical order are shown here. Since the range is equal to the highest midterm grade minus the lowest midterm grade, we can easily find the range for this data set. Plugging in 100 for our highest midterm grade and 52 for our lowest midterm grade, we find that the range is equal to 100 minus 52, or 48.
Range = highest midterm grade minus lowest midterm grade
Range = 100 – 52 = 48
Measures of Variability: IQR
So what measure of variability can we use when working with sets of data that contain outliers? One solution is to use the interquartile range (IQR). The IQR, or the middle fifty, is the range for the middle fifty percent of the data. The IQR only considers middle values, so it is not affected by the outliers.
IQR = Q3 -Q1
To calculate the IQR, use the following steps:
1) List the data in numerical order. Listing the data in numerical order is necessary for finding the range and median.
2) Find the median. In a set of odd data, the median is the middle value that cuts the data into half. For example, in a set of 13 data, the median is the number in the seventh place. In a set of even data, the median is the mean of the two middle values. For this data set, the median is 85.
3) Place brackets around the numbers above and below the median but not around the median. The brackets will separate the median from Q1 and Q3. So for the midterm grades, our data will now look like this.
4) Locate the quartiles. For the midterm grades, Q1 is 52, 55, 71, 75, 81, 83. And Q3 is 89, 90, 90, 99, 100, 100.
5) Find the median of the data in Q1. We’ve got an even number of data points in Q1, so our median will be the average of the middle two numbers. In other words, we add 71 to 75 and then divide the sum by 2. The median is 73. And this is our Q1 value.
6) Find the median of the data in Q3. To find the median, repeat what was done for Q1 but with the values in Q3. We can find that Q3 is 94.5.
7) Find the interquartile range using the formula IQR = Q3 – Q1. Plugging 73 for Q1 and 94.5 for Q3, we’ll find that IQR = 94.5 – 73 = 21.5. So the interquartile range for this data set is 21.5.
Measures of Variability: Variance
The variance is a measure of how close the scores in the data set are to the mean. The variance is mainly used to calculate the standard deviation and other statistics. There are four steps to calculate the variance. Let’s use the midterm exam grades again, but this time we’ll calculate the variance.
Take another look at our midterm data.
1) Find the mean of the data set. To find the mean, add to get the sum of all the numbers in the data set. Divide the sum by 13. The mean of the midterm exam grades is 82.31.
2) Subtract the mean from each value in the data set. To do this, subtract 82.31 from each number in the data set. For example, the student’s score is 71 and 71 minus 82.31 is negative 11.31. Continue subtracting 82.31 from each number in the data set.
3) Now square each of the values so that you now have all positive values. For example, the difference we got before was negative 11.31. That number squared gives us 127.86. Continue to square each value, then add the squared values together. The sum of the squared values is 2922.77.
4) Finally, divide the sum of the squares by the total number of values in the set to find the variance. This data set has 13 numbers, so divide the sum of the squared differences by 13. The variance is 2922.77 divided by 13, or 224.83.
midterm variability