When you’ve got a set of data like a big list of numbers, range and outliers tell you which values don’t appear right, and how spread out they are.

Are all the values bunched up or scattered all over? The easiest measure of spread is the range. Understanding outliers lets us know which ones we should probably ignore, as those data points are very strange or have been recorded incorrectly.


TLDR


What You Need to Know (Taken From The Specification)

This topic connects to other statistical concepts like measures of central tendency (mean, median, mode) and graphical representations of data.


Range – The Basics

What is the Range, How do we Calculate it

The range is the simplest measure of spread. It’s just the difference between the maximum and minimum values in your data set.

Range = Maximum value – Minimum value

That’s it – nothing more complicated than that. Just find the biggest number, find the smallest number, and subtract.

Example 1

Let’s find the range of this data set: 5, 8, 12, 15, 16, 18, 22

Step 1: Find the maximum value.

Maximum = 22

Step 2: Find the minimum value.

Minimum = 5

Step 3: Calculate the range.

Range = Maximum – Minimum = 22 – 5 = 17

So the range of this data set is 17.


Example 2

Now let’s look at a slightly messier example with decimal values.

Find the range of these test scores: 65.5, 71.2, 68.7, 82.9, 75.0, 69.3

Step 1: Find the maximum value.

Maximum = 82.9

Step 2: Find the minimum value.

Minimum = 65.5

Step 3: Calculate the range.

Range = Maximum – Minimum = 82.9 – 65.5 = 17.4

So the range of the test scores is 17.4.


Outliers

What are Outliers?

Outliers are values that are unusually high or low compared to the rest of the data. They sit away from the main cluster of values and can skew your statistics.

The usual rule for what an outlier is, is a value which is 1.5 x the interquartile range (IQR) below the first quartile, or 1.5 x the IQR above the 3rd quartile.

In reality there’s no single rule for what counts as an outlier – it depends on the context of your data. But an outlier is always a value that’s very different from most of the values in your data set. Unless told otherwise, stick to the 1.5 x IQR.

Identifying Outliers

There are several ways to identify outliers:

  1. Visual inspection – plot the data and see if any points lie far away from the rest on the graph
  2. Common sense and context – if you know what values are reasonable for your data
  3. Statistical methods – like the 1.5 × IQR rule (you’ll learn this in the higher tier)

Example 3

Let’s look at this data set: 15, 17, 16, 18, 14, 16, 42

Most values are clustered between 14 and 18, but 42 is way higher. It doesn’t fit the pattern, so 42 is an outlier.

If we calculate the range including the outlier:

Range = 42 – 14 = 28

But if we remove the outlier:

Range = 18 – 14 = 4

A massive difference! The range with the outlier is 7 times larger than without it.


Example 4

Consider these temperatures (°C) recorded over a week: 22, 24, 23, 25, -2, 21, 23

The value -2 looks really weird compared to the others. Let’s think about it:

Without more context, we’d typically consider -2 an outlier.

Range with the outlier: 25 – (-2) = 27

Range without the outlier: 25 – 21 = 4

So the outlier here massively affects the range. Bearing in mind the size of the change and the short time frame, we should probably ignore it. It’s almost unheard of for the temperature to plunge from summer to winter levels for one day of the week!

From these two examples you can see the range is extremely sensitive to outliers because it only uses the two most extreme values in the data set. Just one unusual value can completely change the range.


A Weakness of Range as a Measure of Spread

This is the main weakness of the range as a measure of spread. It doesn’t tell you anything about how the data is distributed between the extremes.

Example 5

Consider these two data sets:

Data Set A: 10, 11, 12, 13, 14, 15, 16

Data Set B: 10, 10, 10, 13, 16, 16, 16

Both have the same range (16 – 10 = 6), but the distribution of values is completely different:

This shows why the range alone doesn’t give you the complete picture of how data is spread out.


Reporting Range With and Without Outliers

When you have outliers in your data, it’s often useful to report the range both with and without them.

For example: “The range of temperatures was 27°C, but excluding the unusually cold day, the range was 4°C.”

This gives a more complete picture of the data’s spread and acknowledges the unusual values without letting them dominate the description.


Common Mistakes to Avoid

  1. Forgetting to subtract:

  2. Incorrect: Just giving the maximum and minimum values (e.g., “the range is 5 to 22”)

  3. Correct: Subtract minimum from maximum to get a single value (e.g., “the range is 17”)

  4. Ignoring outliers completely:

  5. Incorrect: Removing outliers without mentioning them

  6. Correct: Acknowledge outliers and consider calculating the range both with and without them

  7. Confusing range with other measures:

  8. Incorrect: Mixing up range with interquartile range or standard deviation

  9. Correct: Remember that range is just max – min

  10. Not considering context:

  11. Incorrect: Blindly identifying values as outliers without considering if they’re reasonable

  12. Correct: Use your knowledge of the context to help judge whether extreme values are outliers

  13. Thinking the range tells the whole story:

  14. Incorrect: Relying only on the range to describe spread

  15. Correct: Recognize that the range is limited because it only uses two values


Questions

Try these questions to practice working with range and outliers:

  1. Calculate the range of this data set:

    7, 12, 18, 21, 25, 30, 32

  2. For the following data set, identify any outliers and calculate the range both with and without them:

    15, 18, 17, 16, 19, 62, 14

  3. The heights (in cm) of plants in a garden are:

    25, 28, 30, 32, 29, 31, 27, 5

a) Calculate the range of the heights.

b) Is there an outlier? Explain your reasoning.

c) If there is an outlier, calculate the range without it.

  1. Two students measured the temperature of water samples. Their results were:

Student A: 22°C, 23°C, 21°C, 24°C, 22°C

Student B: 10°C, 35°C, 22°C, 23°C, 20°C

a) Calculate the range for each student’s measurements.

b) Which student’s measurements show greater consistency? Explain your answer.


Solutions

Question 1

Range = Maximum – Minimum = 32 – 7 = 25

Question 2

The data set is: 15, 18, 17, 16, 19, 62, 14

Most values are between 14 and 19, but 62 is much higher. This is clearly an outlier.

Range with the outlier:

Range = 62 – 14 = 48

Range without the outlier:

Range = 19 – 14 = 5

Question 3

a) Range = Maximum – Minimum = 32 – 5 = 27

b) Yes, 5 cm appears to be an outlier because:

c) Range without the outlier:

Range = 32 – 25 = 7

Question 4

a) Student A’s range:

Range = 24 – 21 = 3°C

Student B’s range:

Range = 35 – 10 = 25°C

b) Student A’s measurements show greater consistency because:


Summary

Remember, in statistics, no single measure tells the complete story. The range is just one number that describes data. It isn’t even the only measure of spread. It works best when used alongside other measures like the mean, median, and mode.

Leave a Reply

Your email address will not be published. Required fields are marked *