Topic 14 | Statistics & Probability

Summary statistics

Year 7 core: range, mean, median and mode; choosing a measure of central tendency; how outliers affect summary statistics.

30-40 min Printable practice Answer key
How to use this page

Read the explanation, work through the examples, then complete the core practice before printing.

Study progress: Not started

What you will learn

Worked example 0 Real-world example: which 'average' house price?

Five houses on a street sold for $420 000, $440 000, $450 000, $460 000, and $1 800 000 (a mansion).

400k600k800k1.0M1.8Moutliermean 714kmedian 450k
  1. Mean: (420+440+450+460+1800)÷5=3570÷5=714(420 + 440 + 450 + 460 + 1800) \div 5 = 3570 \div 5 = 714 (thousands).
  2. Median (middle value when sorted): 450450 (thousands) — that’s $450 000.
  3. The mansion pulls the mean up to $714 000 — far above what four of the five houses actually cost.
  4. A news headline saying “average house price $714 000” is technically correct but misleading. The median ($450 000) better represents a typical house on this street.

Key idea: one outlier can drag the mean far from the centre. That is why property reports use the median — it resists extreme values.

1. Mean

The mean adds all the values and divides by how many there are.

Mean
mean  =  sum of all valuesnumber of values.\text{mean} \;=\; \frac{\text{sum of all values}}{\text{number of values}}.
Worked example 1 Mean of five numbers

Find the mean of 7,9,11,10,87, 9, 11, 10, 8.

mean=7+9+11+10+85=455=9.\text{mean} = \frac{7 + 9 + 11 + 10 + 8}{5} = \frac{45}{5} = 9.

2. Median

The median is the middle value after sorting the data from smallest to largest. If the count is even, the median is the mean of the two middle values.

Worked example 2 Median - odd count

Find the median of 5,8,3,9,75, 8, 3, 9, 7.

  1. Sort: 3,5,7,8,93, 5, 7, 8, 9.
  2. The middle (3rd of 5) is 77.
median=7.\text{median} = 7.
Worked example 3 Median - even count

Find the median of 2,4,6,82, 4, 6, 8.

Sort: 2,4,6,82, 4, 6, 8. The two middle values are 44 and 66.

median=4+62=5.\text{median} = \frac{4 + 6}{2} = 5.

3. Mode

The mode is the value that appears most often. A data set can have one mode, two modes (bimodal), or no mode (all values unique).

1,2,2,3,4,4,4,5    mode=41, 2, 2, 3, 4, 4, 4, 5 \;\to\; mode = 4.

1,2,2,3,3,4    modes=2and31, 2, 2, 3, 3, 4 \;\to\; modes = 2 and 3 (bimodal).

4. Range

The range is a simple measure of spread.

Range
range=largestsmallest.\text{range} = \text{largest} - \text{smallest}.

5. Which measure to use?

Choosing a measure

Mean

Uses every value. Sensitive to outliers (extreme values can pull it away from the middle).

Median

Only position matters. Not sensitive to outliers - useful when the data has extreme values.

Mode

The typical or most-common value. Useful for categorical data, where mean and median do not apply.

Worked example 4 Effect of an outlier

A small business has five salaries (in thousands): $40, $45, $50, $55, $300 (the owner).

Mean=40+45+50+55+3005=4905=98.\text{Mean} = \frac{40 + 45 + 50 + 55 + 300}{5} = \frac{490}{5} = 98.

So the mean salary is $98 000. The median (middle after sorting) is $50 000.

The mean is pulled up sharply by the owner’s high salary; the median gives a better picture of “typical” pay.

Symmetricmean ≈ medianRight-skewed (e.g. salaries)medianmeantail pullsmean right
Symmetric vs skewed data. When data is symmetric, mean ≈ median. When skewed, the mean is pulled toward the tail — use the median instead.

Practice

Fluency

Tier 1: basic skills

    1. Find the mean of 4,6,8,10,124, 6, 8, 10, 12.
    2. Find the mean of 3,7,5,9,63, 7, 5, 9, 6.
    3. Find the mean of 12,14,18,2012, 14, 18, 20.
    4. Find the median of 1,3,5,7,91, 3, 5, 7, 9.
    5. Find the median of 4,8,6,2,104, 8, 6, 2, 10.
    6. Find the median of 3,5,8,103, 5, 8, 10.
    7. Find the median of 11,13,14,17,18,2011, 13, 14, 17, 18, 20.
    8. Find the mode of 2,3,3,5,7,3,92, 3, 3, 5, 7, 3, 9.
    9. Find the mode of 4,4,6,6,7,84, 4, 6, 6, 7, 8.
    10. Find the range of 2,7,5,9,32, 7, 5, 9, 3.
    11. Find the range of 45,60,72,38,5545, 60, 72, 38, 55.
    12. A data set: 10,12,14,16,18,2010, 12, 14, 16, 18, 20. Find the mean.
    13. For the set 5,5,6,8,115, 5, 6, 8, 11, find the mean, median, mode, range.
Reasoning

Tier 2: mixed practice

    Use this data for questions 1-5: 12,15,18,15,14,20,17,15,16,1812, 15, 18, 15, 14, 20, 17, 15, 16, 18.

    1. Sort the data and find the median.

    2. Find the mean.

    3. Find the mode.

    4. Find the range.

    5. If you added the value 100100 to this data set, which of mean, median, mode, range would change most? Explain briefly.

    6. Five students scored an average of 7070 on a test. Four of the scores are 65,72,68,7565, 72, 68, 75. Find the fifth score.

    7. The mean of 66 numbers is 1818. Five of them are 15,20,12,18,2215, 20, 12, 18, 22. Find the sixth.

    8. A data set has mean 1515 and 1010 values. If one value is wrongly recorded as 88 but should be 1818, what is the correct mean?

    9. Give an example of a data set with mean 1010, median 1010, mode 1010 and range 00.

    10. Give an example of a data set with 55 values where mean >> median.

Reasoning

Tier 3: explain and spot the mistake

    1. A student writes: “the mode is 1010 because 1010 is the biggest number in the list”. Explain the confusion and give the correct definition.
    2. Explain why the median is usually a better measure than the mean when a data set has a single extreme value.
    3. Can a data set have a mode but no mean? Can it have a mean but no mode? Explain both.
    4. Is the median always in the data set? Give an example where it is not.
Problem solving

Tier 4: real-world problems

    1. A cricket batter’s last 77 scores are 23,45,12,38,62,51,2923, 45, 12, 38, 62, 51, 29. Find the mean and median.
    2. A family has children aged 4,7,10,134, 7, 10, 13. Another child aged 1616 joins. Find the new mean age.
    3. The daily temperatures ( degC) for a week: 22,25,21,28,30,24,2322, 25, 21, 28, 30, 24, 23. Find the mean temperature and the range.
    4. Seven students scored an average of 6060 marks. Adding a new student with a score of 7676 changes the class size to 88. What is the new mean?
    5. A data set of 1010 values has a mean of 5050. The smallest value is 2020 and the largest is 9090. What is the mean of the middle 88 values (the 1010 values with the min and max removed)?
Answers

Answer key

Attempt the practice first. When you're ready to check, expand the answers below.

Show the full answer key

Tier 1: basic skills

Fluency

Fluency

    1. 88
    2. 66
    3. 1616
    4. 55
    5. 66. Sorted: 2,4,6,8,102, 4, 6, 8, 10.
    6. 6.56.5. Sorted: 3,5,8,103, 5, 8, 10; mean of 55 and 88.
    7. 15.515.5. Mean of middle two 1414 and 1717.
    8. 33 (appears three times)
    9. 44 and 66 (bimodal)
    10. 77. Method: 929 - 2.
    11. 3434. Method: 723872 - 38.
    12. 1515
    13. Mean 77; median 66; mode 55; range 66.

Tier 2: mixed practice

Reasoning

Mixed practice

    1. Sorted: 12,14,15,15,15,16,17,18,18,2012, 14, 15, 15, 15, 16, 17, 18, 18, 20; median =15+162=15.5= \dfrac{15 + 16}{2} = 15.5.
    2. 1616. Method: sum =160= 160; 160÷10160 \div 10.
    3. 1515 (appears three times).
    4. 88. Method: 201220 - 12.
    5. Range changes most (from 88 to 8888). The mode is unaffected. The median shifts only slightly; the mean goes up by about 7.67.6.
    6. 7070. Method: total required =5×70=350= 5 \times 70 = 350; subtract 65+72+68+75=28065 + 72 + 68 + 75 = 280.
    7. 2121. Method: total =6×18=108= 6 \times 18 = 108; sum of five given =87= 87; 10887=21108 - 87 = 21.
    8. 1616. Method: original total =15×10=150= 15 \times 10 = 150; correction +10+10 gives 160160; new mean 160÷10160 \div 10.
    9. Any data set with all values equal to 1010, e.g. {10,10,10,10}\{10, 10, 10, 10\}.
    10. Many possible. Example: 1,2,3,4,401, 2, 3, 4, 40. Mean =10= 10, median =3= 3.

Tier 3: explain and spot the mistake

Reasoning

Explain and spot the mistake

    1. The mode is the most frequently occurring value, not the largest one. The student has confused mode with maximum (the upper end of the range). Correct: the mode is whichever value appears most often; a data set can have no mode, one mode, or multiple modes.
    2. The mean uses every value, so a single extreme number can pull it noticeably up or down. The median depends only on position in the sorted list, so one outlier only shifts the middle by one rank at most - hence the median remains close to the bulk of the data when there are extreme values.
    3. Yes to both. Categorical data (e.g. eye colours) can have a mode (most common colour) but no mean - you cannot average “red”, “blue”, “green”. A data set with all distinct numerical values (e.g. 1,2,3,41, 2, 3, 4) has a mean (2.52.5) but no mode, since no value repeats.
    4. Not always. For an even-count numerical set, the median is the average of the two middle values, which may not itself be in the data. Example: median of {2,4}\{2, 4\} is 33, which is not in the set.

Tier 4: real-world problems

Problem solving

Real-world problems

    1. Mean 37.1\approx 37.1; median 3838. Method: sum =260= 260; mean =260/7= 260 / 7; sort and take the 44th value (3838).
    2. Mean age =10= 10. Method: sum =4+7+10+13+16=50= 4 + 7 + 10 + 13 + 16 = 50; 50÷550 \div 5.
    3. Mean 24.7\approx 24.7 degC (exactly 173/7173/7); range 99 degC. Method: 173÷724.71173 \div 7 \approx 24.71; 302130 - 21.
    4. New mean =62= 62. Method: previous total =7×60=420= 7 \times 60 = 420; new total =420+76=496= 420 + 76 = 496; 496÷8=62496 \div 8 = 62.
    5. Mean of middle 88 is 48.7548.75. Method: total of all 1010 is 500500; remove 20+90=11020 + 90 = 110; remaining total 390390; 390÷8390 \div 8.

Prefer paper? Print the answer key as a separate booklet: open print view ->