Start here: why we can’t always ask everyone
Imagine you want to know the average height of all Year 8 students in Australia. There are hundreds of thousands of them. You can’t measure every single one — it would take years.
Instead, you pick a group — maybe Year 8 students chosen carefully — measure their heights, and use that to estimate the whole-country average.
That group is called a sample. The whole country of Year 8 students is called the population. This whole topic is about: how do you pick a good sample, and how much can you trust what it tells you?
What you will learn
- distinguish a population from a sample,
- recognise the main sampling methods and their biases,
- understand why bigger samples are more reliable,
- plan a simple statistical investigation that avoids obvious bias,
- report findings acknowledging uncertainty.
1. Population vs sample
The population is the entire group you want information about. A sample is a subset of the population you actually collect data from.
A census collects data from every member of the population. A sample survey collects from a subset, usually because a census would be too costly or slow.
A school surveys students from one Year 8 class to ask about screen time. What is the population? The sample?
- Population: all Year 8 students the school wants to understand (maybe the whole year level, maybe the whole school — depends on the question).
- Sample: the students who were actually surveyed.
2. Sampling methods
Common sampling methods
Every member of the population has an equal chance of being chosen. Use a random number generator or drawing from a hat.
Choose every th member after a random start. E.g. every th name on the roll.
Divide the population into groups (strata) by a feature (year level, gender, region), then sample within each group in proportion.
Divide into natural clusters (e.g. whole classes), randomly pick clusters, survey everyone in the chosen clusters.
Ask whoever is easy to reach. Fast but usually biased.
Fill preset numbers of people in predetermined categories - subjective and biased.
3. Sources of bias
A sample is biased when certain members of the population are systematically more or less likely to be sampled. Common traps:
- Selection bias: a shopping-mall survey over-represents shoppers at that mall.
- Non-response bias: people who decline to answer may differ systematically from those who do.
- Question bias: loaded wording can push respondents to a particular answer.
- Timing bias: polling only at lunchtime misses workers.
A school wants to know how many students favour longer recess. A survey is handed out at the canteen during lunch and only those who hand it back are counted.
Biases:
- Selection: lunch-goers are over-represented.
- Non-response: students in favour may be more likely to respond.
A better method: stratify by year level, pick a random sample from each stratum, follow up non-responders.
4. Sample size and variation
Different random samples of the same size will give slightly different results - but bigger samples are more stable. Doubling the sample size roughly halves the random variation.
A school of students has in favour of a uniform change. Three separate random samples of students each give percentages , , - wide variation. Three samples of might give , , - much closer to the true .
5. Planning an investigation
A basic workflow:
- Question: what do we want to find out?
- Population: whom does it apply to?
- Sampling plan: method, size, how to pick.
- Data collection: tool, timing.
- Analysis: summary statistics, displays, comparisons.
- Report: findings with uncertainty acknowledged.
Practice: Year 8 core
Population, sample, census
- A school has students. The Principal surveys every student in the school. Census or sample?
- A shop owner asks every tenth customer about satisfaction. Sampling method?
- A researcher wants to know heights of all Australian -year-olds. Census or sample? Why?
- A market researcher surveys only people in shopping centres. Name one likely bias.
- State the population and suggest a suitable sample for: “What proportion of Year 8 students at our school ride a bike to school?”
Sampling methods and bias
- Which sampling method divides the population into strata and samples from each? Stratified, cluster, or convenience?
- What type of bias arises from a survey question like “Do you agree that more homework is harmful?”
- Explain why a phone-in survey is usually biased.
- A school has Year 7, Year 8, Year 9 students. Using stratified sampling with a sample, how many from each year?
Explain and spot the mistake
- Sam claims “a sample of is enough to be certain about a school of ”. Is Sam correct? Explain.
- Explain why two random samples of the same size can give different summary statistics.
- Write an unbiased version of this question: “Don’t you agree that our coach is doing a great job?”
- A newspaper reports a poll of readers showing support a policy. What caveats should be stated before trusting the result?
Plan and analyse
- Design a statistical investigation to answer: “How much sleep do Year 8 students at our school get on a school night?” Include population, sample method, sample size, and a data display.
- A school has students. You take four random samples of and count those who cycle: . Calculate the mean percentage and comment on variability.
- A factory tests of its daily output of screws. Is a large enough sample? What factors matter?
- Two weather stations collect rainfall each day for two weeks. Station A records days; Station B records days. Which would you trust more for “average daily rainfall this fortnight”?