Data handling and probability (Grade 8) – Week 8 focus
Download the Lessonotes Mobile South Africa app for faster lesson access on Android and iPhone.
Subject: Mathematics
Class: Grade 8
Term: Term 4
Week: 8
Theme: General lesson support
This page supports the lesson note with a companion video and a short classroom-ready summary.
For class groups and homework, share this lesson page so learners also get the summary, objectives, and full lesson context.
This week, we delve into the fascinating world of Data Handling and Probability. This topic is crucial not just for mathematics, but for understanding the world around us. From interpreting crime statistics in news reports to making informed choices about lottery tickets, the skills you learn here will be invaluable throughout your life. In South Africa, understanding data helps us analyse social issues like unemployment, education levels, and healthcare access, enabling us to make informed decisions and advocate for positive change.
2.1 Measures of Central Tendency: These measures aim to find the "typical" value in a dataset.
We'll focus on three: Mean: The mean is the average of all the values in a dataset. To calculate the mean, you sum all the values and divide by the number of values.
Formula: Mean = (Sum of all values) / (Number of values)
Example: Consider the ages of 5 learners in your class: 13, 14, 13, 15,
1
3. Sum = 13 + 14 + 13 + 15 + 13 = 68 Number of values = 5 Mean = 68 / 5 = 13.6 years Why it matters:* The mean tells you the average age of the learners.
Median: The median is the middle value in a dataset when the dataset is arranged in ascending order. If there are an even number of values, the median is the average of the two middle values.
Example 1 (Odd number of values): Using the same ages: 13, 14, 13, 15,
1
3. Arrange in ascending order: 13, 13, 13, 14, 15 The middle value is
1
3. Therefore, the median is 13 years.
Example 2 (Even number of values): Consider the heights (in cm) of 6 soccer players: 160, 165, 170, 175, 180,
1
8
5. Already in ascending order. The two middle values are 170 and
1
7
5. Median = (170 + 175) / 2 = 172.5 cm Why it matters:* The median gives you the 'middle' height or age, which is less sensitive to extreme values than the mean. Imagine one very tall rugby player was added - the median would change less than the mean.
Mode: The mode is the value that appears most frequently in a dataset. A dataset can have no mode, one mode (unimodal), or multiple modes (bimodal, trimodal, etc.).
Example: Again, using the ages 13, 14, 13, 15,
1
3. The age 13 appears 3 times, which is more than any other age.
Therefore, the mode is 13 years.
Why it matters:* The mode tells you the most common value. In retail, the mode helps determine which shoe size is most popular and therefore needs to be stocked the most. 2.2 Measures of Dispersion: These measures describe how spread out the data is.
Range: The range is the difference between the highest and lowest values in a dataset.
Formula: Range = Highest Value - Lowest Value
Example: Consider exam scores out of 100: 50, 60, 70, 80, 90 Highest value = 90 Lowest value = 50 Range = 90 - 50 = 40 Why it matters:* The range gives you a quick idea of the overall spread. A small range indicates that the data points are clustered close together, while a large range indicates that they are more spread out. In a school, a smaller range on a test might indicate more consistent student understanding.
Interquartile Range (IQR): The IQR is a measure of spread based on quartiles. Quartiles divide the ordered data into four equal parts.
Q1 (First Quartile):* The median of the lower half of the data.
Q2 (Second Quartile):* The median of the entire data set.
Q3 (Third Quartile):* The median of the upper half of the data.
Formula: IQR = Q3 - Q1
Example: Consider the data: 2, 4, 6, 8, 10, 12, 14, 16, 18, 20. n = 10 (even) Q2 (Median) = (10 + 12) / 2 = 11 Lower half: 2, 4, 6, 8, 10 Q1 = 6 (median of the lower half)
Upper half: 12, 14, 16, 18, 20 Q3 = 16 (median of the upper half) IQR = Q3 - Q1 = 16 - 6 = 10 Why it matters:* The IQR is less sensitive to outliers (extreme values) than the range. It tells you the spread of the middle 50% of the data. This is useful when analyzing income data, as a few very high earners can skew the range, but have less effect on the IQR. 2.3 Data Representation: Histograms: Histograms are used to represent continuous data. They are similar to bar graphs, but the bars touch each other to show the continuous nature of the data. Histograms group data into ranges (bins).
Example:* Imagine recording the heights of all Grade 8 learners. You could group the heights into ranges like 140-149cm, 150-159cm, 160-169cm, etc., and then draw a histogram showing the number of learners in each height range.
Interpreting Histograms:* The shape of the histogram can tell you a lot about the data. A symmetrical histogram indicates that the data is evenly distributed, while a skewed histogram indicates that the data is concentrated more on one side.
Pie Charts: Pie charts are used to represent categorical data. They show the proportion of each category as a slice of a circle. The size of each slice is proportional to the percentage of the category. The entire pie represents 100%.
Example:* A survey of favourite sports among Grade 8 learners.
Soccer: 40% Rugby: 30% Netball: 20% Cricket: 10% The pie chart would have four slices, representing each sport. The soccer slice would be the largest (40% of the circle), and the cricket slice would be the smallest (10% of the circle).
Interpreting Pie Charts:* Pie charts allow for quick visual comparison of proportions. It is easy to see which category is the largest and smallest. 2.4 Probability: Probability is the measure of the likelihood that an event will occur.
Calculating Probability: The probability of an event is calculated by dividing the number of favourable outcomes by the total number of possible outcomes.