Lesson 6

The Median

Let's explore the median of a data set and what it tells us.

6.1: The Plot of the Story

  1. Here are two dot plots and two stories. Match each story with a dot plot that could represent it. Be prepared to explain your reasoning.

    Two dot plots from 10 to 75 by 5’s. Age in years. Labeled data set A, data set B.
     
    • Twenty people—high school students, teachers, and invited guests—attended a rehearsal for a high school musical. The mean age was 38.5 years and the MAD was 16.5 years.

    • High school soccer team practice is usually watched by supporters of the players. One evening, twenty people watched the team practice. The mean age was 38.5 years and the MAD was 12.7 years.
  2. Another evening, twenty people watched the soccer team practice. The mean age was similar to that from the first evening, but the MAD was greater (about 20 years).

    Make a dot plot that could illustrate the distribution of ages in this story.

    A blank dot plot for “age in years.” The numbers 10 through 70, in increments of 5, are indicated.

6.2: Finding the Middle

  1. Your teacher will give you an index card. Write your first and last names on the card. Then record the total number of letters in your name. After that, pause for additional instructions from your teacher.

  2. Here is the data set on numbers of siblings from an earlier activity.

    1

    0

    2

    1

    7

    0

    2

    0

    1

    10

    1. Sort the data from least to greatest, and then find the median.
    2. In this situation, do you think the median is a good measure of a typical number of siblings for this group? Explain your reasoning.
  3. Here is the dot plot showing the travel time, in minutes, of Elena’s bus rides to school.

    Dot plot from 5 to 14 by 1’s. Travel time in minutes. Beginning at 5, number of dots above each increment is 0, 2, 1, 3, 3, 2, 0, 1, 0, 0.
    1. Find the median travel time. Be prepared to explain your reasoning.
    2. What does the median tell us in this context?

6.3: Mean or Median?

  1. Your teacher will give you six cards. Each has either a dot plot or a histogram. Sort the cards into two piles based on the distributions shown. Be prepared to explain your reasoning.

  2. Discuss your sorting decisions with another group. Did you have the same cards in each pile? If so, did you use the same sorting categories? If not, how are your categories different?

    Pause here for a class discussion.

  3. Use the information on the cards to answer the following questions.

    1. Card A: What is a typical age of the dogs being treated at the animal clinic?
    2. Card B: What is a typical number of people in the Irish households?
    3. Card C: What is a typical travel time for the New Zealand students?
    4. Card D: Would 15 years old be a good description of a typical age of the people who attended the birthday party?
    5. Card E: Is 15 minutes or 24 minutes a better description of a typical time it takes the students in South Africa to get to school?
    6. Card F: Would 21.3 years old be a good description of a typical age of the people who went on a field trip to Washington, D.C.?
  4. How did you decide which measure of center to use for the dot plots on Cards A–C? What about for those on Cards D–F?


Most teachers use the mean to calculate a student’s final grade, based on that student’s scores on tests, quizzes, homework, projects, and other graded assignments.

Diego thinks that the median might be a better way to measure how well a student did in a course. Do you agree with Diego? Explain your reasoning.

Summary

The median is another measure of center of a distribution. It is the middle value in a data set when values are listed in order. Half of the values in a data set are less than or equal to the median,and half of the values are greater than or equal to the median.

To find the median, we order the data values from least to greatest and find the number in the middle.

Suppose we have 5 dogs whose weights, in pounds, are shown in the table. The median weight for this group of dogs is 32 pounds because three dogs weigh less than or equal to 32 pounds and three dogs weigh greater than or equal to 32 pounds.

20

25

32

40

55

Now suppose we have 6 cats whose weights, in pounds, are as shown in the table. Notice that there are two values in the middle: 7 and 8.

4

6

7

8

10

10

The median weight must be between 7 and 8 pounds, because half of the cats weigh less or equal to 7 pounds and half of the cats weigh greater than or equal to 8 pounds.

In general, when we have an even number of values, we take the number exactly in between the two middle values. In this case, the median cat weight is 7.5 pounds because \((7+8)\div 2=7.5\).

Here is a set of 30 cookies. It has a mean weight of 21 grams, but the median weight is 23 grams.

A dot plot for “cookie weights in grams.” 

In this case, the median is closer to where most of the data points are clustered and is therefore a better measure of center for this distribution. That is, it is a better description of a typical cookie weight. The mean weight is influenced (in this case, pulled down) by a handful of much smaller cookies, so it is farther away from most data points.

In general, when a distribution is symmetrical or approximately symmetrical, the mean and median values are close. But when a distribution is not roughly symmetrical, the two values tend to be farther apart. Because the mean is fairly influenced by each value in the data set, it is generally preferred for distributions where it makes sense to use it. In cases when the distribution is less symmetric, the median is often reported as the typical value.

Glossary Entries

  • average

    The average is another name for the mean of a data set.

    For the data set 3, 5, 6, 8, 11, 12, the average is 7.5.

    \(3+5+6+8+11+12=45\)

    \(45 \div 6 = 7.5\)

  • mean

    The mean is one way to measure the center of a data set. We can think of it as a balance point. For example, for the data set 7, 9, 12, 13, 14, the mean is 11.

    data set for travel time in minutes with a mean of 11 minutes

    To find the mean, add up all the numbers in the data set. Then, divide by how many numbers there are. \(7+9+12+13+14=55\) and \(55 \div 5 = 11\).

  • mean absolute deviation (MAD)

    The mean absolute deviation is one way to measure how spread out a data set is. Sometimes we call this the MAD. For example, for the data set 7, 9, 12, 13, 14, the MAD is 2.4. This tells us that these travel times are typically 2.4 minutes away from the mean, which is 11.

    data set for travel time in minutes with a mean of 11 minutes

    To find the MAD, add up the distance between each data point and the mean. Then, divide by how many numbers there are.

    \(4+2+1+2+3=12\) and \(12 \div 5 = 2.4\)

  • measure of center

    A measure of center is a value that seems typical for a data distribution.

    Mean and median are both measures of center.

  • median

    The median is one way to measure the center of a data set. It is the middle number when the data set is listed in order.

    For the data set 7, 9, 12, 13, 14, the median is 12.

    For the data set 3, 5, 6, 8, 11, 12, there are two numbers in the middle. The median is the average of these two numbers. \(6+8=14\) and \(14 \div 2 = 7\).