Why is an average of an average usually incorrect?

The Question :

104 people think this question is useful

Can someone explain why taking an average of an average usually results in a wrong answer? Is there ever a case where the average of the average can be used correctly?

As an example, let’s say that an assessment is given to three schools and I want to find out the average score for all three schools combined and the average score per school. When I attempt to add the three individual scores and divide by three I get a number that is very close (+/- 1 percent) to the actual overall average.

The Question Comments :
  • It’s not wrong, it needs interpretation. If school A has 10000 students, with average marks in chemistry of, say, 2.56, and school B has one pupil (ok, a bit extreme, but why not), with an average mark in chemistry of 1, then you may either want to know the average marks w.r.t. the number of schools, (2.56 + 1)/2, or w.r.t. the number of pupils, (10000*2.56 + 1)/10001. Both numbers make sense but have, of course, completely different implications and interpretations.
  • Say two of the schools are small and very good, and the third school is huge, and pretty bad. If you average over students, the good results from the small schools hardly make a dent. If you first find the average per school, and average the averages, this will tend to hide the poor results from the huge school. In the real world, disparities of size or performance are often not very dramatic, so the two procedures can end up giving fairly similar numbers. One thing we can be sure of: whoever is trying to make a point will choose the procedure that supports his/her point.
  • @AndréNicolas – You may want to post that as your answer. I feel as though you are implying that neither approach is wrong. Is this true?
  • An interesting real-world case. When a university reports the “average class size”, the result is quite different if they report the class size for the average student, or the class size for the average instructor.
  • This distortion of averages is a common trick of journalists and politicians to tilt statistical evidence to support their positions on various issues.

The Answer 1

88 people think this answer is useful

If there are $n_1$, $n_2$, and $n_3$ students in the three schools, and the average test score for each school is $a_1$, $a_2$, $a_3$, respectively, the correct average is a “weighted average”:

$$\frac{n_1}{n_1+n_2+n_3}a_1 + \frac{n_2}{n_1+n_2+n_3}a_2 + \frac{n_3}{n_1+n_2+n_3}a_3$$

The average of the averages is:

$$\frac{1}{3}a_1 + \frac{1}{3}a_2 + \frac{1}{3}a_3$$

These two values will be exactly the same if each school has exactly the same number of students, and will tend to be “close” if the schools are relatively close in size and/or the scores for the three schools are close.

If a school system put all the smart students at a single school, they could bump up the second value – the “average of averages” – but they couldn’t do that if they take the correct weighted average.
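
To make the difference concrete, here is a minimal Python sketch comparing the two values. The school sizes and scores are made-up numbers chosen only for illustration, and the function names `average_of_averages` and `weighted_average` are hypothetical helpers, not part of any library.

```python
def average_of_averages(averages):
    """Treat every school equally, regardless of its size."""
    return sum(averages) / len(averages)


def weighted_average(sizes, averages):
    """Weight each school's average by its number of students."""
    total_students = sum(sizes)
    return sum(n * a for n, a in zip(sizes, averages)) / total_students


# Hypothetical data: two small schools with high scores, one large school with low scores.
sizes = [100, 200, 700]
averages = [90.0, 80.0, 60.0]

print(average_of_averages(averages))      # about 76.67
print(weighted_average(sizes, averages))  # 67.0

# With equal school sizes the two values agree (up to floating-point rounding).
equal_sizes = [50, 50, 50]
print(weighted_average(equal_sizes, averages))  # about 76.67
```

In this made-up case the large, low-scoring school pulls the weighted average almost ten points below the average of averages.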

The Answer 2

52 people think this answer is useful

For example:
the average of a set of $13$ numbers that sum to $26$ is $2$ ($N=13$), and the average of $\{4\}$ is $4$ ($N=1$). The average of the averages is $3$.
But the average of all the numbers is $30/14 \approx 2.14$.

I hope this is enough to explain what goes wrong (you’re giving equal weights to the “first averages” when you take their average, which isn’t the correct thing to do if you want the average of all the numbers).
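
The arithmetic can be checked with a short Python sketch. The contents of the first group are an assumption here (thirteen 2s), since only its size, 13, and its average, 2, matter for the result; `mean` is a small hypothetical helper that uses exact fractions to avoid rounding.

```python
from fractions import Fraction


def mean(values):
    """Exact arithmetic mean of a non-empty list of integers."""
    return Fraction(sum(values), len(values))


group_a = [2] * 13  # assumed contents: any 13 numbers averaging 2 would do
group_b = [4]

# Equal weights: each group's average counts once.
mean_of_means = (mean(group_a) + mean(group_b)) / 2

# Average over every individual number.
overall_mean = mean(group_a + group_b)

# Weighting each group's average by its size recovers the overall average.
weighted = (len(group_a) * mean(group_a) + len(group_b) * mean(group_b)) / (
    len(group_a) + len(group_b)
)

print(mean_of_means)        # 3
print(overall_mean)         # 15/7
print(float(overall_mean))  # roughly 2.14
print(weighted == overall_mean)  # True
```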

The Answer 3

25 people think this answer is useful

Thomas Andrews already answered the question, but I’d like to present a more analytical solution to the problem.

The average of averages is only equal to the average of all values in two cases:

  1. if the number of elements of all groups is the same; or
  2. the trivial case when all the group averages are equal (for example, all zero)

Here’s why this is so.

Consider two sets $X = \{x_1, x_2, \ldots, x_n\}$ and $Y = \{y_1, y_2, \ldots, y_m\}$ and their averages:

$$ \bar{x} = \frac{\sum_{i=1}^{n}{x_i}}{n} \,,\quad
\bar{y} = \frac{\sum_{i=1}^{m}{y_i}}{m} $$

The average of the averages is:

$$ \operatorname{average}(\bar{x}, \bar{y})
= \frac{\frac{\sum_{i=1}^{n}{x_i}}{n} + \frac{\sum_{i=1}^{m}{y_i}}{m}}{2}
= \frac{\sum_{i=1}^{n}{x_i}}{2n} + \frac{\sum_{i=1}^{m}{y_i}}{2m} $$

Now consider the whole group
$Z = \{x_1, x_2, \ldots, x_n, y_1, y_2, \ldots, y_m\}$ and its average:

$$ \bar{z} = \frac{\sum_{i=1}^{n}{x_i} + \sum_{i=1}^{m}{y_i}}{n + m}$$

For the general case, we can see that these averages are different:

$$ \frac{\sum_{i=1}^{n}{x_i}}{2n} + \frac{\sum_{i=1}^{m}{y_i}}{2m}
\ne \frac{\sum_{i=1}^{n}{x_i} + \sum_{i=1}^{m}{y_i}}{n + m} $$

This answers the OP’s first question: why the average of averages usually gives the wrong answer.

However, if we make $n = m$, we have:

$$ \frac{\sum_{i=1}^{n}{x_i}}{2n} + \frac{\sum_{i=1}^{n}{y_i}}{2n}
= \frac{\sum_{i=1}^{n}{x_i} + \sum_{i=1}^{n}{y_i}}{2n} $$

This is why the average of averages is equal to the average of the whole group when the groups have the same size.

The second case is trivial: if $\bar{x} = \bar{y}$ (for example, both zero), then $\operatorname{average}(\bar{x}, \bar{y}) = \bar{z} = \bar{x}$.

Note that the above reasoning can be extended for any number of groups.
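
As a supplementary check for the two-group case, the gap between the two quantities can be written in closed form. This is a sketch using only the symbols defined above, together with the identity $\bar{z} = \frac{n\bar{x} + m\bar{y}}{n + m}$, which follows from $\sum_{i=1}^{n}{x_i} = n\bar{x}$ and $\sum_{i=1}^{m}{y_i} = m\bar{y}$:

$$ \frac{\bar{x} + \bar{y}}{2} - \bar{z}
= \frac{(n + m)(\bar{x} + \bar{y}) - 2n\bar{x} - 2m\bar{y}}{2(n + m)}
= \frac{(m - n)(\bar{x} - \bar{y})}{2(n + m)} $$

For two groups the difference therefore vanishes exactly when $n = m$ or $\bar{x} = \bar{y}$. As a sanity check, with $n = 13$, $\bar{x} = 2$, $m = 1$, $\bar{y} = 4$, the formula gives $\frac{(1 - 13)(2 - 4)}{2 \cdot 14} = \frac{6}{7}$, which matches $3 - \frac{30}{14}$.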

