Median
#1
Posted 2008-September-30, 19:42
"If there is an even number of observations, the median is not unique, so one often takes the mean of the two middle values."
I have tracked down a few examples of calculating the median of an even number of observations and each time the mean of the two middle values is used. But, is this the only median if a strict mathematical definition of median is used?
#2
Posted 2008-September-30, 19:46
#3
Posted 2008-September-30, 19:46
Anyway, how can this question matter?
#4
Posted 2008-September-30, 19:50
cherdano, on Sep 30 2008, 08:46 PM, said:
Sometimes it's good to know the answer just for the sake of knowing the answer.
#5
Posted 2008-September-30, 22:19
- hrothgar
#6
Posted 2008-September-30, 22:44
#7
Posted 2008-October-01, 03:25
The median is not so useful for small data sets, and for large data sets it doesn't matter much.
#8
Posted 2008-October-01, 06:20
matmat, on Sep 30 2008, 11:44 PM, said:
You know what my 12 year-old daughter knows -- honestly, she looked at my 10 year-old son like he was crazy when he said it had something to do with math.
Said 10 year-old son was told by his 5th grade teacher that the median is determined by averaging the middle two elements when there are an even number of elements. Nothing really wrong with him following the prescribed procedure, but I want to know if it would be wrong to call one of the two middle elements a median (or any other value between the two middle elements).
#9
Posted 2008-October-01, 06:29
Is this standard? Wikipedia disagrees, it says the ranks should be rounded off so that the quartiles produce numbers that actually occur in the data set. It seems inconsequent that the median is the mean of the two middle observations while the other quartiles are picked among the actual observations.
#10
Posted 2008-October-01, 07:41
We don't really have separate definitions in the even and odd case this way, with an odd number of data points the interval becomes a one point interval [m,m].
In general, mathematics is governed by what might be called conditional practicality. Once you accept the idea that doing mathematics is a sensible activity, many of the definitions are practical attempts to pursue that activity.
#11
Posted 2008-October-01, 08:30
kenberg, on Oct 1 2008, 05:41 AM, said:
Isn't that only true if the x_i's are positive?
#12
Posted 2008-October-01, 08:37
TimG, on Oct 1 2008, 06:20 AM, said:
matmat, on Sep 30 2008, 11:44 PM, said:
You know what my 12 year-old daughter knows -- honestly, she looked at my 10 year-old son like he was crazy when he said it had something to do with math.
Said 10 year-old son was told by his 5th grade teacher that the median is determined by averaging the middle two elements when there are an even number of elements. Nothing really wrong with him following the prescribed procedure, but I want to know if it would be wrong to call one of the two middle elements a median (or any other value between the two middle elements).
Math course books at any level make many similar decisions to pick one definition among several plausible ones. (Well, actual math books too, for that matter.)
In fact, every time I teach calculus I teach some much more questionable definitions.
#13
Posted 2008-October-01, 09:03
TimG, on Oct 1 2008, 07:20 AM, said:
matmat, on Sep 30 2008, 11:44 PM, said:
You know what my 12 year-old daughter knows -- honestly, she looked at my 10 year-old son like he was crazy when he said it had something to do with math.
Said 10 year-old son was told by his 5th grade teacher that the median is determined by averaging the middle two elements when there are an even number of elements. Nothing really wrong with him following the prescribed procedure, but I want to know if it would be wrong to call one of the two middle elements a median (or any other value between the two middle elements).
so your median child is 11yo?
#14
Posted 2008-October-01, 09:16
matmat, on Oct 1 2008, 10:03 AM, said:
TimG, on Oct 1 2008, 07:20 AM, said:
matmat, on Sep 30 2008, 11:44 PM, said:
You know what my 12 year-old daughter knows -- honestly, she looked at my 10 year-old son like he was crazy when he said it had something to do with math.
Said 10 year-old son was told by his 5th grade teacher that the median is determined by averaging the middle two elements when there are an even number of elements. Nothing really wrong with him following the prescribed procedure, but I want to know if it would be wrong to call one of the two middle elements a median (or any other value between the two middle elements).
so your median child is 11yo?
I don't think you have a median child. Your childrens' median age is 11.
I learned it the way wikipedia defines it.
#15
Posted 2008-October-01, 09:35
jdonn, on Oct 1 2008, 10:16 AM, said:
I learned it the way wikipedia defines it.
i don't believe i ever learned this, and I don't EVER recall having to use the median. I understand the purpose for the means of various sorts, and the mode, but i could never figure out what the point of a median was, except to provide additional useless crap to put on the SAT
#16
Posted 2008-October-01, 10:37
The "most mathematical" définition is to say that the median is a number which minimizes the sum of the (absolute) distances to all the numbers of your list (and the arithmetic mean to be the one which minimizes the sum of the squares of these distances), in which case all the interval between the two middle numbers is a median, so why not take the middle of this interval ?
But there are indeed people who want the median to be one of the numbers of the list, and usually pick the smaller one. A convenient definition for this median is the smallest number for which the cumulate frequency reachs 0.5.
PS : Sorry for the maybe bad translation from French of some of the mathematical words...
#17
Posted 2008-October-01, 10:44
helene_t, on Oct 1 2008, 01:25 AM, said:
Not so sure what you mean by this. My understanding is that the median is a more robust statistic than the mean. It's been awhile, but I thought the point was that the median is robust to, for example, measurement error.
Suppose you had the following data from a poll that was taken:
1, 1, 2, 2, 2, 3, 3, 4, 40
Now the median would be 2 and the mean would be ~6.4.
Is 40 a true value or maybe one of the poll takers wrote down the wrong answer?
Another related topic is interquartile range. I was surprised to find when I started working that the IRS definition and the Quartile function on Excel use different definitions. Personally, I side with the IRS on the matter.
Treas. Reg Section 1.482-1(e)(2)(C ) said:
Quote
k=(quart/4)*(n-1))+1
If k is not an integer, truncate it but store the fractional portion (f) for use in step 3.
And where: •
• quart = value between 0 and 4 depending on which quartile you want to find
• n = number of values in the array
2. Find the smallest data point in the array of values that is greater than the kth smallest -- the (k+1)th smallest member.
3. Interpolate between the kth smallest and the (k+1)th smallest values:
Output = a[k]+(f*(a[k+1]-a[k]))
a[k] = the kth smallest
a[k+1] = the k+1th smallest
#18
Posted 2008-October-01, 11:16
matmat, on Oct 1 2008, 04:35 PM, said:
jdonn, on Oct 1 2008, 10:16 AM, said:
I don't think you have a median child. Your childrens' median age is 11.
I learned it the way wikipedia defines it.
i don't believe i ever learned this, and I don't EVER recall having to use the median. I understand the purpose for the means of various sorts, and the mode, but i could never figure out what the point of a median was, except to provide additional useless crap to put on the SAT
I find medians quite useful. They are much more robust than means (as Gnome describes above). And they make sense for ordered sets while means only make sense for real numbers (or vectors).
Modes, on the other hand, I never found a use for, but that is no doubt just related to the limited scope of my work.
Gnome: the reason why I think the median is not so useful for small data sets is that for normal distributed data it converges more slowly, i.e. for a given sample set you have a wider confidence interval for the median than for the mean.
#19
Posted 2008-October-01, 11:28
#20
Posted 2008-October-01, 11:34

Help
