Background
The median is a fundamental statistical concept used to represent the central tendency of a dataset. Specifically, it identifies the value below and above which half of the observations in a dataset fall.
Historical Context
The application of the median dates back to the earliest recordings of statistical methodology, and it has been widely used in various disciplines, including economics, to provide a robust measure of central tendency free from the distortions that can affect the mean.
Definitions and Concepts
The median of a distribution of a random variable \( X \) is defined as the value \( m \) such that:
- \( P[X \leq m] \geq \frac{1}{2} \), and
- \( P[X \geq m] \geq \frac{1}{2} \).
For samples:
- When the sample size is odd, the median is the middle value of the ordered data.
- When the sample size is even, the median is the average of the two central values.
As a measure of central tendency, the median is often preferred over the mean because it is not influenced by outliers.
Major Analytical Frameworks
Classical Economics
In classical economics, measures such as the median are often used for the analysis of income distribution, wealth distribution, and consumption patterns to understand economic behaviors and outcomes.
Neoclassical Economics
In neoclassical economics, the median can signify the centrality of agent behaviors, optimizing decisions, and preference distributions, particularly in analyzing how typical agents behave without the skewness that outliers introduce.
Keynesian Economic
Keynesian economics may employ the median to analyze typical expenditure behavior within an economy, particularly in the study of consumption across different income brackets.
Marxian Economics
In Marxian economics, the median can indicate class distinctions and economic disparities without the distortion of few extreme wealth holders, reflecting more accurately the condition of the median worker or household.
Institutional Economics
Institutional economists use the median to study norms and behaviors across various institutions and how they affect economic outcomes on a typical basis.
Behavioral Economics
Behavioral economists consider the median crucial in understanding decision-making processes, presenting a typical scenario excluding extreme behaviors which might bias analysis.
Post-Keynesian Economics
Post-Keynesian analysts might focus on the median to study income and wealth inequality showing reliance on concrete median values rather than means which could be skewed due to high-income outliers.
Austrian Economics
Austrian economists could use the median to elaborate on entrepreneurial behavior and market-clearing prices in the absence of distortions contributed by extreme values.
Development Economics
In development economics, the median assesses typical standard-of-living and economic well-being metrics, which are more resistant to skewing effects than mean values.
Monetarism
Monetarists might use median values to gauge aggregate values that avoid being influenced by occasional extreme anomalies, likely for money supply and interest rate distributions analysis.
Comparative Analysis
Comparative analysis using the median involves comparison across different datasets or subgroups within a dataset to identify differences and similarities in central tendency unaffected by outliers.
Case Studies
- Income Distribution: A study showing how the median income provides a better representation of central tendency than the mean due to the presence of extremely high or low incomes.
- Healthcare Spending: Analysis of median healthcare expenses can provide a more realistic summary of typical costs compared to the mean which could be misrepresentative due to high-cost outliers.
Suggested Books for Further Studies
- The Visual Display of Quantitative Information by Edward R. Tufte
- Introduction to the Theory of Statistics by Mood, Graybill, and Boes
- Statistics for Business and Economics by Paul Newbold, William L. Carlson, and Betty Thorne
Related Terms with Definitions
- Mean: The arithmetic average of a set of values.
- Mode: The value that appears most frequently in a data set.
- Outliers: Observations that significantly differ from the observations in their dataset.
- Quartiles: Values that divide a data set into four equal parts.
$$$$
## Quiz
### The median of the dataset [3, 7, 9, 15, 21] is:
- [ ] 7
- [x] 9
- [ ] 15
- [ ] 21
> **Explanation:** Since there are five numbers, the median is the third number when ordered from least to greatest.
### For the dataset [4, 16, 8, 10, 24, 14], the median is:
- [ ] 12.5
- [ ] 20
- [x] 12
- [ ] 14
> **Explanation:** The ordered dataset [4, 8, 10, 14, 16, 24] has an even number of values. Therefore, the median is the average of the two central numbers (10 and 14), which is 12.
### The main difference between median and mean is:
- [x] Median is resistant to outliers, mean is not
- [ ] Mean represents the middle value of data
- [ ] Median cannot be computed for even sample sizes
- [ ] Median considers all data points
> **Explanation:** The median is resistant to outliers, which is not a property of the mean.
### True or False: The median is the most frequently occurring value in a dataset.
- [ ] True
- [x] False
> **Explanation:** The most frequently occurring value in a dataset is known as the mode, not the median.
### The term "median" is derived from:
- [ ] Greek
- [x] Latin
- [ ] French
- [ ] Spanish
> **Explanation:** The term "median" derives from the Latin word "medianus," which means "in the middle."
### For the dataset {2, 3, 3, 6, 7, 8, 12}, the median is:
- [ ] 3
- [ ] 6
- [x] 7
- [ ] 8
> **Explanation:** The dataset ordered in numerical order is {2, 3, 3, 6, 7, 8, 12}, making the median 7, the middle value.
### If a dataset is {8, 15, 7, 9, 12}, what should you do first to find the median?
- [x] Order the data
- [ ] Calculate the mean
- [ ] Sum all numbers
- [ ] Find the mode
> **Explanation:** To find the median, you must first order the data.
### In a symmetrical distribution, the median and the mean are:
- [x] Usually the same
- [ ] Usually different
- [ ] Unrelated
- [ ] Both robust
> **Explanation:** In a symmetrical distribution, the mean and median are typically the same.
### When is the median particularly useful?
- [ ] When data is normally distributed
- [x] When data contains outliers
- [ ] For very large datasets
- [ ] For categorical data
> **Explanation:** The median is particularly useful when data contains outliers since it is not affected by extremely large or small values.
### The median of the dataset {1, 3, 5, 7, 9, 11} is:
- [ ] 5
- [x] 6
- [ ] 7
- [ ] 8
> **Explanation:** With an even number of observations, the dataset's median is the average of 5 and 7, which is 6.