Measures of Central Tendency and Dispersion

In statistics, data analysis relies on two primary types of descriptive measures: those that identify the “center” of a dataset (Central Tendency) and those that describe how “spread out” the values are (Dispersion). Together, they provide a comprehensive summary of a distribution.

1. Measures of Central Tendency

These measures describe the central or typical value of a data distribution.

  • Mean (bar{x}): The arithmetic average, calculated by summing all observations and dividing by the number of observations (n). It is the most common measure but is highly sensitive to extreme outliers. bar{x} = frac{sum x_i}{n}
  • Median: The middle value when the data is arranged in ascending or descending order. It effectively divides the distribution into two equal halves. It is robust against outliers.
  • Mode: The value that appears most frequently in a dataset. A dataset may have no mode, one mode (unimodal), or multiple modes (multimodal).

2. Measures of Dispersion

These measures quantify the variability or spread of the data. A central tendency measure alone can be misleading without understanding how far the other data points are from that center.

  • Range: The difference between the highest and lowest values. It provides a quick snapshot of the total spread but is highly susceptible to outliers. text{Range} = text{Maximum Value} – text{Minimum Value}
  • Variance (sigma2 or s2): The average of the squared deviations from the mean. It measures how far each number in the set is from the mean.
  • Standard Deviation (sigma or s): The square root of the variance. It is the most widely used measure of dispersion because it is expressed in the same units as the original data, making it highly interpretable. s = sqrt{frac{sum (x_i – bar{x})^2}{n – 1}}

    3. Comparative Overview

    Measure Purpose Sensitivity to Outliers
    Mean Represents the average value High
    Median Represents the central position Low
    Mode Represents the most frequent value Low
    Range Total spread Very High
    Standard Deviation Average distance from the mean Moderate

    4. Practical Application in Research

    Understanding these measures is vital for interpreting data distributions:

    • Symmetry: In a perfectly symmetrical (normal) distribution, the Mean, Median, and Mode are identical.
    • Skewness: * Positively Skewed: The mean is typically greater than the median.
      • Negatively Skewed: The mean is typically less than the median.
    • Data Integrity: A large standard deviation indicates that the data points are spread out over a wide range of values, suggesting high variability in the population. A small standard deviation indicates that the data points are clustered closely around the mean, suggesting high consistency.

    Key Statistical Fact

    When choosing between the mean and the median, look at the distribution of your data. If the data is skewed by extreme values (e.g., income distribution where a few billionaires skew the average), the median is a much more accurate representation of the “typical” individual than the mean.

Originally written on March 29, 2015 and last modified on June 30, 2026.

1 Comment

  1. P.C.DAS .

    November 3, 2017 at 3:58 pm

    HOW DO I JOIN “GIVE IT UP ” SCHEME .? EVEN AFTER REPEATEDLY INFORMING AND FILLING UP SUBSIDY FORM SUPPLIED BY GAS SUPPLIER , I AM NOT ABLE TO GIVE IT UP .
    MY GAS SUPPLIER IS BHARAT PETROLEUM , M/S MANGALAM BHARAT GAS SERVICES , SECTOR -41 , NOIDA
    AND MY CONSUMER NO IS 18956 .

    Reply

Leave a Reply

Your email address will not be published. Required fields are marked *