📊 Statistics — Class 10

Mean, Mode, Median of grouped data, and Ogive curves

1. Introduction

In Class 9, we studied raw data (ungrouped). In Class 10, we deal with grouped data presented in frequency distribution tables. We find three measures of central tendency: Mean, Mode, and Median.

2. Mean of Grouped Data

📖 Method 1 — Direct Method

x̄ = Σfᵢxᵢ / Σfᵢ

where xᵢ = class mark (midpoint) = (upper limit + lower limit)/2

fᵢ = frequency of each class

📖 Method 2 — Assumed Mean Method (Shortcut)

x̄ = a + (Σfᵢdᵢ / Σfᵢ)

where a = assumed mean (any class mark), dᵢ = xᵢ – a

📖 Method 3 — Step Deviation Method

x̄ = a + (Σfᵢuᵢ / Σfᵢ) × h

where uᵢ = (xᵢ – a)/h, h = class size (width)

Best used when class widths are equal and numbers are large.

💡 Example: Mean by Direct Method

Find mean of the following data:

ClassFrequency (f)Midpoint (x)fx
0–105525
10–20815120
20–301525375
30–401035350
40–50745315
Total451185

Mean = Σfx / Σf = 1185 / 45 = 26.33

3. Mode of Grouped Data

📖 Modal Class and Formula

The modal class is the class with the highest frequency.

Mode = l + [(f₁ – f₀) / (2f₁ – f₀ – f₂)] × h

where:
l = lower boundary of modal class
f₁ = frequency of modal class
f₀ = frequency of class before modal class
f₂ = frequency of class after modal class
h = class width

💡 Example: Finding Mode

Using the above data: Modal class = 20–30 (highest f = 15)

l = 20, f₁ = 15, f₀ = 8, f₂ = 10, h = 10

Mode = 20 + [(15–8)/(2×15–8–10)] × 10

= 20 + [7/(30–18)] × 10 = 20 + (7/12) × 10 = 20 + 5.83 = 25.83

4. Median of Grouped Data

📖 Cumulative Frequency and Formula

First, create a cumulative frequency table. The median class is the class where cumulative frequency first exceeds n/2 (n = total frequency).

Median = l + [(n/2 – cf) / f] × h

where:
l = lower boundary of median class
n = total frequency
cf = cumulative frequency before the median class
f = frequency of median class
h = class width

💡 Example: Finding Median

ClassfCumulative f (cf)
0–1055
10–20813
20–301528
30–401038
40–50745

n = 45, n/2 = 22.5

Median class: cf first exceeds 22.5 → class 20–30 (cf = 28 > 22.5)

l = 20, cf = 13, f = 15, h = 10

Median = 20 + [(22.5 – 13)/15] × 10 = 20 + [9.5/15] × 10 = 20 + 6.33 = 26.33

5. Graphical Representation — Ogive

📖 Ogive (Cumulative Frequency Curve)

An ogive is a graph drawn by plotting cumulative frequencies against class boundaries.

Less Than Ogive: Plot (upper boundary, cumulative frequency). Curve goes upward.

More Than Ogive: Plot (lower boundary, total cf – previous cf). Curve goes downward.

The intersection of both ogives gives the Median.

⚡ Empirical Relationship

Mode = 3 × Median – 2 × Mean

This formula is approximate and holds for moderately skewed distributions. It's useful for finding one measure when the other two are known.

🔑 When to Use Which Measure

  • Mean: Best for symmetric data; uses all data values. Affected by extreme values.
  • Median: Best for skewed data or when outliers are present; not affected by extremes.
  • Mode: Best for categorical data; most frequently occurring value.

📋 Statistics Formula Sheet

Mean (Direct)

x̄ = Σfᵢxᵢ / Σfᵢ

Mean (Step Deviation)

x̄ = a + (Σfᵢuᵢ/Σfᵢ) × h

uᵢ = (xᵢ – a) / h

Mode

l + [(f₁–f₀)/(2f₁–f₀–f₂)] × h

Median

l + [(n/2 – cf)/f] × h

Empirical Relation

Mode = 3×Median – 2×Mean