1. Arithmetic Mean JEE Main & Advanced

The arithmetic mean (or simply mean) x¯ of a dataset is the sum of all observations divided by the number of observations.

Mean for Ungrouped Data

For n observations x1,x2,,xn:

x¯=x1+x2++xnn=i=1nxin=Σxin

Mean for Discrete Frequency Distribution

For values x1,x2,,xk with frequencies f1,f2,,fk (where Σfi=N):

x¯=ΣfixiΣfi=ΣfixiN

Mean for Continuous (Grouped) Data — Three Methods

Method 1 — Direct Method:

x¯=ΣfimiN

where mi is the midpoint of the ith class interval: mi=lower limit+upper limit2.

Method 2 — Assumed Mean (Short-cut) Method:

x¯=A+ΣfidiN     where di=miA

A is the assumed mean (usually the midpoint of the class with the highest frequency). This method reduces arithmetic when numbers are large.

Method 3 — Step Deviation Method:

x¯=A+ΣfiuiN×h     where ui=dih=miAh

h = class width (common class size). Simplifies further when all class widths are equal.

Properties of Arithmetic Mean

Property Statement
Sum of deviations Σ(xix¯)=0 — deviations from mean always sum to zero
Shift by constant If each xi is replaced by xi+k, new mean =x¯+k
Scale by constant If each xi is replaced by kxi, new mean =kx¯
Minimises sum of squared deviations Σ(xia)2 is minimum when a=x¯ (least squares property)
Combined mean For two groups with means x¯1, x¯2 and sizes n1, n2: x¯=n1x¯1+n2x¯2n1+n2
Effect of wrong entry Corrected mean =x¯+correct valuewrong valuen

Weighted Mean

When observations have different weights wi:

x¯w=ΣwixiΣwi

2. Median JEE Main & Advanced

The median is the middle value of a dataset when arranged in ascending or descending order. It divides the distribution into two equal halves.

Median for Ungrouped Data

Arrange data in ascending order. Then:

  • If n is odd: Median =(n+12)th observation.
  • If n is even: Median =(n2)th+(n2+1)th observation2.

Median for Grouped (Continuous) Data

Median=l+N2cff×h

where:

  • l = lower boundary of the median class
  • N = total frequency (Σfi)
  • cf = cumulative frequency of the class preceding the median class
  • f = frequency of the median class
  • h = class width (height of the median class)
  • Median class = the class whose cumulative frequency first exceeds N/2

Properties of Median

  • Median minimises Σ|xia| (sum of absolute deviations) — the least absolute deviations property.
  • Not affected by extreme values (outliers) — unlike the mean.
  • Suitable for skewed distributions and ordinal data.
  • For a symmetric distribution: Mean = Median = Mode.

3. Mode JEE Main & Advanced

The mode is the observation (or class) that occurs with the highest frequency in a dataset.

Mode for Ungrouped Data

Simply the value that appears most often. A dataset can be:

  • Unimodal: one mode
  • Bimodal: two modes
  • Multimodal: more than two modes
  • No mode: all values appear equally often

Mode for Grouped Data

The modal class is the class with the highest frequency. Mode is found by:

Mode=l+f1f02f1f0f2×h

where:

  • l = lower boundary of the modal class
  • f1 = frequency of the modal class (highest frequency)
  • f0 = frequency of the class preceding the modal class
  • f2 = frequency of the class following the modal class
  • h = class width

Properties of Mode

  • Easiest to compute — no calculation needed for ungrouped data.
  • Not affected by extreme values.
  • May not exist or may not be unique.
  • Useful for qualitative (categorical) data.

4. Empirical Relationship Between Mean, Median and Mode JEE Main & Advanced

For a moderately skewed (asymmetric) distribution, Karl Pearson established the empirical relationship:

Mode=3Median2Mean

This is a very frequently tested result — it allows computing one measure when the other two are known.

Distribution Type Relationship Shape
Symmetric Mean = Median = Mode Bell-shaped; no skew
Positively skewed (right-skewed) Mean > Median > Mode Long tail on the right
Negatively skewed (left-skewed) Mean < Median < Mode Long tail on the left

5. Geometric Mean and Harmonic Mean CBSE Boards

Geometric Mean (GM) of n positive observations:

GM=(x1x2xn)1/n

Used for averaging ratios, growth rates, and index numbers.

Harmonic Mean (HM) of n observations:

HM=n1x1+1x2++1xn=nΣ(1/xi)

Used for averaging rates (speed, frequency).

Inequality: AM GM HM

For positive numbers, the arithmetic mean is always geometric mean harmonic mean, with equality iff all observations are equal.

6. Choosing the Right Measure of Central Tendency

Situation Best Measure Reason
Symmetric distribution, no outliers Mean Uses all data; most efficient
Skewed distribution or outliers present Median Not distorted by extreme values
Categorical (qualitative) data Mode Mean/median not applicable for non-numeric data
Most common value needed Mode Directly gives the most frequent value
Averaging growth rates, ratios Geometric Mean Appropriate for multiplicative relationships
Averaging speeds (same distance) Harmonic Mean Gives correct average speed