BUSA3129›Measures of dispersion (range, variance, standard deviation)

Statistical Analysis for BusinessTopic 13 of 43

Measures of dispersion (range, variance, standard deviation)

5 minread

913words

Intermediatelevel

Measures of Dispersion: Range, Variance, and Standard Deviation

Measures of dispersion provide insights into the spread and variability of a dataset. Understanding how data points differ from the central tendency is crucial for accurate data analysis and interpretation. Here’s a detailed look at three key measures of dispersion: range, variance, and standard deviation.

1. Range

Definition: The range is the simplest measure of dispersion. It indicates the difference between the highest and lowest values in a dataset.

Calculation:

\text{Range} = \text{Maximum Value} - \text{Minimum Value}

Example:

Consider the dataset: 10, 15, 20, 25, 30.
- Maximum: 30
- Minimum: 10
- Range:
$\text{Range} = 30 - 10 = 20$

Advantages:

Easy to calculate and understand.
Provides a quick sense of the spread of values.

Disadvantages:

Highly sensitive to outliers. A single extreme value can dramatically affect the range.
Does not provide information about the distribution of values within the range.

2. Variance

Definition: Variance measures how far each number in a dataset is from the mean and, consequently, from every other number. It indicates the degree of spread in the data.

Calculation:

For a Population:

\sigma^2 = \frac{\sum (X_i - \mu)^2}{N}

where:

$\sigma^2$ = population variance
$X_i$ = each value in the dataset
$\mu$ = population mean
$N$ = number of values in the population
For a Sample:

s^2 = \frac{\sum (X_i - \bar{X})^2}{n - 1}

where:

$s^2$ = sample variance
$\bar{X}$ = sample mean
$n$ = number of values in the sample

Example:

Consider the sample dataset: 5, 10, 15.
- Mean:
$\bar{X} = \frac{5 + 10 + 15}{3} = 10$
- Variance Calculation:
  - Differences from the mean:
    - $(5 - 10)^2 = 25$
    - $(10 - 10)^2 = 0$
    - $(15 - 10)^2 = 25$
  - Sum of squared differences: $25 + 0 + 25 = 50$
  - Sample variance:
  $s^2 = \frac{50}{3 - 1} = \frac{50}{2} = 25$

Advantages:

Takes all data points into account, providing a comprehensive measure of dispersion.
Useful for further statistical analysis.

Disadvantages:

Variance is expressed in squared units, which can make it less intuitive.
Sensitive to outliers, as extreme values can heavily influence the result.

3. Standard Deviation

Definition: The standard deviation is the square root of the variance. It provides a measure of dispersion in the same units as the data, making it more interpretable.

Calculation:

For a Population:

\sigma = \sqrt{\sigma^2}

For a Sample:

s = \sqrt{s^2}

Example: Continuing from the previous variance example:

Sample variance $s^2 = 25$ .
Standard Deviation:

s = \sqrt{25} = 5

Advantages:

Expressed in the same units as the original data, making it easier to interpret.
Provides a clear measure of variability and is widely used in statistical analysis.

Disadvantages:

Still sensitive to outliers, similar to variance.
Requires a more complex calculation than the range.

Summary of Measures of Dispersion

Measure	Definition	Calculation	Sensitivity to Outliers
Range	Difference between the maximum and minimum values	Max - Min	Highly sensitive
Variance	Average of squared differences from the mean	$\frac{\sum (X_i - \mu)^2}{N}$ (population) or $\frac{\sum (X_i - \bar{X})^2}{n - 1}$ (sample)	Sensitive
Standard Deviation	Square root of the variance	$\sqrt{s^2}$ (sample) or $\sqrt{\sigma^2}$ (population)	Sensitive

Conclusion

Measures of dispersion—range, variance, and standard deviation—are essential for understanding the spread of data in any analysis. They provide valuable insights into variability, risk assessment, and data reliability in various business contexts. By utilizing these measures, businesses can make informed decisions and enhance their strategic planning. If you have specific questions or would like to explore a particular aspect further, feel free to ask!

Previous topic 12

Dispersion and Variability Analysis

Next topic 14

Coefficient of variation and its implications

Past Papers

Open this section to load past papers

Click on Show Past Papers to see past papers.

Measure

Definition

Calculation

Sensitivity to Outliers

Range

Difference between the maximum and minimum values

Max - Min

Highly sensitive

Variance

Average of squared differences from the mean

\frac{\sum (X_i - \mu)^2}{N}

(population) or

\frac{\sum (X_i - \bar{X})^2}{n - 1}

(sample)

Sensitive

Standard Deviation

Square root of the variance

\sqrt{s^2}

(sample) or

\sqrt{\sigma^2}

(population)

Sensitive