ScholarQuill logoScholarQuillUniversity Notes
  • Notes
  • Past Papers
  • Blogs
  • Todo
Login
ScholarQuill logoScholarQuillUniversity Notes
Login
NotesPast PapersBlogsTodo
More
SubjectsDiscussionCGPA CalculatorGPA CalculatorStudent PortalCourse Outline
About
About usPrivacy PolicyReportContact
Notes
Past Papers
Blogs
Todo
Analytics
    Current Subject
    🧩
    Probability and Statistics
    MS-251
    Progress0 / 36 topics
    Topics
    1. Introduction: Statistics and Data Analysis2. Statistical Inference3. Samples, Populations, and the Role of Probability4. Sampling Procedures5. Discrete and Continuous Data6. Statistical Modeling7. Types of Statistical Studies8. Probability: Sample Space, Events, Counting Sample Points9. Probability of an Event10. Additive Rules11. Conditional Probability12. Independence and the Product Rule13. Bayes’ Rule14. Random Variables and Probability Distributions15. Mathematical Expectation: Mean of a Random Variable16. Variance and Covariance of Random Variables17. Means and Variances of Linear Combinations of Random Variables18. Chebyshev’s Theorem19. Discrete Probability Distributions20. Continuous Probability Distributions21. Fundamental Sampling Distributions22. Sampling Distributions and Data Descriptions23. Random Sampling24. Sampling Distributions25. Sampling Distribution of Means and the Central Limit Theorem26. Sampling Distribution of S227. t-Distribution28. F-Quantile and Probability Plots29. Single Sample & One- and Two-Sample Estimation Problems30. Single Sample & One- and Two-Sample Tests of Hypotheses31. The Use of P-Values for Decision Making in Testing Hypotheses32. Regression: Linear Regression and Correlation33. Least Squares and the Fitted Model34. Multiple Linear Regression and Certain Nonlinear Regression Models35. Linear Regression Model Using Matrices36. Properties of the Least Squares Estimators
    MS-251›Chebyshev’s Theorem
    Probability and StatisticsTopic 18 of 36

    Chebyshev’s Theorem

    7 minread
    1,205words
    Intermediatelevel

    Chebyshev's Theorem (or Chebyshev's Inequality)

    Chebyshev's Theorem is a fundamental result in probability and statistics that provides a bound on the probability that a random variable deviates from its mean. Unlike many other results that require knowledge of the distribution of the data (e.g., the normal distribution), Chebyshev's Theorem applies to any probability distribution, as long as the mean and variance exist. It is particularly useful for distributions that are not normal or when little is known about the distribution.


    Statement of Chebyshev’s Inequality

    Let XXX be a random variable with a mean μ=E[X]\mu = E[X]μ=E[X] and variance σ2=Var(X)\sigma^2 = \text{Var}(X)σ2=Var(X). Chebyshev's Inequality states that for any k>0k > 0k>0:

    P(∣X−μ∣≥kσ)≤1k2P(|X - \mu| \geq k\sigma) \leq \frac{1}{k^2}P(∣X−μ∣≥kσ)≤k21​

    This inequality tells us that the probability that XXX deviates from its mean by more than kkk standard deviations is at most 1k2\frac{1}{k^2}k21​.

    Alternatively, the probability that XXX lies within kkk standard deviations of the mean is at least:

    P(∣X−μ∣<kσ)≥1−1k2P(|X - \mu| < k\sigma) \geq 1 - \frac{1}{k^2}P(∣X−μ∣<kσ)≥1−k21​

    Interpretation

    • Chebyshev’s inequality provides a bound on how much of the distribution lies within a given number of standard deviations from the mean.
    • It guarantees that no more than 1k2\frac{1}{k^2}k21​ of the probability mass is outside kkk standard deviations from the mean, regardless of the shape of the distribution.
    • This is useful because, while specific distributions (like the normal distribution) may allow for more precise predictions, Chebyshev’s inequality holds for any distribution with a finite mean and variance.

    Examples and Applications

    1. Example of Chebyshev’s Inequality

    Suppose you have a random variable XXX with the following parameters:

    • Mean μ=50\mu = 50μ=50
    • Standard deviation σ=10\sigma = 10σ=10

    Let’s use Chebyshev’s inequality to estimate the probability that XXX lies within 2 standard deviations from the mean.

    For k=2k = 2k=2, Chebyshev’s inequality tells us that:

    P(∣X−μ∣≥2σ)≤122=14=0.25P(|X - \mu| \geq 2\sigma) \leq \frac{1}{2^2} = \frac{1}{4} = 0.25P(∣X−μ∣≥2σ)≤221​=41​=0.25

    This means that at least 75% of the data lies within 2 standard deviations of the mean (since P(∣X−μ∣<2σ)≥1−14=0.75P(|X - \mu| < 2\sigma) \geq 1 - \frac{1}{4} = 0.75P(∣X−μ∣<2σ)≥1−41​=0.75).

    2. Chebyshev’s Inequality for k=3k = 3k=3

    Let’s apply Chebyshev’s inequality for k=3k = 3k=3. We know that:

    P(∣X−μ∣≥3σ)≤132=19≈0.1111P(|X - \mu| \geq 3\sigma) \leq \frac{1}{3^2} = \frac{1}{9} \approx 0.1111P(∣X−μ∣≥3σ)≤321​=91​≈0.1111

    So, for this value of kkk, at least 88.89% of the values of XXX lie within 3 standard deviations of the mean.

    3. General Insights

    • For k=1k = 1k=1: Chebyshev's inequality tells us that at least 0% of the data lies within 1 standard deviation from the mean. (This is a trivial and uninformative result.)
    • For k=2k = 2k=2: We get that at least 75% of the values lie within 2 standard deviations of the mean.
    • For k=3k = 3k=3: At least 88.89% of the values lie within 3 standard deviations of the mean.

    As kkk increases, the proportion of the data within kkk standard deviations increases as well.


    Why Is Chebyshev's Inequality Useful?

    1. Distribution-Agnostic: Unlike the empirical rule for the normal distribution, which states that approximately 68% of the data lies within 1 standard deviation, 95% within 2, and 99.7% within 3, Chebyshev’s inequality does not assume any specific distribution. It applies to all distributions that have a finite mean and variance.

    2. Worst-case Scenario: Chebyshev’s inequality is often used when there is uncertainty about the underlying distribution. It gives a "worst-case" bound for the probability that a random variable deviates from its mean.

    3. Handling Outliers: In situations where we want to be sure that the data does not have extreme values (outliers), Chebyshev’s inequality helps us establish a minimum proportion of data within certain bounds.

    4. General Statistical Analysis: It can be used to assess how data is distributed in cases where little is known about the data distribution, especially in the early stages of statistical analysis or in exploratory data analysis.


    Limitations of Chebyshev's Inequality

    • Looseness of the Bound: Chebyshev's inequality tends to be a very conservative (loose) bound, especially when the distribution is normal. For normal distributions, much tighter bounds are available (e.g., the empirical rule or the 68-95-99.7 rule).
    • Lack of Specificity: It does not provide any information about the actual shape of the distribution. For example, it doesn’t tell you whether the distribution is skewed or whether it has heavy tails; it only tells you how much data is within a certain distance from the mean.
    • Not Sharp: Chebyshev’s inequality provides a general bound, but it might overestimate the proportion of data outside the given range. It is a worst-case scenario bound, which makes it less useful for more specific, well-understood distributions.

    Summary

    • Chebyshev's Theorem gives a probabilistic bound on how much of a distribution lies within a certain number of standard deviations from the mean.
    • It is applicable to all distributions with finite mean and variance.
    • The theorem states that for any k>0k > 0k>0, at least 1−1k21 - \frac{1}{k^2}1−k21​ of the data lies within kkk standard deviations from the mean.
    • While the bound is loose and not specific to any distribution, it is particularly useful in situations where the distribution is unknown or irregular.

    In practice, Chebyshev's inequality is valuable for establishing basic confidence about data spread, especially in cases where other statistical assumptions (such as normality) cannot be made.

    Previous topic 17
    Means and Variances of Linear Combinations of Random Variables
    Next topic 19
    Discrete Probability Distributions

    Past Papers

    Open this section to load past papers

    Click on Show Past Papers to see past papers.
    On This Page
      Reading Stats
      Est. reading time7 min
      Word count1,205
      Code examples0
      DifficultyIntermediate