ScholarQuill logoScholarQuillUniversity Notes
  • Notes
  • Past Papers
  • Blogs
  • Todo
Login
ScholarQuill logoScholarQuillUniversity Notes
Login
NotesPast PapersBlogsTodo
More
SubjectsDiscussionCGPA CalculatorGPA CalculatorStudent PortalCourse Outline
About
About usPrivacy PolicyReportContact
Notes
Past Papers
Blogs
Todo
Analytics
    Current Subject
    🧩
    Probability and Statistics
    MS-251
    Progress0 / 36 topics
    Topics
    1. Introduction: Statistics and Data Analysis2. Statistical Inference3. Samples, Populations, and the Role of Probability4. Sampling Procedures5. Discrete and Continuous Data6. Statistical Modeling7. Types of Statistical Studies8. Probability: Sample Space, Events, Counting Sample Points9. Probability of an Event10. Additive Rules11. Conditional Probability12. Independence and the Product Rule13. Bayes’ Rule14. Random Variables and Probability Distributions15. Mathematical Expectation: Mean of a Random Variable16. Variance and Covariance of Random Variables17. Means and Variances of Linear Combinations of Random Variables18. Chebyshev’s Theorem19. Discrete Probability Distributions20. Continuous Probability Distributions21. Fundamental Sampling Distributions22. Sampling Distributions and Data Descriptions23. Random Sampling24. Sampling Distributions25. Sampling Distribution of Means and the Central Limit Theorem26. Sampling Distribution of S227. t-Distribution28. F-Quantile and Probability Plots29. Single Sample & One- and Two-Sample Estimation Problems30. Single Sample & One- and Two-Sample Tests of Hypotheses31. The Use of P-Values for Decision Making in Testing Hypotheses32. Regression: Linear Regression and Correlation33. Least Squares and the Fitted Model34. Multiple Linear Regression and Certain Nonlinear Regression Models35. Linear Regression Model Using Matrices36. Properties of the Least Squares Estimators
    MS-251›Fundamental Sampling Distributions
    Probability and StatisticsTopic 21 of 36

    Fundamental Sampling Distributions

    9 minread
    1,571words
    Intermediatelevel

    Fundamental Sampling Distributions

    Sampling distributions are a fundamental concept in inferential statistics. They describe the probability distribution of a sample statistic (such as the sample mean, variance, or proportion) obtained from a random sample drawn from a population. Understanding sampling distributions is crucial for making inferences about the population based on sample data.

    In essence, a sampling distribution is the distribution of a given statistic (such as the sample mean) computed from multiple samples taken from the same population. These distributions help to understand how sample statistics vary from one sample to another.

    Key Concepts of Sampling Distributions

    1. Sample Statistic:

      • A sample statistic is any summary measure calculated from a sample, such as the sample mean Xˉ\bar{X}Xˉ, sample variance s2s^2s2, or sample proportion p^\hat{p}p^​.
    2. Sampling Distribution:

      • The sampling distribution of a statistic is the probability distribution of that statistic based on all possible random samples of a specific size nnn from a population. It tells us how the statistic behaves across many different samples.
    3. Standard Error:

      • The standard error (SE) of a statistic is a measure of how much the statistic (like the sample mean) varies from sample to sample. It is the standard deviation of the sampling distribution.
      • For example, the standard error of the sample mean is: SEXˉ=σnSE_{\bar{X}} = \frac{\sigma}{\sqrt{n}}SEXˉ​=n​σ​ Where σ\sigmaσ is the population standard deviation and nnn is the sample size.
    4. Central Limit Theorem (CLT):

      • The Central Limit Theorem (CLT) is one of the most important results in statistics. It states that for a large enough sample size, the sampling distribution of the sample mean Xˉ\bar{X}Xˉ (or any other statistic) will be approximately normally distributed, regardless of the shape of the population distribution. This is true for any population distribution, as long as the sample size is sufficiently large (typically n≥30n \geq 30n≥30).

      The CLT implies that:

      • The distribution of the sample mean becomes approximately normal as the sample size increases.
      • The mean of the sample means is equal to the population mean μ\muμ.
      • The standard error of the sample mean decreases as the sample size increases, making the sample mean more reliable.
      Xˉ∼N(μ,σ2n)\bar{X} \sim N \left( \mu, \frac{\sigma^2}{n} \right)Xˉ∼N(μ,nσ2​)

    Types of Sampling Distributions

    Here are some common sampling distributions that are fundamental to understanding statistical inference:

    1. Sampling Distribution of the Sample Mean

    The sampling distribution of the sample mean Xˉ\bar{X}Xˉ refers to the distribution of the mean of a sample taken from a population.

    • Mean of the Sampling Distribution:

      • The mean of the sample mean is equal to the population mean μ\muμ. That is:
      E[Xˉ]=μE[\bar{X}] = \muE[Xˉ]=μ
    • Variance of the Sampling Distribution:

      • The variance of the sample mean is given by:
      Var(Xˉ)=σ2n\text{Var}(\bar{X}) = \frac{\sigma^2}{n}Var(Xˉ)=nσ2​

      Where σ2\sigma^2σ2 is the population variance and nnn is the sample size.

    • Standard Error of the Sample Mean:

      • The standard error of the sample mean (SE) is:
      SEXˉ=σnSE_{\bar{X}} = \frac{\sigma}{\sqrt{n}}SEXˉ​=n​σ​

      As the sample size increases, the standard error decreases, making the sample mean a more precise estimate of the population mean.

    • Central Limit Theorem:

      • The Central Limit Theorem (CLT) states that, regardless of the shape of the population distribution, the sampling distribution of the sample mean will be approximately normal for sufficiently large sample sizes (usually n≥30n \geq 30n≥30).

    2. Sampling Distribution of the Sample Proportion

    The sample proportion is the proportion of successes in a sample. If ppp is the population proportion of successes, the sample proportion is denoted by p^\hat{p}p^​, and it is defined as:

    p^=Number of successes in the samplen\hat{p} = \frac{\text{Number of successes in the sample}}{n}p^​=nNumber of successes in the sample​
    • Mean of the Sampling Distribution of p^\hat{p}p^​:

      • The mean of the sample proportion is equal to the population proportion ppp:
      E[p^]=pE[\hat{p}] = pE[p^​]=p
    • Variance of the Sampling Distribution of p^\hat{p}p^​:

      • The variance of p^\hat{p}p^​ is given by:
      Var(p^)=p(1−p)n\text{Var}(\hat{p}) = \frac{p(1 - p)}{n}Var(p^​)=np(1−p)​

      Where nnn is the sample size.

    • Standard Error of the Sample Proportion:

      • The standard error of the sample proportion is:
      SEp^=p(1−p)nSE_{\hat{p}} = \sqrt{\frac{p(1 - p)}{n}}SEp^​​=np(1−p)​​

      As the sample size increases, the standard error of p^\hat{p}p^​ decreases, making p^\hat{p}p^​ a more reliable estimate of ppp.

    • Central Limit Theorem:

      • If the sample size nnn is large enough, the sampling distribution of p^\hat{p}p^​ will be approximately normal with mean ppp and standard deviation p(1−p)n\sqrt{\frac{p(1 - p)}{n}}np(1−p)​​, provided that the conditions np≥10np \geq 10np≥10 and n(1−p)≥10n(1 - p) \geq 10n(1−p)≥10 are met. This is the rule of thumb for approximating p^\hat{p}p^​'s distribution as normal.

    3. Sampling Distribution of the Sample Variance

    The sampling distribution of the sample variance is the distribution of the variance computed from a sample. If the population follows a normal distribution with variance σ2\sigma^2σ2, the sample variance s2s^2s2 follows a chi-square distribution.

    • Mean of the Sampling Distribution of s2s^2s2:

      • The mean of the sample variance is equal to the population variance:
      E[s2]=σ2E[s^2] = \sigma^2E[s2]=σ2
    • Variance of the Sampling Distribution of s2s^2s2:

      • The variance of the sample variance is:
      Var(s2)=2σ4n−1\text{Var}(s^2) = \frac{2 \sigma^4}{n - 1}Var(s2)=n−12σ4​
    • Chi-Square Distribution:

      • The sample variance follows a chi-square distribution with n−1n - 1n−1 degrees of freedom:
      (n−1)s2σ2∼χn−12\frac{(n - 1) s^2}{\sigma^2} \sim \chi^2_{n-1}σ2(n−1)s2​∼χn−12​

      This distribution is important for statistical tests that involve variance or standard deviation.

    4. T-Distribution

    When estimating the population mean μ\muμ using a small sample size (typically n<30n < 30n<30) and the population variance σ2\sigma^2σ2 is unknown, the t-distribution is used instead of the normal distribution. The t-distribution is similar to the normal distribution but has heavier tails, which account for the increased uncertainty when using small sample sizes.

    • Sampling Distribution of the Sample Mean with Unknown Variance:
      • When the population variance σ2\sigma^2σ2 is unknown, the sampling distribution of the sample mean follows a t-distribution:
      t=Xˉ−μsnt = \frac{\bar{X} - \mu}{\frac{s}{\sqrt{n}}}t=n​s​Xˉ−μ​ Where sss is the sample standard deviation, and the t-distribution has n−1n - 1n−1 degrees of freedom.

    Summary

    • Sampling distributions describe how sample statistics vary from sample to sample. They are fundamental for making inferences about the population.
    • The sampling distribution of the sample mean is approximately normal, regardless of the population distribution, for large enough sample sizes, due to the Central Limit Theorem.
    • The sample proportion also follows a normal distribution for large sample sizes, under the condition that the expected number of successes and failures are sufficiently large.
    • The sample variance follows a chi-square distribution for normally distributed populations.
    • For small sample sizes and unknown population variance, the t-distribution is used to model the sampling distribution of the sample mean.

    Sampling distributions allow statisticians to make precise probabilistic statements about how close a sample statistic is to the true population parameter, which is essential for hypothesis testing and confidence interval estimation.

    Previous topic 20
    Continuous Probability Distributions
    Next topic 22
    Sampling Distributions and Data Descriptions

    Past Papers

    Open this section to load past papers

    Click on Show Past Papers to see past papers.
    On This Page
      Reading Stats
      Est. reading time9 min
      Word count1,571
      Code examples0
      DifficultyIntermediate