ScholarQuill logoScholarQuillUniversity Notes
  • Notes
  • Past Papers
  • Blogs
  • Todo
Login
ScholarQuill logoScholarQuillUniversity Notes
Login
NotesPast PapersBlogsTodo
More
SubjectsDiscussionCGPA CalculatorGPA CalculatorStudent PortalCourse Outline
About
About usPrivacy PolicyReportContact
Notes
Past Papers
Blogs
Todo
Analytics
    Current Subject
    🧩
    Introduction to Statistics
    STAT2115
    Progress0 / 24 topics
    Topics
    1. Scope of Statistics2. Introduction to Basic Concepts of Statistics: Descriptive and Inferential Statistics3. Population, Sample, Parameter, and Statistic4. Types of Data and Scales of Measurement5. Frequency Distribution and Graphical Representation6. Bar Chart, Pie Chart, and Histogram7. Frequency Polygon, Frequency Curve, and Cumulative Frequency Polygon8. Measures of Central Tendency9. Quantiles10. Absolute and Relative Measures of Dispersion11. Moments, Skewness and Kurtosis12. Basic Concepts of Probability13. Counting Rules: Multiplication Principle, Permutation and Combination14. Probability Spaces and Laws of Probability15. Conditional Probability and Bayes' Theorem16. Discrete and Continuous Random Variables17. Probability Distributions: Binomial, Poisson, and Hypergeometric18. Probability Distributions: Uniform, Exponential, and Normal19. Overview of Sampling: Sample Design and Sampling Frame20. Sampling and Non-Sampling Errors21. Sampling Distributions for Mean and Proportion22. Sampling Distributions for Difference of Means and Difference of Proportions23. Overview of Hypothesis Testing24. Overview of Regression Analysis
    STAT2115›Sampling Distributions for Difference of Means and Difference of Proportions
    Introduction to StatisticsTopic 22 of 24

    Sampling Distributions for Difference of Means and Difference of Proportions

    7 minread
    1,243words
    Intermediatelevel

    1. Sampling Distribution of the Difference of Means

    Suppose we have two independent populations:

    • Population 1: mean μ1\mu_1μ1​, variance σ12\sigma_1^2σ12​, sample size n1n_1n1​
    • Population 2: mean μ2\mu_2μ2​, variance σ22\sigma_2^2σ22​, sample size n2n_2n2​

    Sample means: Xˉ1\bar{X}_1Xˉ1​ and Xˉ2\bar{X}_2Xˉ2​

    We are interested in the difference of sample means:

    D=Xˉ1−Xˉ2D = \bar{X}_1 - \bar{X}_2D=Xˉ1​−Xˉ2​

    Properties

    1. Mean of DDD:

      E(D)=E(Xˉ1−Xˉ2)=μ1−μ2E(D) = E(\bar{X}_1 - \bar{X}_2) = \mu_1 - \mu_2E(D)=E(Xˉ1​−Xˉ2​)=μ1​−μ2​
    2. Variance of DDD (assuming independence):

      Var(D)=Var(Xˉ1)+Var(Xˉ2)=σ12n1+σ22n2Var(D) = Var(\bar{X}_1) + Var(\bar{X}_2) = \frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}Var(D)=Var(Xˉ1​)+Var(Xˉ2​)=n1​σ12​​+n2​σ22​​
    3. Standard Error (SE) of Difference:

      SE(D)=σ12n1+σ22n2SE(D) = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}SE(D)=n1​σ12​​+n2​σ22​​​
    4. Distribution Shape:

    • If populations are normal → DDD is exactly normal
    • If sample sizes are large → DDD is approximately normal (Central Limit Theorem)

    Example:

    • Population 1 mean = 50, σ1=10\sigma_1 = 10σ1​=10, n1=25n_1 = 25n1​=25
    • Population 2 mean = 45, σ2=8\sigma_2 = 8σ2​=8, n2=36n_2 = 36n2​=36 SE(D)=10225+8236=4+1.78=5.78≈2.41SE(D) = \sqrt{\frac{10^2}{25} + \frac{8^2}{36}} = \sqrt{4 + 1.78} = \sqrt{5.78} \approx 2.41SE(D)=25102​+3682​​=4+1.78​=5.78​≈2.41
    • Difference in means = (50 - 45 = 5), SE = 2.41

    2. Sampling Distribution of the Difference of Proportions

    Suppose we have two independent populations with proportions of success:

    • Population 1: p1p_1p1​, sample size n1n_1n1​
    • Population 2: p2p_2p2​, sample size n2n_2n2​

    Sample proportions: p^1\hat{p}_1p^​1​ and p^2\hat{p}_2p^​2​

    We are interested in the difference of sample proportions:

    Dp=p^1−p^2D_p = \hat{p}_1 - \hat{p}_2Dp​=p^​1​−p^​2​

    Properties

    1. Mean of DpD_pDp​:

      E(Dp)=p1−p2E(D_p) = p_1 - p_2E(Dp​)=p1​−p2​
    2. Variance of DpD_pDp​ (assuming independence):

      Var(Dp)=p1(1−p1)n1+p2(1−p2)n2Var(D_p) = \frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2}Var(Dp​)=n1​p1​(1−p1​)​+n2​p2​(1−p2​)​
    3. Standard Error (SE) of Difference:

      SE(Dp)=p1(1−p1)n1+p2(1−p2)n2SE(D_p) = \sqrt{\frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2}}SE(Dp​)=n1​p1​(1−p1​)​+n2​p2​(1−p2​)​​
    4. Distribution Shape:

    • For large n1n_1n1​ and n2n_2n2​ → approximately normal

    Example:

    • Population 1 proportion p1=0.6p_1 = 0.6p1​=0.6, n1=100n_1 = 100n1​=100
    • Population 2 proportion p2=0.4p_2 = 0.4p2​=0.4, n2=120n_2 = 120n2​=120 SE(Dp)=0.6∗0.4100+0.4∗0.6120=0.0024+0.002=0.0044≈0.066SE(D_p) = \sqrt{\frac{0.6*0.4}{100} + \frac{0.4*0.6}{120}} = \sqrt{0.0024 + 0.002} = \sqrt{0.0044} \approx 0.066SE(Dp​)=1000.6∗0.4​+1200.4∗0.6​​=0.0024+0.002​=0.0044​≈0.066
    • Difference in proportions = (0.6 - 0.4 = 0.2), SE ≈ 0.066

    3. Key Points to Remember

    Statistic Mean Standard Error (SE) Distribution
    Difference of Means (Xˉ1−Xˉ2\bar{X}_1 - \bar{X}_2Xˉ1​−Xˉ2​) μ1−μ2\mu_1 - \mu_2μ1​−μ2​ σ12n1+σ22n2\sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}n1​σ12​​+n2​σ22​​​ Normal (exact or approx.)
    Difference of Proportions (p^1−p^2\hat{p}_1 - \hat{p}_2p^​1​−p^​2​) p1−p2p_1 - p_2p1​−p2​ p1(1−p1)n1+p2(1−p2)n2\sqrt{\frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2}}n1​p1​(1−p1​)​+n2​p2​(1−p2​)​​ Approximately Normal (large n)

    Important Notes:

    1. Populations should be independent.
    2. For small sample sizes, consider using t-distribution for means.
    3. For proportions, normal approximation works if n1p1,n1(1−p1),n2p2,n2(1−p2)≥5n_1p_1, n_1(1-p_1), n_2p_2, n_2(1-p_2) \ge 5n1​p1​,n1​(1−p1​),n2​p2​,n2​(1−p2​)≥5.
    Previous topic 21
    Sampling Distributions for Mean and Proportion
    Next topic 23
    Overview of Hypothesis Testing

    Past Papers

    Open this section to load past papers

    Click on Show Past Papers to see past papers.
    On This Page
      Reading Stats
      Est. reading time7 min
      Word count1,243
      Code examples0
      DifficultyIntermediate