Tools for Quantitative ReasoningTopic 25 of 27

t-test

7 minread

1,273words

Intermediatelevel

A t-test is a statistical method used to determine if there is a significant difference between the means of two groups or whether a sample mean significantly differs from a known population mean, particularly when the population standard deviation is unknown and the sample size is small (typically $n < 30$ ). It is based on the Student's t-distribution, which accounts for the increased variability expected with smaller samples.

Key Concepts of the T-Test

T-Score:
- The t-score measures the number of standard errors the sample mean is away from the population mean. It is calculated as:
$t = \frac{\bar{x} - \mu}{s/\sqrt{n}}$
- Where:
  - $\bar{x}$ : Sample mean
  - $\mu$ : Population mean (or the mean of another group, in the case of a two-sample t-test)
  - $s$ : Sample standard deviation
  - $n$ : Sample size
Assumptions:
- The data should be normally distributed, especially important when sample sizes are small.
- The observations are independent of each other.
- For two-sample tests, the variances of the two groups should be approximately equal (homogeneity of variance).

Types of T-Tests

One-Sample T-Test:
- Used to determine whether the mean of a single sample is significantly different from a known population mean.
- Hypotheses:
  - Null hypothesis ( $H_0$ ): The sample mean is equal to the population mean ( $\bar{x} = \mu$ ).
  - Alternative hypothesis ( $H_a$ ): The sample mean is not equal to the population mean ( $\bar{x} \neq \mu$ ).
Independent Two-Sample T-Test:
- Used to compare the means of two independent groups.
- Hypotheses:
  - Null hypothesis ( $H_0$ ): The means of the two groups are equal ( $\mu_1 = \mu_2$ ).
  - Alternative hypothesis ( $H_a$ ): The means of the two groups are not equal ( $\mu_1 \neq \mu_2$ ).
- The formula for the t-score in a two-sample t-test is:
$t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{s_p^2 \left( \frac{1}{n_1} + \frac{1}{n_2} \right)}}$
- Where:
  - $\bar{x}_1, \bar{x}_2$ : Sample means
  - $s_p^2$ : Pooled variance, calculated as:
  $s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}$
Paired Sample T-Test:
- Used when the samples are related or matched, such as before-and-after measurements on the same subjects.
- Hypotheses:
  - Null hypothesis ( $H_0$ ): The mean difference between pairs is zero ( $\mu_d = 0$ ).
  - Alternative hypothesis ( $H_a$ ): The mean difference is not zero ( $\mu_d \neq 0$ ).
- The t-score is calculated as:
$t = \frac{\bar{d}}{s_d/\sqrt{n}}$
- Where:
  - $\bar{d}$ : Mean of the differences
  - $s_d$ : Standard deviation of the differences
  - $n$ : Number of pairs

Steps to Conduct a T-Test

State the Hypotheses:
- Define the null and alternative hypotheses.
Collect Data:
- Gather the necessary sample data.
Calculate the T-Score:
- Use the appropriate formula based on whether it’s a one-sample, two-sample, or paired sample test.
Determine the Degrees of Freedom (df):
- For a one-sample t-test, $df = n - 1$ .
- For a two-sample t-test, $df = n_1 + n_2 - 2$ .
- For a paired sample t-test, $df = n - 1$ .
Find the Critical Value:
- Use a t-table to find the critical t-value based on the desired significance level ( $\alpha$ ) and degrees of freedom.
Make a Decision:
- Compare the calculated t-score to the critical t-value:
  - If $|t| > t_{\text{critical}}$ : Reject the null hypothesis.
  - If $|t| \leq t_{\text{critical}}$ : Fail to reject the null hypothesis.

Example of a One-Sample T-Test

Scenario: A nutritionist claims that a particular diet reduces the average cholesterol level. A sample of 15 patients on this diet has a mean cholesterol level of 190 mg/dL with a standard deviation of 15 mg/dL. The known population mean cholesterol level is 200 mg/dL. Is there enough evidence to support the nutritionist's claim at the 0.05 significance level?

State the Hypotheses:
- $H_0: \mu = 200$ (the average cholesterol level is 200 mg/dL)
- $H_a: \mu < 200$ (the average cholesterol level is less than 200 mg/dL)
Collect Data:
- Sample mean ( $\bar{x} = 190$ ), sample standard deviation ( $s = 15$ ), sample size ( $n = 15$ ).
Calculate the T-Score:
$t = \frac{190 - 200}{15/\sqrt{15}} \approx \frac{-10}{3.87} \approx -2.58$
Determine the Degrees of Freedom:
- $df = 15 - 1 = 14$ .
Find the Critical Value:
- For a one-tailed test at $\alpha = 0.05$ and $df = 14$ , the critical t-value is approximately -1.761.
Make a Decision:
- Since $-2.58 < -1.761$ , we reject the null hypothesis. There is sufficient evidence to support the nutritionist's claim that the average cholesterol level is less than 200 mg/dL.

Conclusion

The t-test is a versatile statistical tool used to analyze differences between means, especially when dealing with small sample sizes and unknown population standard deviations. By following a systematic approach—defining hypotheses, calculating t-scores, and making decisions based on critical values—researchers can effectively test claims and draw conclusions from their data.

Previous topic 24

z-test

Next topic 26

Statistical inference in decision making

Past Papers

Open this section to load past papers

Click on Show Past Papers to see past papers.

MATH2118›t-test

Tools for Quantitative ReasoningTopic 25 of 27

t-test

7 minread

1,273words

Intermediatelevel

Key Concepts of the T-Test

T-Score:
- The t-score measures the number of standard errors the sample mean is away from the population mean. It is calculated as:
$t = \frac{\bar{x} - \mu}{s/\sqrt{n}}$
- Where:
  - $\bar{x}$ : Sample mean
  - $\mu$ : Population mean (or the mean of another group, in the case of a two-sample t-test)
  - $s$ : Sample standard deviation
  - $n$ : Sample size
Assumptions:
- The data should be normally distributed, especially important when sample sizes are small.
- The observations are independent of each other.
- For two-sample tests, the variances of the two groups should be approximately equal (homogeneity of variance).

Types of T-Tests

One-Sample T-Test:
- Used to determine whether the mean of a single sample is significantly different from a known population mean.
- Hypotheses:
  - Null hypothesis ( $H_0$ ): The sample mean is equal to the population mean ( $\bar{x} = \mu$ ).
  - Alternative hypothesis ( $H_a$ ): The sample mean is not equal to the population mean ( $\bar{x} \neq \mu$ ).
Independent Two-Sample T-Test:
- Used to compare the means of two independent groups.
- Hypotheses:
  - Null hypothesis ( $H_0$ ): The means of the two groups are equal ( $\mu_1 = \mu_2$ ).
  - Alternative hypothesis ( $H_a$ ): The means of the two groups are not equal ( $\mu_1 \neq \mu_2$ ).
- The formula for the t-score in a two-sample t-test is:
$t = \frac{\bar{x}_1 - \bar{x}_2}{\sqrt{s_p^2 \left( \frac{1}{n_1} + \frac{1}{n_2} \right)}}$
- Where:
  - $\bar{x}_1, \bar{x}_2$ : Sample means
  - $s_p^2$ : Pooled variance, calculated as:
  $s_p^2 = \frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}$
Paired Sample T-Test:
- Used when the samples are related or matched, such as before-and-after measurements on the same subjects.
- Hypotheses:
  - Null hypothesis ( $H_0$ ): The mean difference between pairs is zero ( $\mu_d = 0$ ).
  - Alternative hypothesis ( $H_a$ ): The mean difference is not zero ( $\mu_d \neq 0$ ).
- The t-score is calculated as:
$t = \frac{\bar{d}}{s_d/\sqrt{n}}$
- Where:
  - $\bar{d}$ : Mean of the differences
  - $s_d$ : Standard deviation of the differences
  - $n$ : Number of pairs

Steps to Conduct a T-Test

State the Hypotheses:
- Define the null and alternative hypotheses.
Collect Data:
- Gather the necessary sample data.
Calculate the T-Score:
- Use the appropriate formula based on whether it’s a one-sample, two-sample, or paired sample test.
Determine the Degrees of Freedom (df):
- For a one-sample t-test, $df = n - 1$ .
- For a two-sample t-test, $df = n_1 + n_2 - 2$ .
- For a paired sample t-test, $df = n - 1$ .
Find the Critical Value:
- Use a t-table to find the critical t-value based on the desired significance level ( $\alpha$ ) and degrees of freedom.
Make a Decision:
- Compare the calculated t-score to the critical t-value:
  - If $|t| > t_{\text{critical}}$ : Reject the null hypothesis.
  - If $|t| \leq t_{\text{critical}}$ : Fail to reject the null hypothesis.

Example of a One-Sample T-Test

State the Hypotheses:
- $H_0: \mu = 200$ (the average cholesterol level is 200 mg/dL)
- $H_a: \mu < 200$ (the average cholesterol level is less than 200 mg/dL)
Collect Data:
- Sample mean ( $\bar{x} = 190$ ), sample standard deviation ( $s = 15$ ), sample size ( $n = 15$ ).
Calculate the T-Score:
$t = \frac{190 - 200}{15/\sqrt{15}} \approx \frac{-10}{3.87} \approx -2.58$
Determine the Degrees of Freedom:
- $df = 15 - 1 = 14$ .
Find the Critical Value:
- For a one-tailed test at $\alpha = 0.05$ and $df = 14$ , the critical t-value is approximately -1.761.
Make a Decision:
- Since $-2.58 < -1.761$ , we reject the null hypothesis. There is sufficient evidence to support the nutritionist's claim that the average cholesterol level is less than 200 mg/dL.

Conclusion

Previous topic 24

z-test

Next topic 26

Statistical inference in decision making

Past Papers

Open this section to load past papers

Click on Show Past Papers to see past papers.