H2 Maths Notes (JC 1-2): 6.5) Hypothesis Testing

Download printable cheat-sheet (CC-BY 4.0)

07 Oct 2025, 00:00 Z

Before you revise\ Prepare a quick-reference sheet: null/alternative templates, common critical values, and GC commands for z, t, and binomial tests. Always practise writing the full conclusion sentence—MOE wants context, not just “reject \( H_0 \)”.

Hypothesis Testing Framework

State hypotheses: \( H_0 \) (status quo) and \( H_1 \) (claim), referencing the parameter (\( \mu \), \( p \), etc.).
Choose test statistic and significance level \( \alpha \).
Calculate the observed statistic and either the p-value or compare with critical values.
Decision: reject or fail to reject \( H_0 \) based on the evidence.
Conclusion: interpret in context, noting the significance level.

z-tests for Means (Known \( \sigma \) or Large \( n \))

Test statistic: \( Z = \frac{\bar{X} - \mu_0}{\sigma / \sqrt{n}} \).
Critical regions for two-tailed tests at \( \alpha \): reject if \( |Z| > z_{1 - \alpha/2} \).

Example -- JC lecture attendance

Historical mean attendance is 520 students. After a scheduling change, a sample of \( n = 64 \) lectures gives \( \bar{X} = 508 \), \( \sigma = 40 \). Test at 5% whether attendance decreased.

\( H_0: \mu = 520 \), \( H_1: \mu < 520 \).
\( Z = \frac{508 - 520}{40 / \sqrt{64}} = -2.40 \).
Critical value: \( z_{0.05} = -1.645 \). Since \( -2.40 < -1.645 \), reject \( H_0 \).
Conclusion: There is sufficient evidence at the 5% level that average attendance fell below 520.

t-tests for Means (Unknown \( \sigma \), Small Samples)

Statistic: \( T = \frac{\bar{X} - \mu_0}{S / \sqrt{n}} \) with \( n - 1 \) degrees of freedom.
Use t-distribution tables or GC tTest functions.

Example -- Tutorial completion time

A new tutorial format should reduce the mean completion time of \( 85 \) minutes. A sample of \( n = 12 \) classes yields \( \bar{X} = 81.3 \) minutes, \( s = 6.5 \). Test at 10% significance.

\( H_0: \mu = 85 \), \( H_1: \mu < 85 \).
\( t = \frac{81.3 - 85}{6.5 / \sqrt{12}} = -1.99 \).
Critical value: \( t_{0.10, 11} = -1.363 \). Since \( -1.99 < -1.363 \), reject \( H_0 \).
Conclude the average completion time has decreased at the 10% level.

Proportion Tests

Statistic: \( Z = \frac{\hat{p} - p_0}{\sqrt{p_0(1 - p_0)/n}} \).
Ensure \( np_0 \) and \( n(1 - p_0) \) are both \( \geq 5 \).

Example -- Device adoption

Suppose previously \( 60% \) of students used graphing calculators daily. A survey of \( n = 150 \) finds \( 105 \) users. Test at 1% if usage increased.

\( H_0: p = 0.6 \), \( H_1: p > 0.6 \).
\( \hat{p} = 0.7 \), \( Z = \frac{0.7 - 0.6}{\sqrt{0.6 \times 0.4 / 150}} = 2.74 \).
Critical value \( z_{0.99} = 2.326 \). Since \( 2.74 > 2.326 \), reject \( H_0 \).
Evidence suggests the proportion of daily users exceeded 60%.

Binomial Tests (Small n)

When \( n \) is small, evaluate exact probabilities using \( \operatorname{Bin}(n, p_0) \).

Example -- Quality audit

A production line historically has defect rate \(5\%\). In \( n = 20 \) items, \( 3 \) are defective. Test at \(5\%\) if the defect rate increased.

\( H_0: p = 0.05 \), \( H_1: p > 0.05 \).
Compute \( P(X \geq 3) = 1 - [P\( X = 0 \) + P\( X = 1 \) + P\( X = 2 \)] \).
Using GC binomcdf(20, 0.05, 2) = 0.924; hence \( P(X \geq 3) = 0.076 \).
Since 0.076 > 0.05, fail to reject \( H_0 \); insufficient evidence that defects increased.

P-values vs Critical Regions

P-value: probability, under \( H_0 \), of observing a statistic at least as extreme as the sample.
If p-value \( < \alpha \), reject \( H_0 \); otherwise do not reject.
Mention explicitly when reporting: “p-value = 0.032 < 0.05, so reject \( H_0 \).”

Calculator Workflows

TI: ZTest, TTest, 1-PropZTest, 2-SampTTest; record the inputs and outputs (test statistic, p-value).
Casio: Use STAT > DIST > NORM / t / BINM; for binomial tail probabilities, Bcd command handles cumulative sums.
Always double-check the tail (left/right/two) before pressing ENTER.

Exam Watch Points

Hypotheses must refer to parameters (\( \mu \), \( p \)), not statistics (\( \bar{X} \)).
Quote the significance level and justify one- vs two-tailed tests from the context wording (“increase”, “different”).
Round the test statistic to 3 decimal places and p-values to 3 significant figures.
Ensure the final conclusion mentions the context and the significance level (e.g. “At the 5% level…”).
When using approximations, verify assumptions (normality, sample size).

Quick Revision Checklist

[ ] Set up \( H_0 \), \( H_1 \) correctly and select the appropriate test statistic.
[ ] Carry out \(z\), \(t\), and binomial tests with calculator support while writing full working.
[ ] Interpret p-values versus critical regions and articulate conclusions in context.
[ ] Explain possible Type I / Type II errors qualitatively when asked.

Next steps: Pair hypothesis tests with regression outputs (Topic 6.6) and past-year structured questions combining sampling and inference.