Statistical Tests in H2 Biology Paper 4: chi-squared, t-test, and How to Interpret Both

Study guide14 Apr 2026, 00:00 ZUpdated 17 Jul 2026

Download PDF Join our Telegram study group

Q: What does this guide cover?
A: It explains which statistical tests appear in H2 Biology Paper 4, when to apply chi-squared versus a t-test, how to carry out each one with numbers, and how to write conclusions in the language examiners award marks for.

TL;DR
The SEAB 9477 syllabus names both t-tests and chi-squared tests in the mathematical requirements. Know both. Apply chi-squared to frequency or count data, and apply the t-test when comparing two sample means from measurement data. Detailed computation may be tested on Papers 1-3, while Paper 4 may test whether you understand what the tests mean in an investigation. State your null hypothesis before doing any calculation, interpret the p-value at the 5% level, and phrase your conclusion using "reject" or "fail to reject the null hypothesis."

For the full Paper 4 lab sequence, use the H2 Biology practical guide for 9477 Paper 4 alongside this page.

Status: SEAB's current H2 Biology (9477) syllabus PDF is labelled for 2026. Its mathematical requirements include rates, graphical presentation, levels of significance, standard deviation, probability, t-tests, and chi-squared tests. The syllabus states that equations, symbol meanings, a t-table, and a chi-squared table are provided where needed; detailed computation is not required on Paper 4, but understanding of the tests may be tested there.

1 Syllabus context: what the 9477 paper actually requires

The 9477 syllabus places statistics inside its mathematical requirements. Candidates may need to present data graphically, calculate rates, appreciate levels of significance and standard deviation, and use both t-tests and chi-squared tests. The syllabus also states that candidates are given access to the equations, symbol meanings, a t-table, and a chi-squared table when calculation is required. [1]

For Paper 4, the key distinction is scope. Detailed t-test or chi-squared computation is not required on Paper 4, but understanding what the tests mean may be tested. A practical answer can still require the student to decide whether a result supports a conclusion, whether two means differ significantly, or whether observed counts fit an expected pattern.

This guide covers both tests so that the calculation routine and the Paper 4 interpretation routine do not get mixed up.

Paper 4 statistics source checkpoint

Use this checkpoint when a practical question gives statistical output or asks whether data supports a conclusion.

If the data are...	Use this test logic	Paper 4 answer move
Measured values from two groups	t-test	State the null hypothesis, compare calculated and critical values if provided, then say whether the mean difference is significant.
Counts or frequencies in categories	Chi-squared test	Compare observed and expected values, then decide whether the difference from expectation is significant.
A table of rates, means, or percentages	Not always a full test	First calculate the processed value, then decide whether a statistical test is needed.
A supplied p-value or critical value	Interpretation before arithmetic	Say reject or fail to reject the null hypothesis, then connect the decision to the biological context.

For Paper 4, the scoring habit is diagnosis first: data type, null hypothesis, comparison, conclusion, biological meaning.

2 Choosing the right test

Test	Data type	Typical H2 Biology context
Chi-squared ( $\chi^2$ )	Count or frequency data, comparing observed vs expected	Genetics ratios (9:3:3:1, 3:1), ecology species counts across two habitats
Independent samples t-test	Measurement data, comparing two group means	Mean leaf length in sun vs shade, mean enzyme rate at pH 6 vs pH 8, ecology fieldwork measurement comparisons
Paired t-test	Measurement data, same subject measured twice	Before-after enzyme inhibitor experiments (use this instead of independent t-test when the same sample is measured under both conditions)

The decision rule: if your data are counts of outcomes in categories, use chi-squared. If your data are measured values and you want to compare the average of two groups, use the t-test.

Paper 4 statistics tuition checkpoint

Statistics questions become easier when the student separates the test choice from the conclusion wording. Do not calculate first. Decide what kind of data you have, state the null hypothesis, then write the conclusion only after comparing the calculated value with the critical value.

Repeated Paper 4 statistics issue	Drill to use	Evidence of progress
The student chooses the wrong test.	Sort ten prompts into counts, measurements, and paired measurements before doing any arithmetic.	The test name is correct before the formula appears.
The null hypothesis is vague.	Write the null hypothesis with the exact variable and groups.	The sentence says "no significant difference" and names what is being compared.
The conclusion skips the critical-value comparison.	Use the same four-part sentence every time: calculated value, critical value, decision, biological conclusion.	The answer includes reject or fail to reject the null hypothesis.
The biology link is missing.	Add one sentence that connects the significant or non-significant result to the investigation context.	The conclusion explains the leaf, enzyme, ecology, or genetics meaning of the result.

For JC1-JC2 students who need this statistics routine connected to Paper 4 planning, data handling, and the rest of 9477 revision, use H2 Biology tuition Singapore as the main programme route.

3 Chi-squared test: worked example

Context

A genetics experiment crosses two heterozygous parents $AaBb \times AaBb$ and gives an expected 9:3:3:1 phenotype ratio. You observe: 120 round yellow, 38 round green, 36 wrinkled yellow, 6 wrinkled green, from 200 offspring total.

Step 1: State hypotheses

$H_0$ : There is no significant difference between the observed and expected phenotype frequencies. The ratio conforms to a 9:3:3:1 Mendelian expectation.

$H_1$ : There is a significant difference between observed and expected frequencies.

Step 2: Calculate expected values

For a 9:3:3:1 ratio from 200 offspring: $E = \frac{total \times ratio\; share}{16}$

Phenotype	Observed ( $O$ )	Expected ( $E$ )	$O - E$	$(O - E)^2 / E$
Round yellow (9/16)	120	112.5	+7.5	0.50
Round green (3/16)	38	37.5	+0.5	0.01
Wrinkled yellow (3/16)	36	37.5	-1.5	0.06
Wrinkled green (1/16)	6	12.5	-6.5	3.38
Total	200	200		3.95

Step 3: Calculate chi-squared

$\chi^2 = \sum \frac{(O - E)^2}{E} = 3.95$

Step 4: Degrees of freedom

$df = (\text{number of categories}) - 1 = 4 - 1 = 3$

Step 5: Compare to critical value

At $df = 3$ and $p = 0.05$ , the critical value from standard chi-squared tables is 7.815.

Since $\chi^2 = 3.95 \lt 7.815$ , the result is not significant at the 5% level.

Step 6: Write the conclusion

The calculated $\chi^2$ value (3.95) is less than the critical value (7.815) at 3 degrees of freedom ( $p = 0.05$ ). We fail to reject the null hypothesis. There is no significant difference between the observed and expected phenotype frequencies. The data are consistent with a 9:3:3:1 Mendelian ratio.

4 Independent samples t-test: worked example

Context

A student measures the maximum leaf width (mm) of ivy plants growing in two habitats: a shaded woodland understory and an open sunny grassland. She wants to know whether there is a significant difference in mean leaf width between the two habitats.

Shaded leaves (mm)	Sunny leaves (mm)
52	38
48	41
55	35
50	44
54	39
47	37
53	42

Step 1: State hypotheses

$H_0$ : There is no significant difference in mean leaf width between shaded and sunny habitats ( $\mu_{shaded} = \mu_{sunny}$ ).

$H_1$ : There is a significant difference in mean leaf width between shaded and sunny habitats ( $\mu_{shaded} \neq \mu_{sunny}$ ). This is a two-tailed test because you have no prior reason to predict which direction the difference will go.

Step 2: Calculate means and standard deviations

Shaded group ( $n_1 = 7$ ):

$\bar{x}_1 = \frac{52 + 48 + 55 + 50 + 54 + 47 + 53}{7} = \frac{359}{7} = 51.3 \text{ mm}$

Sunny group ( $n_2 = 7$ ):

$\bar{x}_2 = \frac{38 + 41 + 35 + 44 + 39 + 37 + 42}{7} = \frac{276}{7} = 39.4 \text{ mm}$

For the standard deviations, calculate the variance of each group and take the square root. Working shown below:

Shaded deviations from mean (51.3): $+0.7, -3.3, +3.7, -1.3, +2.7, -4.3, +1.7$

Squared deviations: $0.49, 10.89, 13.69, 1.69, 7.29, 18.49, 2.89$

Sum of squared deviations: $55.43$

$s_1^2 = \frac{55.43}{7 - 1} = 9.24 \quad \Rightarrow \quad s_1 = 3.04 \text{ mm}$

Sunny deviations from mean (39.4): $-1.4, +1.6, -4.4, +4.6, -0.4, -2.4, +2.6$

Squared deviations: $1.96, 2.56, 19.36, 21.16, 0.16, 5.76, 6.76$

Sum of squared deviations: $57.72$

$s_2^2 = \frac{57.72}{7 - 1} = 9.62 \quad \Rightarrow \quad s_2 = 3.10 \text{ mm}$

Step 3: Calculate the pooled standard deviation

When group sizes are equal or similar and variances are similar, use the pooled standard deviation formula:

$s_p = \sqrt{\frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}} = \sqrt{\frac{(6)(9.24) + (6)(9.62)}{12}} = \sqrt{\frac{55.44 + 57.72}{12}} = \sqrt{9.43} = 3.07 \text{ mm}$

Step 4: Calculate t

$t = \frac{\bar{x}_1 - \bar{x}_2}{s_p \sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}} = \frac{51.3 - 39.4}{3.07 \times \sqrt{\dfrac{1}{7} + \dfrac{1}{7}}} = \frac{11.9}{3.07 \times 0.535} = \frac{11.9}{1.64} = 7.26$

Step 5: Degrees of freedom

$df = n_1 + n_2 - 2 = 7 + 7 - 2 = 12$

Step 6: Compare to critical value

For a two-tailed test at $p = 0.05$ with $df = 12$ , the critical t-value from standard tables is 2.179.

Since $t = 7.26 \gt 2.179$ , the result is significant.

Step 7: Write the conclusion

The calculated t-value (7.26) exceeds the critical value (2.179) for $df = 12$ at the 5% significance level (two-tailed). We reject the null hypothesis. There is a significant difference in mean leaf width between shaded and sunny habitats ( $p \lt 0.05$ ). Shaded leaves have a significantly greater mean width (51.3 mm) than sunny leaves (39.4 mm), which is consistent with the expectation that plants in low-light environments develop larger leaf laminae to maximise light capture.

5 Interpreting the p-value

The p-value is the probability of observing a difference this large (or larger) by chance if the null hypothesis were true.

$p \lt 0.05$ : the result is statistically significant at the 5% level. Reject $H_0$ .
$p \geq 0.05$ : the result is not statistically significant at the 5% level. Fail to reject $H_0$ .

"Fail to reject" does not mean "accept" or "prove $H_0$ ." It means the data do not provide enough evidence to reject it.

The 5% level is the default in H2 Biology unless the question specifies otherwise (some questions specify 1% or 10%). Always state the level explicitly in your conclusion.

6 Writing conclusions: the examiner-ready phrasing

These sentence structures earn marks. Learn them exactly.

For chi-squared (result is significant)

The calculated $\chi^2$ value ([calculated value]) exceeds the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level. We reject the null hypothesis. There is a significant difference between the observed and expected [frequencies/ratios]. The [describe the pattern] is not consistent with a [expected model].

For chi-squared (result is not significant)

The calculated $\chi^2$ value ([calculated value]) is less than the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level. We fail to reject the null hypothesis. There is no significant difference between the observed and expected [frequencies/ratios]. The data are consistent with [expected model].

For t-test (result is significant)

The calculated t-value ([calculated value]) exceeds the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level (two-tailed). We reject the null hypothesis ( $p \lt 0.05$ ). There is a significant difference in mean [variable] between [group A] and [group B]. [Group A] has a significantly [greater/smaller] mean [variable] than [group B].

For t-test (result is not significant)

The calculated t-value ([calculated value]) is less than the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level (two-tailed). We fail to reject the null hypothesis ( $p \geq 0.05$ ). There is no significant difference in mean [variable] between [group A] and [group B].

7 Common mistakes

Stating "the t-test is significant" without specifying the level. A result is only significant at a stated confidence level. You must write "significant at the 5% level" or "significant at $p \lt 0.05$ ." Without the level, the conclusion is incomplete.

Confusing one-tailed and two-tailed tests. Use a two-tailed test when you have no prediction about the direction of the difference. Use a one-tailed test only when you have a clear biological reason to predict which group will have the higher mean, and you state this in $H_1$ . In most Paper 4 scenarios, use the two-tailed critical value unless instructed otherwise.

Not stating the null hypothesis before calculating. Examiners award marks for the hypothesis statement independently of the calculation. Write $H_0$ and $H_1$ first, before any numbers.

Writing "the hypothesis is proved." Statistical tests never prove a hypothesis. They assess evidence. The conclusion is always about evidence supporting or not supporting $H_0$ .

Using "accept $H_0$ " instead of "fail to reject $H_0$ ." These are not equivalent. Failing to reject $H_0$ means the data are consistent with it. Accepting $H_0$ implies you have proved it true, which you cannot do from a finite sample.

Applying t-test to categorical count data. Chi-squared is for counts of things in categories (observed vs expected frequencies). T-test is for measured values where you are comparing means. Using t-test on count data gives meaningless results.

8 When not to use the independent samples t-test

Use a paired t-test instead when the same subjects are measured under two conditions (before and after a treatment, same plant measured in two seasons). The paired t-test accounts for within-subject variation and is more powerful in these designs.

Use chi-squared instead when your data are counts or frequencies of categorical outcomes, such as phenotype ratios or species presence/absence.

Use Mann-Whitney instead when you cannot assume the data are normally distributed (very small samples, obviously skewed distributions, or ordinal data such as rated scores). Mann-Whitney is the non-parametric equivalent of the independent t-test.

In H2 Biology, if a question provides a small dataset (fewer than 6 values per group) or clearly non-normal data, be cautious about applying t-test and note the assumption of normality as a limitation.

9 Summary: decision tree

Is your data in counts per category? Use chi-squared.
Are you comparing two means? Continue.
Are the two groups the same subjects measured twice? Use paired t-test.
Are the groups independent and data approximately normal? Use independent t-test.
Are the groups independent but data non-normal or very small? Use Mann-Whitney.

Links and next steps

To see chi-squared applied in the ecology fieldwork context with sampling tables: H2 Biology Ecology Fieldwork Practical Guide
For the full Paper 4 lab sequence and assessment structure: H2 Biology Practical Guide for 9477 Paper 4
For the H2 Biology practical hub: H2 Biology practicals, labs, and experiments
For H2 Maths hypothesis testing context (z-test for population means, critical regions): H2 Maths Notes: Hypothesis Testing

Running a centre without lab facilities?
We partner with private schools and homeschool centres to provide fully equipped labs, trained supervisors, and SEAB-aligned practical programmes. Learn more

References

[1] SEAB. (2024). Biology (Syllabus 9477) GCE A-Level 2026 (first year of examination 2026). Singapore Examinations and Assessment Board. https://isomer-user-content.by.gov.sg/334/e3653fcb-bd44-4c59-bed1-48fc3cb47a34/9477_y26_sy.pdf

[2] Campbell, N. A. et al. (2020). Campbell Biology (12th ed.). Pearson - statistical analysis in biological investigations.

[3] Fowler, J., Cohen, L. and Jarvis, P. (1998). Practical Statistics for Field Biology (2nd ed.). Wiley - t-test and chi-squared applications in ecological data analysis.

Reviewed by

Ezekiel Tan·Academic Advisor (Biology)

Practical course completion-record note

For practical, lab, and experiment courses, Eclat Institute maintains centre-held attendance records and may also issue an internal attendance or completion document based on participation and internal assessment.

For SEAB private-candidate declarations, the key evidence is the centre's attendance or completion record, not a government-issued certificate.
This is an internal centre-issued certificate, not an MOE/SEAB qualification or accreditation.
Recognition (if any) is determined by the receiving school, institution, or employer.
For SEAB private candidates taking science practical papers, SEAB states you should either have taken the subject before or attend a practical course and complete it before the practical paper date.

View our sample completion document (Current sample layout (design may be refined over time))

Sources

SEAB Biology (Syllabus 9477) GCE A-Level 2026

Statistical Tests in H2 Biology Paper 4: chi-squared, t-test, and How to Interpret Both

1 Syllabus context: what the 9477 paper actually requires

Paper 4 statistics source checkpoint

2 Choosing the right test

Paper 4 statistics tuition checkpoint

3 Chi-squared test: worked example

Context

Step 1: State hypotheses

Step 2: Calculate expected values

Step 3: Calculate chi-squared

Step 4: Degrees of freedom

Step 5: Compare to critical value

Step 6: Write the conclusion

4 Independent samples t-test: worked example

Context

Step 1: State hypotheses

Step 2: Calculate means and standard deviations

Step 3: Calculate the pooled standard deviation

Step 4: Calculate t

Step 5: Degrees of freedom

Step 6: Compare to critical value

Step 7: Write the conclusion

5 Interpreting the p-value

6 Writing conclusions: the examiner-ready phrasing

For chi-squared (result is significant)

For chi-squared (result is not significant)

For t-test (result is significant)

For t-test (result is not significant)

7 Common mistakes

8 When not to use the independent samples t-test

9 Summary: decision tree

Links and next steps

References

Sources

Timings

Henderson Practical Lab

Jurong East Centre (Vision Exchange)

Pricing

Location

Related Posts