Statistical Tests in H2 Biology Paper 4: chi-squared, t-test, and How to Interpret Both
In one line
The SEAB 9477 syllabus explicitly requires chi-squared tests for genetics ratios and ecological sampling data.
Key points
- The t-test is not named in the syllabus, but it appears regularly in school-based Paper 4 investigations and some data-handling questions where examiners provide two sets of means and ask for a statistical comparison.
- Know both.
Want small-group support? Browse our A-Level Biology Tuition hub. Not sure which level to start with? Visit Biology Tuition Singapore.
Looking for the full lab practical series? Visit the H2 Biology Practicals (Labs & Practicals).
Practical course completion-record note
For practical, lab, and experiment courses, Eclat Institute maintains centre-held attendance records and may also issue an internal attendance or completion document based on participation and internal assessment.
- For SEAB private-candidate declarations, the key evidence is the centre's attendance or completion record, not a government-issued certificate.
- This is an internal centre-issued certificate, not an MOE/SEAB qualification or accreditation.
- Recognition (if any) is determined by the receiving school, institution, or employer.
- For SEAB private candidates taking science practical papers, SEAB states you should either have taken the subject before or attend a practical course and complete it before the practical paper date.
View our sample completion document (Current sample layout (design may be refined over time))
Planning a revision session? Use our study places near me map to find libraries, community study rooms, and late-night spots.
> **Q:** What does this guide cover?\
> **A:** It explains which statistical tests appear in H2 Biology Paper 4, when to apply chi-squared versus a t-test, how to carry out each one with numbers, and how to write conclusions in the language examiners award marks for.
> **TL;DR**\
> The SEAB 9477 syllabus explicitly requires chi-squared tests for genetics ratios and ecological sampling data. The t-test is not named in the syllabus, but it appears regularly in school-based Paper 4 investigations and some data-handling questions where examiners provide two sets of means and ask for a statistical comparison. Know both. Apply chi-squared to frequency or count data. Apply the t-test when comparing two sample means from measurement data. State your null hypothesis before doing any calculation, interpret the p-value at the 5% level, and phrase your conclusion using "reject" or "fail to reject the null hypothesis."
For the full Paper 4 lab sequence, use the [H2 Biology practical guide for 9477 Paper 4](https://eclatinstitute.sg/blog/h2-biology-experiments/H2-Biology-Practical-2026-Lab-Mastery-Guide) alongside this page.
_Status:_ SEAB's current H2 Biology (9477) syllabus PDF is labelled for 2026. Chi-squared is explicitly listed in the mathematical requirements. T-test appears in school practical contexts and data-handling questions but is not named in the published syllabus; it is included here as preparation for investigations where examiners supply a t-value or ask you to compare means.
| If you have... | Read this first |
| --- | --- |
| 1 second | Use chi-squared for counts and t-tests for comparing means. |
| 10 seconds | Check data type, null hypothesis, observed value, expected value, degrees of freedom, critical value, p-value, 5% level, reject, fail to reject, and conclusion wording. |
| 100 seconds | The key is not memorising formulas. First identify whether the data are category counts or measured values, then write the conclusion in hypothesis language. |
| Concrete example | Leaf width in shade versus sun uses a t-test because you compare two sample means from measurement data. |
| Best next step | Before calculating, write the null hypothesis and name the test. |
---
## 1 Syllabus context: what the 9477 paper actually requires
The 9477 syllabus specifies that candidates should be able to "present data graphically, calculate rates, and apply chi-squared tests to results from genetics or ecological sampling." [1]
Chi-squared is the only named statistical test in the published requirements. However, Paper 4 investigations and data-handling questions have presented scenarios where students are expected to recognise a t-test result and interpret it correctly, particularly in ecology fieldwork comparisons and enzyme rate comparisons between treatment groups. School-based practicals and past question styles from analogous A-Level biology syllabuses treat the t-test as part of the expected quantitative toolkit.
This guide covers both. You will not be surprised by either in an exam or internal assessment.
---
## 2 Choosing the right test
| Test | Data type | Typical H2 Biology context |
| ---- | --------- | -------------------------- |
| Chi-squared ($\chi^2$) | Count or frequency data, comparing observed vs expected | Genetics ratios (9:3:3:1, 3:1), ecology species counts across two habitats |
| Independent samples t-test | Measurement data, comparing two group means | Mean leaf length in sun vs shade, mean enzyme rate at pH 6 vs pH 8, ecology fieldwork measurement comparisons |
| Paired t-test | Measurement data, same subject measured twice | Before-after enzyme inhibitor experiments (use this instead of independent t-test when the same sample is measured under both conditions) |
The decision rule: if your data are counts of outcomes in categories, use chi-squared. If your data are measured values and you want to compare the average of two groups, use the t-test.
---
## 3 Chi-squared test: worked example
### Context
A genetics experiment crosses two heterozygous parents (AaBb x AaBb) and gives an expected 9:3:3:1 phenotype ratio. You observe: 120 round yellow, 38 round green, 36 wrinkled yellow, 6 wrinkled green, from 200 offspring total.
### Step 1: State hypotheses
$H_0$: There is no significant difference between the observed and expected phenotype frequencies. The ratio conforms to a 9:3:3:1 Mendelian expectation.
$H_1$: There is a significant difference between observed and expected frequencies.
### Step 2: Calculate expected values
For a 9:3:3:1 ratio from 200 offspring: $E = \frac{total \times ratio\; share}{16}$
| Phenotype | Observed ($O$) | Expected ($E$) | $O - E$ | $(O - E)^2 / E$ |
| --------- | ------------- | -------------- | ------- | --------------- |
| Round yellow (9/16) | 120 | 112.5 | +7.5 | 0.50 |
| Round green (3/16) | 38 | 37.5 | +0.5 | 0.01 |
| Wrinkled yellow (3/16) | 36 | 37.5 | -1.5 | 0.06 |
| Wrinkled green (1/16) | 6 | 12.5 | -6.5 | 3.38 |
| **Total** | **200** | **200** | | **3.95** |
### Step 3: Calculate chi-squared
$$\chi^2 = \sum \frac{(O - E)^2}{E} = 3.95$$
### Step 4: Degrees of freedom
$$df = (\text{number of categories}) - 1 = 4 - 1 = 3$$
### Step 5: Compare to critical value
At $df = 3$ and $p = 0.05$, the critical value from standard chi-squared tables is 7.815.
Since $\chi^2 = 3.95 \lt 7.815$, the result is not significant at the 5% level.
### Step 6: Write the conclusion
> The calculated $\chi^2$ value (3.95) is less than the critical value (7.815) at 3 degrees of freedom ($p = 0.05$). We fail to reject the null hypothesis. There is no significant difference between the observed and expected phenotype frequencies. The data are consistent with a 9:3:3:1 Mendelian ratio.
---
## 4 Independent samples t-test: worked example
### Context
A student measures the maximum leaf width (mm) of ivy plants growing in two habitats: a shaded woodland understory and an open sunny grassland. She wants to know whether there is a significant difference in mean leaf width between the two habitats.
| Shaded leaves (mm) | Sunny leaves (mm) |
| ------------------ | ----------------- |
| 52 | 38 |
| 48 | 41 |
| 55 | 35 |
| 50 | 44 |
| 54 | 39 |
| 47 | 37 |
| 53 | 42 |
### Step 1: State hypotheses
$H_0$: There is no significant difference in mean leaf width between shaded and sunny habitats ($\mu_{shaded} = \mu_{sunny}$).
$H_1$: There is a significant difference in mean leaf width between shaded and sunny habitats ($\mu_{shaded} \neq \mu_{sunny}$). This is a two-tailed test because you have no prior reason to predict which direction the difference will go.
### Step 2: Calculate means and standard deviations
**Shaded group** ($n_1 = 7$):
$$\bar{x}_1 = \frac{52 + 48 + 55 + 50 + 54 + 47 + 53}{7} = \frac{359}{7} = 51.3 \text{ mm}$$
**Sunny group** ($n_2 = 7$):
$$\bar{x}_2 = \frac{38 + 41 + 35 + 44 + 39 + 37 + 42}{7} = \frac{276}{7} = 39.4 \text{ mm}$$
For the standard deviations, calculate the variance of each group and take the square root. Working shown below:
Shaded deviations from mean (51.3): $+0.7, -3.3, +3.7, -1.3, +2.7, -4.3, +1.7$
Squared deviations: $0.49, 10.89, 13.69, 1.69, 7.29, 18.49, 2.89$
Sum of squared deviations: $55.43$
$$s_1^2 = \frac{55.43}{7 - 1} = 9.24 \quad \Rightarrow \quad s_1 = 3.04 \text{ mm}$$
Sunny deviations from mean (39.4): $-1.4, +1.6, -4.4, +4.6, -0.4, -2.4, +2.6$
Squared deviations: $1.96, 2.56, 19.36, 21.16, 0.16, 5.76, 6.76$
Sum of squared deviations: $57.72$
$$s_2^2 = \frac{57.72}{7 - 1} = 9.62 \quad \Rightarrow \quad s_2 = 3.10 \text{ mm}$$
### Step 3: Calculate the pooled standard deviation
When group sizes are equal or similar and variances are similar, use the pooled standard deviation formula:
$$s_p = \sqrt{\frac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}} = \sqrt{\frac{(6)(9.24) + (6)(9.62)}{12}} = \sqrt{\frac{55.44 + 57.72}{12}} = \sqrt{9.43} = 3.07 \text{ mm}$$
### Step 4: Calculate t
$$t = \frac{\bar{x}_1 - \bar{x}_2}{s_p \sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}} = \frac{51.3 - 39.4}{3.07 \times \sqrt{\dfrac{1}{7} + \dfrac{1}{7}}} = \frac{11.9}{3.07 \times 0.535} = \frac{11.9}{1.64} = 7.26$$
### Step 5: Degrees of freedom
$$df = n_1 + n_2 - 2 = 7 + 7 - 2 = 12$$
### Step 6: Compare to critical value
For a two-tailed test at $p = 0.05$ with $df = 12$, the critical t-value from standard tables is 2.179.
Since $t = 7.26 \gt 2.179$, the result is significant.
### Step 7: Write the conclusion
> The calculated t-value (7.26) exceeds the critical value (2.179) for $df = 12$ at the 5% significance level (two-tailed). We reject the null hypothesis. There is a significant difference in mean leaf width between shaded and sunny habitats ($p \lt 0.05$). Shaded leaves have a significantly greater mean width (51.3 mm) than sunny leaves (39.4 mm), which is consistent with the expectation that plants in low-light environments develop larger leaf laminae to maximise light capture.
---
## 5 Interpreting the p-value
The p-value is the probability of observing a difference this large (or larger) by chance if the null hypothesis were true.
- $p \lt 0.05$: the result is statistically significant at the 5% level. Reject $H_0$.
- $p \geq 0.05$: the result is not statistically significant at the 5% level. Fail to reject $H_0$.
"Fail to reject" does not mean "accept" or "prove $H_0$." It means the data do not provide enough evidence to reject it.
The 5% level is the default in H2 Biology unless the question specifies otherwise (some questions specify 1% or 10%). Always state the level explicitly in your conclusion.
---
## 6 Writing conclusions: the examiner-ready phrasing
These sentence structures earn marks. Learn them exactly.
### For chi-squared (result is significant)
> The calculated $\chi^2$ value ([calculated value]) exceeds the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level. We reject the null hypothesis. There is a significant difference between the observed and expected [frequencies/ratios]. The [describe the pattern] is not consistent with a [expected model].
### For chi-squared (result is not significant)
> The calculated $\chi^2$ value ([calculated value]) is less than the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level. We fail to reject the null hypothesis. There is no significant difference between the observed and expected [frequencies/ratios]. The data are consistent with [expected model].
### For t-test (result is significant)
> The calculated t-value ([calculated value]) exceeds the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level (two-tailed). We reject the null hypothesis ($p \lt 0.05$). There is a significant difference in mean [variable] between [group A] and [group B]. [Group A] has a significantly [greater/smaller] mean [variable] than [group B].
### For t-test (result is not significant)
> The calculated t-value ([calculated value]) is less than the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level (two-tailed). We fail to reject the null hypothesis ($p \geq 0.05$). There is no significant difference in mean [variable] between [group A] and [group B].
---
## 7 Common mistakes
**Stating "the t-test is significant" without specifying the level.** A result is only significant at a stated confidence level. You must write "significant at the 5% level" or "significant at $p \lt 0.05$." Without the level, the conclusion is incomplete.
**Confusing one-tailed and two-tailed tests.** Use a two-tailed test when you have no prediction about the direction of the difference. Use a one-tailed test only when you have a clear biological reason to predict which group will have the higher mean, and you state this in $H_1$. In most Paper 4 scenarios, use the two-tailed critical value unless instructed otherwise.
**Not stating the null hypothesis before calculating.** Examiners award marks for the hypothesis statement independently of the calculation. Write $H_0$ and $H_1$ first, before any numbers.
**Writing "the hypothesis is proved."** Statistical tests never prove a hypothesis. They assess evidence. The conclusion is always about evidence supporting or not supporting $H_0$.
**Using "accept $H_0$" instead of "fail to reject $H_0$."** These are not equivalent. Failing to reject $H_0$ means the data are consistent with it. Accepting $H_0$ implies you have proved it true, which you cannot do from a finite sample.
**Applying t-test to categorical count data.** Chi-squared is for counts of things in categories (observed vs expected frequencies). T-test is for measured values where you are comparing means. Using t-test on count data gives meaningless results.
---
## 8 When not to use the independent samples t-test
**Use a paired t-test instead** when the same subjects are measured under two conditions (before and after a treatment, same plant measured in two seasons). The paired t-test accounts for within-subject variation and is more powerful in these designs.
**Use chi-squared instead** when your data are counts or frequencies of categorical outcomes, such as phenotype ratios or species presence/absence.
**Use Mann-Whitney instead** when you cannot assume the data are normally distributed (very small samples, obviously skewed distributions, or ordinal data such as rated scores). Mann-Whitney is the non-parametric equivalent of the independent t-test.
In H2 Biology, if a question provides a small dataset (fewer than 6 values per group) or clearly non-normal data, be cautious about applying t-test and note the assumption of normality as a limitation.
---
## 9 Summary: decision tree
1. Is your data in counts per category? Use **chi-squared**.
2. Are you comparing two means? Continue.
3. Are the two groups the same subjects measured twice? Use **paired t-test**.
4. Are the groups independent and data approximately normal? Use **independent t-test**.
5. Are the groups independent but data non-normal or very small? Use **Mann-Whitney**.
---
## Links and next steps
- To see chi-squared applied in the ecology fieldwork context with sampling tables: [H2 Biology Ecology Fieldwork Practical Guide](https://eclatinstitute.sg/blog/h2-biology-experiments/H2-Biology-Ecology-Fieldwork-Practical-Guide)
- For the full Paper 4 lab sequence and assessment structure: [H2 Biology Practical Guide for 9477 Paper 4](https://eclatinstitute.sg/blog/h2-biology-experiments/H2-Biology-Practical-2026-Lab-Mastery-Guide)
- For the H2 Biology practical hub: [H2 Biology practicals, labs, and experiments](https://eclatinstitute.sg/blog/h2-biology-experiments)
- For H2 Maths hypothesis testing context (z-test for population means, critical regions): [H2 Maths Notes: Hypothesis Testing](https://eclatinstitute.sg/blog/H2-Maths-Notes-6-5-Hypothesis-Testing)
---
> **Running a centre without lab facilities?**\
> We partner with private schools and homeschool centres to provide fully equipped labs, trained supervisors, and SEAB-aligned practical programmes. [Learn more](https://eclatinstitute.sg/blog/general/Science-Lab-Access-Private-Schools-Homeschool-Centres-Singapore)
## References
[1] SEAB. (2024). _Biology (Syllabus 9477) GCE A-Level 2026_ (first year of examination 2026). Singapore Examinations and Assessment Board. https://www.seab.gov.sg/files/A%20Level%20Syllabus%20Sch%20Cddts/2026/9477_y26_sy.pdf
[2] Campbell, N. A. et al. (2020). _Campbell Biology_ (12th ed.). Pearson - statistical analysis in biological investigations.
[3] Fowler, J., Cohen, L. and Jarvis, P. (1998). _Practical Statistics for Field Biology_ (2nd ed.). Wiley - t-test and chi-squared applications in ecological data analysis.




