Statistical Tests in H2 Biology Paper 4: chi-squared, t-test, and How to Interpret Both
Q: What does this guide cover?
A: It explains which statistical tests appear in H2 Biology Paper 4, when to apply chi-squared versus a t-test, how to carry out each one with numbers, and how to write conclusions in the language examiners award marks for.
TL;DR
The SEAB 9477 syllabus names both t-tests and chi-squared tests in the mathematical requirements. Know both. Apply chi-squared to frequency or count data, and apply the t-test when comparing two sample means from measurement data. Detailed computation may be tested on Papers 1-3, while Paper 4 may test whether you understand what the tests mean in an investigation. State your null hypothesis before doing any calculation, interpret the p-value at the 5% level, and phrase your conclusion using "reject" or "fail to reject the null hypothesis."
For the full Paper 4 lab sequence, use the H2 Biology practical guide for 9477 Paper 4 alongside this page.
Status: SEAB's current H2 Biology (9477) syllabus PDF is labelled for 2026. Its mathematical requirements include rates, graphical presentation, levels of significance, standard deviation, probability, t-tests, and chi-squared tests. The syllabus states that equations, symbol meanings, a t-table, and a chi-squared table are provided where needed; detailed computation is not required on Paper 4, but understanding of the tests may be tested there.
1 Syllabus context: what the 9477 paper actually requires
The 9477 syllabus places statistics inside its mathematical requirements. Candidates may need to present data graphically, calculate rates, appreciate levels of significance and standard deviation, and use both t-tests and chi-squared tests. The syllabus also states that candidates are given access to the equations, symbol meanings, a t-table, and a chi-squared table when calculation is required. [1]
For Paper 4, the key distinction is scope. Detailed t-test or chi-squared computation is not required on Paper 4, but understanding what the tests mean may be tested. A practical answer can still require the student to decide whether a result supports a conclusion, whether two means differ significantly, or whether observed counts fit an expected pattern.
This guide covers both tests so that the calculation routine and the Paper 4 interpretation routine do not get mixed up.
Paper 4 statistics source checkpoint
Use this checkpoint when a practical question gives statistical output or asks whether data supports a conclusion.
| If the data are... | Use this test logic | Paper 4 answer move |
|---|---|---|
| Measured values from two groups | t-test | State the null hypothesis, compare calculated and critical values if provided, then say whether the mean difference is significant. |
| Counts or frequencies in categories | Chi-squared test | Compare observed and expected values, then decide whether the difference from expectation is significant. |
| A table of rates, means, or percentages | Not always a full test | First calculate the processed value, then decide whether a statistical test is needed. |
| A supplied p-value or critical value | Interpretation before arithmetic | Say reject or fail to reject the null hypothesis, then connect the decision to the biological context. |
For Paper 4, the scoring habit is diagnosis first: data type, null hypothesis, comparison, conclusion, biological meaning.
2 Choosing the right test
| Test | Data type | Typical H2 Biology context |
|---|---|---|
| Chi-squared () | Count or frequency data, comparing observed vs expected | Genetics ratios (9:3:3:1, 3:1), ecology species counts across two habitats |
| Independent samples t-test | Measurement data, comparing two group means | Mean leaf length in sun vs shade, mean enzyme rate at pH 6 vs pH 8, ecology fieldwork measurement comparisons |
| Paired t-test | Measurement data, same subject measured twice | Before-after enzyme inhibitor experiments (use this instead of independent t-test when the same sample is measured under both conditions) |
The decision rule: if your data are counts of outcomes in categories, use chi-squared. If your data are measured values and you want to compare the average of two groups, use the t-test.
Paper 4 statistics tuition checkpoint
Statistics questions become easier when the student separates the test choice from the conclusion wording. Do not calculate first. Decide what kind of data you have, state the null hypothesis, then write the conclusion only after comparing the calculated value with the critical value.
| Repeated Paper 4 statistics issue | Drill to use | Evidence of progress |
|---|---|---|
| The student chooses the wrong test. | Sort ten prompts into counts, measurements, and paired measurements before doing any arithmetic. | The test name is correct before the formula appears. |
| The null hypothesis is vague. | Write the null hypothesis with the exact variable and groups. | The sentence says "no significant difference" and names what is being compared. |
| The conclusion skips the critical-value comparison. | Use the same four-part sentence every time: calculated value, critical value, decision, biological conclusion. | The answer includes reject or fail to reject the null hypothesis. |
| The biology link is missing. | Add one sentence that connects the significant or non-significant result to the investigation context. | The conclusion explains the leaf, enzyme, ecology, or genetics meaning of the result. |
For JC1-JC2 students who need this statistics routine connected to Paper 4 planning, data handling, and the rest of 9477 revision, use H2 Biology tuition Singapore as the main programme route.
3 Chi-squared test: worked example
Context
A genetics experiment crosses two heterozygous parents and gives an expected 9:3:3:1 phenotype ratio. You observe: 120 round yellow, 38 round green, 36 wrinkled yellow, 6 wrinkled green, from 200 offspring total.
Step 1: State hypotheses
: There is no significant difference between the observed and expected phenotype frequencies. The ratio conforms to a 9:3:3:1 Mendelian expectation.
: There is a significant difference between observed and expected frequencies.
Step 2: Calculate expected values
For a 9:3:3:1 ratio from 200 offspring:
| Phenotype | Observed () | Expected () | ||
|---|---|---|---|---|
| Round yellow (9/16) | 120 | 112.5 | +7.5 | 0.50 |
| Round green (3/16) | 38 | 37.5 | +0.5 | 0.01 |
| Wrinkled yellow (3/16) | 36 | 37.5 | -1.5 | 0.06 |
| Wrinkled green (1/16) | 6 | 12.5 | -6.5 | 3.38 |
| Total | 200 | 200 | 3.95 |
Step 3: Calculate chi-squared
Step 4: Degrees of freedom
Step 5: Compare to critical value
At and , the critical value from standard chi-squared tables is 7.815.
Since , the result is not significant at the 5% level.
Step 6: Write the conclusion
The calculated value (3.95) is less than the critical value (7.815) at 3 degrees of freedom (). We fail to reject the null hypothesis. There is no significant difference between the observed and expected phenotype frequencies. The data are consistent with a 9:3:3:1 Mendelian ratio.
4 Independent samples t-test: worked example
Context
A student measures the maximum leaf width (mm) of ivy plants growing in two habitats: a shaded woodland understory and an open sunny grassland. She wants to know whether there is a significant difference in mean leaf width between the two habitats.
| Shaded leaves (mm) | Sunny leaves (mm) |
|---|---|
| 52 | 38 |
| 48 | 41 |
| 55 | 35 |
| 50 | 44 |
| 54 | 39 |
| 47 | 37 |
| 53 | 42 |
Step 1: State hypotheses
: There is no significant difference in mean leaf width between shaded and sunny habitats ().
: There is a significant difference in mean leaf width between shaded and sunny habitats (). This is a two-tailed test because you have no prior reason to predict which direction the difference will go.
Step 2: Calculate means and standard deviations
Shaded group ():
Sunny group ():
For the standard deviations, calculate the variance of each group and take the square root. Working shown below:
Shaded deviations from mean (51.3):
Squared deviations: $0.49, 10.89, 13.69, 1.69, 7.29, 18.49, 2.89$
Sum of squared deviations: $55.43$
Sunny deviations from mean (39.4):
Squared deviations: $1.96, 2.56, 19.36, 21.16, 0.16, 5.76, 6.76$
Sum of squared deviations: $57.72$
Step 3: Calculate the pooled standard deviation
When group sizes are equal or similar and variances are similar, use the pooled standard deviation formula:
Step 4: Calculate t
Step 5: Degrees of freedom
Step 6: Compare to critical value
For a two-tailed test at with , the critical t-value from standard tables is 2.179.
Since , the result is significant.
Step 7: Write the conclusion
The calculated t-value (7.26) exceeds the critical value (2.179) for at the 5% significance level (two-tailed). We reject the null hypothesis. There is a significant difference in mean leaf width between shaded and sunny habitats (). Shaded leaves have a significantly greater mean width (51.3 mm) than sunny leaves (39.4 mm), which is consistent with the expectation that plants in low-light environments develop larger leaf laminae to maximise light capture.
5 Interpreting the p-value
The p-value is the probability of observing a difference this large (or larger) by chance if the null hypothesis were true.
- : the result is statistically significant at the 5% level. Reject .
- : the result is not statistically significant at the 5% level. Fail to reject .
"Fail to reject" does not mean "accept" or "prove ." It means the data do not provide enough evidence to reject it.
The 5% level is the default in H2 Biology unless the question specifies otherwise (some questions specify 1% or 10%). Always state the level explicitly in your conclusion.
6 Writing conclusions: the examiner-ready phrasing
These sentence structures earn marks. Learn them exactly.
For chi-squared (result is significant)
The calculated value ([calculated value]) exceeds the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level. We reject the null hypothesis. There is a significant difference between the observed and expected [frequencies/ratios]. The [describe the pattern] is not consistent with a [expected model].
For chi-squared (result is not significant)
The calculated value ([calculated value]) is less than the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level. We fail to reject the null hypothesis. There is no significant difference between the observed and expected [frequencies/ratios]. The data are consistent with [expected model].
For t-test (result is significant)
The calculated t-value ([calculated value]) exceeds the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level (two-tailed). We reject the null hypothesis (). There is a significant difference in mean [variable] between [group A] and [group B]. [Group A] has a significantly [greater/smaller] mean [variable] than [group B].
For t-test (result is not significant)
The calculated t-value ([calculated value]) is less than the critical value ([critical value]) for [df] degrees of freedom at the 5% significance level (two-tailed). We fail to reject the null hypothesis (). There is no significant difference in mean [variable] between [group A] and [group B].
7 Common mistakes
Stating "the t-test is significant" without specifying the level. A result is only significant at a stated confidence level. You must write "significant at the 5% level" or "significant at ." Without the level, the conclusion is incomplete.
Confusing one-tailed and two-tailed tests. Use a two-tailed test when you have no prediction about the direction of the difference. Use a one-tailed test only when you have a clear biological reason to predict which group will have the higher mean, and you state this in . In most Paper 4 scenarios, use the two-tailed critical value unless instructed otherwise.
Not stating the null hypothesis before calculating. Examiners award marks for the hypothesis statement independently of the calculation. Write and first, before any numbers.
Writing "the hypothesis is proved." Statistical tests never prove a hypothesis. They assess evidence. The conclusion is always about evidence supporting or not supporting .
Using "accept " instead of "fail to reject ." These are not equivalent. Failing to reject means the data are consistent with it. Accepting implies you have proved it true, which you cannot do from a finite sample.
Applying t-test to categorical count data. Chi-squared is for counts of things in categories (observed vs expected frequencies). T-test is for measured values where you are comparing means. Using t-test on count data gives meaningless results.
8 When not to use the independent samples t-test
Use a paired t-test instead when the same subjects are measured under two conditions (before and after a treatment, same plant measured in two seasons). The paired t-test accounts for within-subject variation and is more powerful in these designs.
Use chi-squared instead when your data are counts or frequencies of categorical outcomes, such as phenotype ratios or species presence/absence.
Use Mann-Whitney instead when you cannot assume the data are normally distributed (very small samples, obviously skewed distributions, or ordinal data such as rated scores). Mann-Whitney is the non-parametric equivalent of the independent t-test.
In H2 Biology, if a question provides a small dataset (fewer than 6 values per group) or clearly non-normal data, be cautious about applying t-test and note the assumption of normality as a limitation.
9 Summary: decision tree
- Is your data in counts per category? Use chi-squared.
- Are you comparing two means? Continue.
- Are the two groups the same subjects measured twice? Use paired t-test.
- Are the groups independent and data approximately normal? Use independent t-test.
- Are the groups independent but data non-normal or very small? Use Mann-Whitney.
Links and next steps
- To see chi-squared applied in the ecology fieldwork context with sampling tables: H2 Biology Ecology Fieldwork Practical Guide
- For the full Paper 4 lab sequence and assessment structure: H2 Biology Practical Guide for 9477 Paper 4
- For the H2 Biology practical hub: H2 Biology practicals, labs, and experiments
- For H2 Maths hypothesis testing context (z-test for population means, critical regions): H2 Maths Notes: Hypothesis Testing
Running a centre without lab facilities?
We partner with private schools and homeschool centres to provide fully equipped labs, trained supervisors, and SEAB-aligned practical programmes. Learn more
References
[1] SEAB. (2024). Biology (Syllabus 9477) GCE A-Level 2026 (first year of examination 2026). Singapore Examinations and Assessment Board. https://www.seab.gov.sg/files/A%20Level%20Syllabus%20Sch%20Cddts/2026/9477_y26_sy.pdf
[2] Campbell, N. A. et al. (2020). Campbell Biology (12th ed.). Pearson - statistical analysis in biological investigations.
[3] Fowler, J., Cohen, L. and Jarvis, P. (1998). Practical Statistics for Field Biology (2nd ed.). Wiley - t-test and chi-squared applications in ecological data analysis.
Practical course completion-record note
For practical, lab, and experiment courses, Eclat Institute maintains centre-held attendance records and may also issue an internal attendance or completion document based on participation and internal assessment.
- For SEAB private-candidate declarations, the key evidence is the centre's attendance or completion record, not a government-issued certificate.
- This is an internal centre-issued certificate, not an MOE/SEAB qualification or accreditation.
- Recognition (if any) is determined by the receiving school, institution, or employer.
- For SEAB private candidates taking science practical papers, SEAB states you should either have taken the subject before or attend a practical course and complete it before the practical paper date.
View our sample completion document (Current sample layout (design may be refined over time))



