r Datasets for Statistical Analysis: Generalized Linear Models
Dept of Maths and Computing Datasets for Statistical Analysis USQ Homepage

Datasets for Statistical Analysis: One- and two-sample data

The entries on this page have been classified as follows:

One-sample data


Two-sample data

Two independent samples

  • Neural tube defects
    The number of neural tube defects per 10 000 births for England and Wales for every two years between 1977 and 1985 inclusive.
  • Broad bean plants
    Can be used for a two-sample t-test, but plot the data first!
  • Estimating lengths
    Estimates of the length of a room in metres and in feet (unpaired).
  • Whisky prices
    Whisky prices in 1991 in the USA for 16 monopoly states and 26 private-ownership states
  • Salaries in two occupations
    The salaries from 72 randomly chosen jobs ads in The Guardian on April 6, 1992 in pounds sterling.
  • Tomatos yields
    The yield of tomatoes using two different fertilizers. A good small example.
  • Height and weight of eleven years old girls
    The heights and weights of eleven year old girls;
  • Caesarean in Canadian hospitals
    The data give the number of births in Ontario, Canada, from 1982/83 to 1991/92, and the number of caesarean section in the same years. The data can be used for a two-sample test of proportions for two years.

Two paired samples

  • Hardness of bamboo flooring
    A small experiment (5 samples; two reps) to measure the hardness of bamboo flooring
  • Bouncing squash balls
    Squash balls are bounced at room temperature and after being submerged in boiling water
  • Fingers ridges on identical twins
    The number of fingers ridges on identical twins. The within-sibling variation is small, while the between sibling variation is much larger.
  • Statures of brother and sister
    The heights of 11 pairs of brothers and sisters.
  • Brisbane Lions AFL club
    The data give the weights of the eleven Brisbane Bears (an Australian Rules Football club)) players who played in the 1995 finals series and also the 2002 premiership flag team.
  • Hospital admissions and discharges
    The admissions are uniform; the discharges show people leaving on Friday, and fewer on weekends.
  • Toowoomba fuel prices
    The monthly average price of unleaded and LRP fuel in Toowoomba, Qld, Australia. A paired test shows definite differences, but a non-paired test does not.
  • SMS speed
    The speed at which people can SMS a message; paired (within age groups, using own and control phones) and non-paired tests (teens versus over 30s) are possible.
Also see the simple linear regression data sets.


Constructed by Peter Dunn
Last change: 07 June 2007