15 Permutation Tests

15.1 Group Comparisons

Consider comparing two indpendent groups. Example: comparing different bug sprays

  • Consider the null hypothesis that the distribution of the observations from each group is the same
  • Then, the group labels are irrelevant
  • Consider a data frame with counts in one column and spray label in another
  • Permute the spray (group) labels
  • Recalcuate the statistic
    • Mean difference in counts
    • Geomatric means
    • T-statistic
  • Calculate the percentage of simulations where the simulted statistic was mroe extreme (toward the alternative) than the observed. This will create a permutation-based p-value.

15.2 Variations on Permutation Testing

Data Type, Statistic, Test Name

  • Ranks, Rank Sum, Rank Sum Test
  • Binary, Hypergeometric Prob, Fisher’s exact test
  • Raw data, …, Ordinary Permutation Test

So-called Randomization tests are exactly oerutation tests, with different motivations.

  • For matched data, one can randomize the signs
    • For ranks, the results in the signed rank test
  • Permutation strategies work for regression as well
    • Permuting a regressor of interest
  • Permutation tests work very well in multivariate settings