Tests for equivalence or non-inferiority for paired binary data

Stat Med. 2002 Jan 30;21(2):231-45. doi: 10.1002/sim.1012.

Abstract

Assessment of therapeutic equivalence or non-inferiority between two medical diagnostic procedures often involves comparisons of the response rates between paired binary endpoints. The commonly used and accepted approach to assessing equivalence is by comparing the asymptotic confidence interval on the difference of two response rates with some clinical meaningful equivalence limits. This paper investigates two asymptotic test statistics, a Wald-type (sample-based) test statistic and a restricted maximum likelihood estimation (RMLE-based) test statistic, to assess equivalence or non-inferiority based on paired binary endpoints. The sample size and power functions of the two tests are derived. The actual type I error and power of the two tests are computed by enumerating the exact probabilities in the rejection region. The results show that the RMLE-based test controls type I error better than the sample-based test. To establish an equivalence between two treatments with a symmetric equivalence limit of 0.15, a minimal sample size of 120 is needed. The RMLE-based test without the continuity correction performs well at the boundary point 0. A numerical example illustrates the proposed procedures.

Publication types

  • Comparative Study

MeSH terms

  • Biopsy
  • Diagnostic Imaging / standards
  • Diagnostic Tests, Routine / methods*
  • Histocytochemistry
  • Humans
  • Likelihood Functions
  • Liver Diseases / diagnosis
  • Magnetic Resonance Imaging
  • Numerical Analysis, Computer-Assisted*
  • Sample Size
  • Statistics as Topic / methods*
  • Therapeutic Equivalency
  • Tomography, X-Ray Computed