Preface and Acknowledgement ix
About the Companion Website xi
1 Introduction 1
1.1 A Brief History of Multiple Testing 1
1.2 Outline of the Book 9
1.3 Summary 11
References 13
2 The Meaning of the False Discovery Rate (FDR) 15
2.1 True Hypothesis Versus Conclusion from Evidence: The Confusion Matrix 15
2.2 The Meaning of the p-Value 16
2.3 The Meaning of the FDR: Its Relationship to the Confusion Matrix and the p-Value 17
2.4 Control of the FDR While Minimising False-Negative Results: The Benjamini–Hochberg (BH) Criterion 19
2.5 Graphical Illustration of the Benjamini–Hochberg FDR Criterion 22
2.6 Use of the Q–Q Plot in Other Contexts 26
2.7 Alternatives to the BH Criterion 27
2.8 Consequences of Correlations Among the Hypotheses Tested 30
2.9 The FDR in a Non-Statistical Context: A Diagnostic Test 42
2.10 Summary 44
References 45
3 Graphical Presentation of the FDR 47
3.1 Presentation of the Q–Q Plot on the −log 10 (p) Scale 47
3.2 Association of the BH-FDR with Individual p-Values 48
3.3 Distinctive Plotting Symbols for Plotting of BH-FDR Values 50
3.4 Non-Monotonicity of the BH-FDR: Detection of Correlation Among p-Values from the −log 10 -Transformed Q–Q Plot 51
3.5 Summary 53
Reference 54
4 Application of the FDR to Multiple Hypothesis Testing in Real-World Data 55
4.1 Collation of Gene-Expression Data from the Plant-Genetics Model Organism Arabidopsis thaliana 55
4.2 Hypotheses Concerning Multiple Response Variables in the Analysis of a Balanced Experimental Design 59
4.3 Partitioning of Model Terms in a Balanced Experimental Design: Hypotheses Concerning Individual Terms 62
4.4 Comparison of the Results of Multiple Testing for Contrasting Subsets of Response Variables 66
4.5 Representation of the FDR on a Volcano Plot: Selection of Hypotheses for Further Investigation 71
4.6 Summary 74
References 76
5 Alternative Approaches to the Multiple-Testing Problem 79
5.1 An FDR Is Not a p-Value: The Formal Distinction 79
5.2 Retaining the p-Value Conceptual Basis: The Šidak and Bonferroni ‘Corrections’ 79
5.3 Multiple Testing of Pairwise Comparisons Among Groups of Samples 81
5.4 Repeated Testing in Interim Analyses Before Study Completion: Alpha Spending 95
5.5 Is Control of the Family-Wise Error Rate (FWER) a Desirable Goal? 99
5.6 Holm’s Method: A Generalisation of the Bonferroni Correction 106
5.7 Summary 113
References 115
6 The FDR in the Context of Bayesian Statistics 117
6.1 The Bayesian Interpretation of the BH-FDR 117
6.2 Numerical Equivalence Between a One-Sided p-Value and a Posterior Probability 119
6.3 Does the Bayesian Interpretation of a p-Value Offer a Solution to the Multiplicity Problem? 122
6.4 Numerical Equivalence of p-Value, Posterior Probability and BH-FDR: The Prosecutor’s Answer to the Accusation of Fallacy? 125
6.5 Summary 130
References 131
7 Alternative Specifications of the FDR 133
7.1 The Local and Non-Local FDR (LFDR and NFDR) 133
7.2 Direct Estimation of the LFDR 140
7.3 Estimation of the NFDR 142
7.4 Estimation of the LFDR from the NFDR: Re-Ranking Approach 144
7.5 Estimation of the LFDR from the NFDR: Power Parameter Approach 150
7.6 Review of Methods for Estimation of the LFDR 158
7.7 Summary 158
References 161
8 The FDR in Relation to an ‘Uninteresting’ Rather Than a Null Hypothesis 163
8.1 The Vulnerability of the FDR to Mis-Specification of the Statistical Model 163
8.2 ‘Uninteresting’ and ‘Interesting’ Distributions of Test Results Versus Distributions on H 0 and H 1 : A Defence Against Model Mis-Specification 164
8.3 An ‘Uninteresting’ Distribution to Account for Unrecognised Pseudoreplication: Fewer Discoveries Announced 165
8.4 Unrecognised Pseudoreplication: Results When Real Effects Are Present 177
8.5 An ‘Uninteresting’ Distribution to Account for Unrecognised Balanced-Block Effects: More Discoveries Announced 184
8.6 The Relative Merits of the Correct Model and an ‘Uninteresting’ Distribution as a Basis for Testing 193
8.7 Summary 197
References 200
9 Supplementation of p-Values with an Auxiliary Covariate: The Conditional FDR (cFDR) 201
9.1 Extension of the Relationship Between FDR and p to Take Account of an Additional Relevant Variable q 201
9.2 Method for the Evaluation of the cFDR from Data 204
9.3 Application of the cFDR to Non-Genetic Data 210
9.4 Summary 222
References 224
10 An FDR-Based Analogue of the Confidence Interval: The False Coverage Rate (FCR) 225
10.1 The Concept of the Coverage of a Confidence Interval 225
10.2 A Confidence Interval Based on the FDR-Determined Significance Threshold 231
10.3 Numerical Illustration of the FCR 233
10.4 Summary 236
References 237
11 The FDR as a Criterion for Sample Size Calculations 239
11.1 Review of the Standard Methods for Power and Sample Size Calculations 239
11.2 Connection of Statistical-Power-Related Concepts to FDR-Related Concepts 245
11.3 Sample Size Required to Achieve a Specified FDR 247
11.4 Significance Threshold (α) Required to Achieve a Specified FDR When Sample Size Is Fixed 251
11.5 Summary 254
References 255
Index 257