Proofs of Main Results - Simultaneous Testing of Grouped Hypotheses: Finding Needles in Multipl

We shall prove here the main results, Theorems 1-5. The proofs of other results are given in Appendix II.

Proof of Theorem 1. Denote the group labels by ggg = (g₁, · · · , g_m), i.e., g_i = k if case i comes from group k. Suppose the group labels are unknown, we have

P (θi|xi) = X k P (θi|gi = k, xi)P (gi = k|xi) = X k (1 − pk)fk0(xi) fk(xi) πkfk(xi) P kπkfk(xi) = P π_Pk(1 − pk)fk0(xi) kπkfk(xi) ≡ PLfdr(xi).

Let R and N₁₀be the number of rejections and number of false positives of the PLfdr procedure. Note E(N10|xxx) =

P_m

i=1I(δi = 1)P (θi = 0|xxx) =

P_R

i=1PLfdr(i), we have

FDR_PLfdr = E (N10/R) P (R > 0) = E [(1/R)E(N10|xxx)] P (R > 0) = E " 1 R R X i=1 PLfdr_(i) # P (R > 0).

The claim follows by noting that the PLfdr procedure guarantees that (1/R)PR_i=1PLfdr_(i) ≤ α for all realizations of xxx.

Proof of Theorem 2. Let Rk and N10k be the number of rejections and the number of false

positives in group k. Note that E(N_10k|ggg, xxx) =Pmk_i=1I(δ_ki= 1)P (θ_ki = 0|x_ki) =PRk_i=1CLfdrk

(i),

and that the SLfdr procedure guarantees (1/Rk)

P_Rk

i=1CLfdrk(i) ≤ α for all realizations of xxx,

we have FDR_SLfdr = E µ P kN10k P kRk ¶ P (X k R_k> 0) = E ( 1 P kRk X k E(N10k|ggg, xxx) ) P (X k Rk > 0) ≤ E ( 1 P kRk (X k αRk) ) P (X k Rk> 0) ≤ α.

Proof of Theorem 3. Let R and N10 be the number of rejections and number of false

positives. Then E(N10|ggg, xxx) =

P_m

i=1I(δi = 1)P (θi = 0|xi, gi) =

P_R

i=1CLfdr(i). The SLfdr

procedure guarantees that (1/R)PR_i=1CLfdr_(i) ≤ α for all realizations of xxx, it follows that

FDR_CLfdr = E(N₁₀/R)P (R > 0) = E {E(N₁₀/R|ggg, xxx)} P (R > 0) ≤ E ( 1 R R X i=1 CLfdr_(i) ) P (R > 0) ≤ α.

Proof of Theorem 4. (i) In the CLfdr procedure, the group labels were only used for calculating the CLfdr statistics and then dropped afterwards. Hence when the interest is to evaluate global FDR and FNR levels of the CLfdr procedure, the group labels provide no information, i.e., let Ti = CLfdr(xi), then {Ti : i = 1, · · · , m} can be viewed as a random

sample from GOR, the cdf of the pooled sample. The null, non-null cdf’s of Ti are denoted by

GOR₀ and GOR₁ , respectively. Let tOR and ˆtOR be the thresholds of the oracle procedure and

CLfdr procedure, respectively. Note mFNROR= pP_{P (T}H1(Ti > tOR)

i> tOR)

and mFNR_CLfdr= pPH1(Ti > ˆtOR) P (Ti > ˆtOR)

, It is sufficient to show that ˆtOR−→ tp OR.

Let Q_OR(t) = (1 − p)G0(t)/G(t) and ˆQOR(t) = {

iI(Ti < t)Ti}/{

iI(Ti < t)}. Ap-

plying law of large numbers, we have (1/m)P_iI(Ti < t) −→ E(Tp i < t) = GOR(t) and

(1/m)Pm_i=1I(Ti < t)Ti −→ E{I(Tp i < t)Ti} = E[E{I(Ti < t)Ti|gi}] = πk(1 − pk)GORk0 (t) =

(1 − p)GOR

0 (t). It follows that ˆQOR(t)−→ Qp OR(t). Subsequent arguments are similar to the

proof of Lemma A.5 in Sun and Cai (2007). Note that ˆQ_OR(t) is a step function with jump at T_(i). For T_(k) < t < T_(k+1), we construct an envelope for ˆQOR(t) using two monotone

continuous functions: ˆ Q−_OR(t) = T(k+1)− t T_(k+1)− T_(k)QˆOR(T(k−1)) + t − T_(k) T_(k+1)− T_(k)QˆOR(T(k)); ˆ Q+_OR(t) = T(k+1)− t T_(k+1)− T_(k)QˆOR(T(k)) + t − T_(k) T_(k+1)− T_(k)QˆOR(T(k+1)).

increasing in t, and (iii) | ˆQ+_OR(t) − ˆQ−_OR(t)| ≤ 1/R(t) → 0. Note that ˆ−p QOR(t)−→ Qp OR(t), we

have ˆQ−_OR(t)−→ Qp OR(t) and ˆQ+OR(t) p

−

→ QOR(t).

Now define ˆt−_OR = sup{t ∈ (0, 1) : ˆQ−_OR(t) ≤ α} and ˆt+_OR = sup{t ∈ (0, 1) : ˆQ+_OR(t) ≤ α}; then ˆt+_OR ≤ ˆtOR ≤ ˆt−OR. We claim that ˆt−OR

−

→ tOR. If not, there exists ²0 and η0 such

that for any M > 0, P (|ˆt−_OR− tOR| > ²0) ≥ 4η0 holds for some Z+ 3 m ≥ M . Suppose

P (K_m1) = P (ˆt−_ORtOR > ²0) ≥ 2η0. The MRC implies that QOR(t) is strictly increasing in t

and QOR(tOR) = α. Let 2δ0 = QOR(tOR+ ²0) − α. ˆQOR(t) −→ Qp OR(t) implies that there

exists M such that P (K_m2) = P (| ˆQ−_OR(tOR+ ²0) − QOR(tOR+ ²0)| < δ0) ≥ 1 − η0 holds for

all m ≥ M . Consider Km = Km1 ∩ Km2, then there exists m ∈ Z+ such that P (Km) ≥ η0.

However, note ˆQ−_OR(t) is strictly increasing in t, on Km we must have α = ˆQ−_OR(ˆt−_OR) >

Q−_OR(tOR+ ²0) > QOR(tOR+ ²0) − η0 = α + δ0. Hence Km cannot have positive measure.

This is a contradiction. Therefore we must have ˆt−_OR −→ tp OR. Similarly we can show that

ˆt+

OR p

−

→ t_OR. Note ˆt+_OR≤ ˆt_OR≤ ˆt−_OR, it follows that ˆt_OR−→ tp _OR.

(ii) Since ˆtOR−→ tp OR, the group-wise FDR level yielded by the CLfdr procedure converges

in probability to the FDR level of the oracle procedure. Next, let 0 < λ < 1 be a threshold and Rk be the number of rejections in group k. Then (1/Rk)

P_Rk

i=1CLfdrk(i) = [

P_mk

i=1I(CLfdri >

λ)CLfdr_i]/[Pmk_i=1I(CLfdr_i > λ)]→ E[I(CLfdr−p _i < λ)CLfdr_i]/E[I(CLfdr_i < λ)]. In Section 4.3 we have shown that (1 − pk)Gk0(λ) = E[I(CLfdri < λ)CLfdri] and Gk(λ) = E[I(CLfdri< λ)].

Therefore (1/R_k)PRk_i=1CLfdrk (i) p − → (1 − p_k)G_k0(λ)/G_k(λ) = mFDRk OR= FDRkOR+ o(1).

Proof of Theorem 5. (i). Define the plug-in statistic ˆTi = \CLfdr(xi). Let ˆQP I(t) =

{P_iI( ˆT_i < t)T_i}/{P_iI( ˆT_i < t)}. The threshold of the data-driven procedure can be defined as ˆtP I = sup{t ∈ (0, 1) : ˆQP I(t) ≤ α}. Lemma A.1 in Sun and Cai (2007) implies that ˆTi−→ Tp i.

We only need to show that ˆtP I −→ tp OR. Similar to Lemma A.4 of Sun and Cai (2007), it can

be shown that ˆQP I(t)−→ Qp OR(t). Then by using the same constructions and arguments as in

Theorem 6, we can obtain that ˆtP I −→ tp OR. (ii) Note that ˆTi → T−p i and ˆtP I −→ tp OR, the proof

References

[1] Benjamini, Y., Hochberg, Y. (1995), “Controlling the False Discovery Rate: a Practical and Powerful Approach to Multiple Testing, ” Journal of the Royal Statistical Society, Series B, 57, 289-300.

[2] Considine, G., and Zappal`a, G. (2002), “The Influence of Social and Economic Disadvan- tage in the Academic Performance of School Students in Australia”, Journal of Sociology, 38, 129-148.

[3] Copas, J. (1974), “On Symmetric Compound Decision Rules for Dichotomies”, The An- nals of statististics, 2, 199-204.

[4] Efron, B., Tibshirani, R., Storey, J. and Tusher, V. (2001), “Empirical Bayes Analysis of a Microarray Experiment,” Journal of the American Statistical Association, 96, 1151-1160. [5] Efron, B. (2004), “Large-Scale Simultaneous Hypothesis Testing: the Choice of a Null

Hypothesis,” Journal of the American Statistical Association, 99, 96-104.

[6] Efron, B. (2007), “Doing Thousands of Hypothesis Tests at the Same Time”, METRON Internal Journal of Statistics, 65, 3-21.

[7] Efron, B. (2008a), “Simultaneous inference: When Should Hypothesis Testing Problems Be Combined?”, The Annals of Applied Statistics, 1, 197-223.

[8] Efron, B. (2008b), “Microarrays, Empirical Bayes, and the Two-Groups Model”, Statis- tical Science, 23, 1-22.

[9] Ferkinstad, E., Frigessi, A., Thorleifsson, G. and Kong, A. (2008), “Unsupervised Empiri- cal Bayesian Multiple Testing with External Covariates”, The Annals of Applied Statistics, 2, 714-735.

[10] Genovese, C., and Wasserman, L. (2002), “Operating Characteristic and Extensions of the False Discovery Rate Procedure, ” Journal of the Royal Statistical Society , Series B, 64, 499-517.

[11] Genovese, C., and Wasserman, L. (2004), “A Stochastic Process Approach to False Dis- covery Control,” The Annals of Statistics, 32, 1035-1061.

[12] Genovese, C., Roeder, K., Wasserman, L. (2006), “False Discovery Control with p value Weighting. ”Biometrika, 93 (3), 509-524.

[13] Jin, J. and Cai, T. (2007), “Estimating the Null and the Proportion of Non-Null Effects in Large-scale Multiple Comparisons,” Journal of the American Statistical Association, 102, 495-506.

[14] Morris, B. (2008), “Discussion of Efron’s ‘Microarrays, Empirical Bayes, and the Two- Groups Model’” Statistical Science, 23, 34-40.

[15] Newton, M., Noueiry, A., Sarkar, D., and Ahlquist, P. (2004), “Detecting differential gene expression with a semiparametric hierachical mixture method,” Biostatstics, 5, 155-176. [16] Robbins, H. (1951), “Asymptotically Subminimax Solutions of Compound Statistical De-

cision Problems,” in Proceedings of Second Berkeley Symposium on Mathematical Statis- tics and Probability, Berkeley, CA: University of California Press, 131-148.

[17] Rogasa, D. (2003), “Accuracy of API index and school base report elements: 2003 Academic Performance Index, California Department of Education”, Technical Report, Department of Statistics and School of Education, Stanford University, available at “http://www.cde.ca.gov/ta/ac/ap/researchreports.asp”.

[18] Sparkes, J. (1999), “Schools, Education and Social Exclusion,” CASE Paper 29, Centre for Analysis of Social Exclusion, London School of Economics, London.

[19] Spjøtvoll, E. (1972), “On the Optimality of Some Multiple Comparison Procedures,” The Annals of Mathematical Statistics, 43, 398-411.

[20] Storey, J. (2002), “A Direct Approach to False Discovery Rates,” Journal of the Royal Statistical Society, Series B, 64, 479-498.

[21] Storey, J. (2007), “The Optimal Discovery Procedure: A New Approach to Simultaneous Significance Testing,” Journal of Royal Statistical Society, Series B, 69, 347-368.

[22] Sun, W., and Cai, T. (2007), “ The Oracle and Adaptive Compound Decision Rules for False Discovery Rate Control.” Journal of the American Statistical Association, 102, 901-912.

[23] Sun, W., and Cai, T. (2008), “Large-Scale Multiple Testing under Dependency”, Journal of the Royal Statistical Society, Seires B, 71, 393-424.

[24] Schwartzman, A., Dougherty R., and Taylor, J. (2008), “False Discovery Rate Analysis of Brain Diffusion Direction Maps”, The Annals of Applied Statistics, 2, 153-175.

[25] van der Laan, M., Dudoit, S. and Pollard, K. (2004). “Augmentation procedures for control of the generalized family-wise error rate and tail probabilities for the proportion of false positives”, Statistical Applications in Genetics and Molecular Biology, 3, Article 15.

In document Simultaneous Testing of Grouped Hypotheses: Finding Needles in Multiple Haystacks (Page 33-38)