Methods Based on Combining P-Values from Individual Genes within

CHAPTER 3. Testing for Monotonic Changes in Multivariate Gene Ex-

3.2 Proposed Methods

3.2.2 Methods Based on Combining P-Values from Individual Genes within

In Sections 3.2.2.1through 3.2.2.3, we introduce three testing procedures that begin by computing a test statistic for each individual gene within a gene set. The gene-specific test statistic differs for each method, but the way the test statistic values are used to draw a conclusion about the entire gene set is the same for all three methods. We now describe this approach in general terms before introducing the specific test statistics in Section 3.2.2.1 through 3.2.2.3.

First, the observed test statistics for the G genes in the gene set (denoted by S₁, . . . , S_G) are converted to permutation p-values. As described in Section 2.1, the treatment labels are randomly permuted relative to the gene expression vectors M times. Let Sgm denote the value of the test statistic computed for gene g and permutation m. This yields G × (M + 1) values that can be organized in the matrix







S1 S11 · · · S_1M S2 S21 · · · S2M

... ... . .. ... SG SG1 · · · SGM







. (3.3)

The first column of this matrix contains the test statistics calculated using the original data, and the m + 1st column (S_1m, . . . , S_Gm) contains the test statistics calculated using the mth permutation, m = 1, . . . , M . Then the permutation p-values for the original data and the permutation data also construct a G × (M + 1) matrix







p₁ p₁₁ · · · p_1M p2 p21 · · · p_2M ... ... . .. ... p_G p_G1 . . . p_GM







, (3.4)

where the first column are the permutation p-values for gene g using the original data, and the m + 1st column contains the corresponding permutation p-values for the mth

permuta-tion. If the null hypothesis is rejected with large statistic values, then pg =

m=1I(Sgm≥Sg)+1 M +1

with g = 1, . . . , G, and p_gm=

m∗=1I(Sgm∗≥Sgm)+I(Sg≥Sgm)

M +1 , g = 1, . . . , G, m = 1, . . . , M . If the null hypothesis is rejected with small statistic values, then the direction of the sign is changed from “≥” to “≤” in the above two formulas.

For each column of the p-value matrix, we combine the G elements into one statistic.

There are many different ways to combine p-values: the maximum, the minimum, the mean or the median of the G p-values, etc. Here we use Fisher’s combined probability method (1925): X² = −2P_G

g=1ln(p_g) for the original data and X_m² = −2P_G

g=1ln(p_gm) for the permutation data, m = 1, . . . , M . For a level α = 0.05 test, the null hypothesis is rejected if X² is larger than the 95th percentile among {X², X₁², . . . , X_M² }, i.e., if

m=1I(Xm²≥X²)+1

M +1 <

0.05.

In the following three testing methods described in subsections 3.2.2.1 to 3.2.2.3, we first compute a test statistic for each gene in a gene set and then combine the result across genes within a gene set. To simplify notations in these subsections, for gene g (g = 1, ..., G), we use Y_ij instead of Y_ijg to denote the gene expression value for the jth replication under the ith treatment.

3.2.2.1 Method II – Linear Regression Permutation Test (LRPT)

The second method is motivated by simple linear regression models. Consider a simple linear regression for each gene in the gene set as follows:

Yij = β0+ β1i + ij i = 1, . . . , T, j = 1, . . . , ni,

where the ij terms are assumed to be independent, zero-mean random variables with constant variance. Let |t₁| = ^{| ˆ}_s^β¹^|

β1ˆ denote the absolute value of the t-statistic for testing β₁ = 0 vs. β₁ 6= 0, where ˆβ₁ =

i=1i·ni· ¯Yi·−(PT

i=1i·ni) ¯Y··

i=1i²·ni−_N¹(PT

i=1i·ni)² , and s_β_ˆ

1 =

₁

N −2

PT i=1

P_ni

j=1(Yij− ˆYij)² PT

i=1i²·ni−_N¹(PT i=1i·ni)²

¹₂ . If the alternative hypothesis of interest is true for this gene, |t1| will tend to be large.

Similar testing ideas can be found in Armitage (1955), and Bartholomew (1961a). The Linear Regression Permutation Test statistic is defined as S_II = |t₁|. The null hypothesis is rejected with large SII and the permutation p-value is p =

m=1I(|t1m|≥|t₁|)+1

M +1 , where |t1m| is the test statistic calculated using the mth permuted data (m = 1, . . . , M ).

makes it applicable in general cases in which only the treatment indices are available to use. In the calculation of S_II, it is straightforward to use the real values of the treatment levels as the regressor variable instead of using the treatment indices for the cases where the treatment factor is numerical.

3.2.2.2 Method III – Isotonic Estimator Permutation Test (IEPT)

The basic idea of IEPT is as follows. If the alternative hypothesis is true, the distance between the mean gene expression under the first treatment index and the mean gene expression under the last treatment index should be the larger than the distances between the mean gene expressions for any other pair of treatments. This idea is motivated by Peddada et al.’s (2003) method for clustering genes using order-restricted inference.

Suppose ˆµ^(I) = (ˆµ^(I)₁ , . . . , ˆµ^(I)_T )⁰ and ˆµ^(D) = (ˆµ^(D)₁ , . . . , ˆµ^(D)_T )⁰ are the projections of ( ¯Y1·, . . . , ¯YT ·)⁰ onto the increasing order restricted profile CI = {µ ∈ R^T : µ1 ≤ µ₂ ≤ · · · ≤ µT} and the decreasing order restricted profile C_D = {µ ∈ R^T : µ1 ≥ µ₂ ≥ · · · ≥ µ_T}, respectively, using isotonic estimation. Our proposed test statistic for gene g is defined as

S_III = `∞= max{`^(I)_∞, `^(D)_∞ },

where `^(I)∞ ≡ |ˆµ^(I)₁ − ˆµ^(I)_T | and `^(D)_∞ ≡ |ˆµ^(D)₁ − ˆµ^(D)_T |. The null hypothesis is rejected for large

`∞values. The permutation p-values is p =

m=1I(`^m∞≥`∞)+1

M +1 , where `^m_∞ is the test statistic calculated from the mth permutation (m = 1, . . . , M ).

3.2.2.3 Method IV – Isotonic Likelihood Ratio Permutation Test (ILRPT) Consider ˆµ^(I) = (ˆµ^(I)₁ , ˆµ^(I)₂ , . . . , ˆµ^(I)_T )⁰ ∈ C_I and ˆµ^(D)= (ˆµ^(D)₁ , ˆµ^(D)₂ , . . . , ˆµ^(D)_T )⁰ ∈ C_D de-fined in Section3.2.2.2. The corresponding sums of squares are: SS_{T otal} =PT

i=1

Pni

j=1(Yij− Y¯··)², SSEI =PT

i=1

Pni

j=1(Yij−ˆµ^(I)_i )², SSED =PT i=1

Pni

j=1(Yij−ˆµ^(D)_i )², SSTI =PT

i=1ni(ˆµ^(I)_i − Y¯··)², SST_D =PT

i=1n_i(ˆµ^(D)_i − ¯Y··)². Define the Isotonic Likelihood Ratio Permutation Test (ILRPT) statistic as SIV = ¯B^∗ ≡ max{ ¯B^(I), ¯B^(D)}, where ¯B^(I) = _SS^SST^I

T otal and ¯B^(D) =

SST_D SST otal.

Permute the treatment labels relative to gene set expression vectors for M times, and the test statistic using the mth (m = 1, . . . , M ) permuted data ¯B_m^∗ is also calculated. The null

hypothesis is rejected when the statistic value is large enough. The permutation p-values for ILRPT is p =

m=1I( ¯B_m^∗≥ ¯B^∗)+1

M +1 .

Consider the assumption that gene expressions follow normal distributions:

Y_ij ∼ N (µ_i, σ²), i = 1, . . . , T, j = 1, . . . , n_i, (3.5) with independence among all N ≡ PT

i=1ni random variables. Also, denote the parameter space as

θ = (µ₁, µ₂, . . . , µ_T, σ²)⁰ ∈ Θ, Θ = R^T × R⁺.

Under this assumption, ˆµ^I is the MLE of µ under restriction µ1 ≤ µ₂ ≤ . . . ≤ µ_T, and ˆµ^D is the MLE of µ under restriction µ₁ ≥ µ₂ ≥ . . . ≥ µ_T. Also, rejecting the null hypothesis with ¯B^∗ large enough is equivalent to rejecting the null hypothesis with the LRT statistic small enough for testing the hypotheses

H0 : θ ∈ Θ0 vs. H1 : θ ∈ Θ1 = (ΘI∪ Θ_D) \ Θ0, where

Θ0 = {θ ∈ Θ : µ₁ = µ2. . . = µT},

ΘI = {θ ∈ Θ : µ₁ ≤ µ₂. . . ≤ µT} = C_I× R⁺, and ΘD = {θ ∈ Θ : µ₁ ≥ µ₂. . . ≥ µT} = C_D× R⁺. The derivation is provided in Section 3.7.1of the Appendix.

Moreover, with the normal distribution assumption in (3.5), ¯B^(I) and ¯B^(D) follow weighted sums of Beta distributions (Bartholomew, 1961b) under the null hypothesis, i.e.,

P ( ¯B^(I)≥ c) = P ( ¯B^(D)≥ c) =

q=2

w_{(q,T )}P (B(q−1)/2,(N −q)/2≥ c), (3.6) with weights w_{(q,T )}, q = 2, . . . , T . We present the derivation in Section 3.7.2 of the Ap-pendix.

For the “two-sided” order restricted testing problem discussed in this paper, Bartholomew (1959b) proposed an approximation method to obtain the critical value:

P (max{ ¯B^(I), ¯B^(D)} ≥ C_α) ≈ 2P ( ¯B^(I) ≥ C_α) = 2P ( ¯B^(D)≥ C_α). (3.7) So the critical value Cα is approximated by the upper ^α₂ quantile of the distribution of B¯^(I) or ¯B^(D). For balanced data, Table 1 in Bartholomew’s paper (1959a) gives the exact

approximated using Plackett’s (1954) formula. With modern computers, we are able to obtain the distribution of ¯B^(I) (or ¯B^(D)) and ¯B^∗ directly using simulation.

In document Application of order restricted statistical inference and hidden Markov modeling to problems in biology and genomics (Page 60-64)