• No results found

Extensions

In document Testing Infection Graphs (Page 34-39)

In this section, we discuss three extensions. The first of these is an extension to likelihood- based test statistics. Second, we discuss multidimensional weights. Finally, we make a few remarks on graph confidence sets Proofs for all of the extensions are given in Chapter5.

4.2.1 Likelihood-Based Tests

The setup that we have discussed so far has let us consider statistics such as ๐‘Š1โˆ’๐‘๐‘Š0

for some constant ๐‘. However, the most obvious statistic to use is the log-likelihood ratio. Specifically, the log-likelihood ratio is

โ„“(๐›ฝ0, ๐›ฝ1;๐ฝ) = log๐ฟ(๐’ข1, ๐›ฝ1;๐ฝ)โˆ’log๐ฟ(๐’ข0, ๐›ฝ0;๐ฝ) =โ„“1(๐›ฝ1;๐ฝ)โˆ’โ„“0(๐›ฝ0;๐ฝ).

Since the log-likelihood ratio is difficult to compute for our stochastic spreading model, we might also be interested in approximations

๐‘†(๐›ฝ0, ๐›ฝ1;๐ฝ) =๐‘†1(๐›ฝ1;๐ฝ)โˆ’๐‘†0(๐›ฝ0;๐ฝ), (4.2)

where ๐‘†๐‘– is a computable approximation ofโ„“๐‘–. In particular, we saw in Chapter 3 that

๐‘†๐‘Š(๐›ฝ0, ๐›ฝ1;๐ฝ) =๐›ฝ1๐‘Š1(๐ผ)โˆ’๐›ฝ0๐‘Š0(๐ผ)

is the first-order approximation of the log-likelihood ratio when๐‘= 0.

However, there is a key difference in this case compared to the statistics considered in the preceding section. Here,๐‘† is a function ofโ„ and the parameters๐›ฝ0and๐›ฝ1. So, there are two

choices that we consider to overcome this difficulty. First, instead of finding a threshold ^๐‘ก to compare๐‘†(๐ผ) against, we find a function ^๐‘ก:B0ร—B1โ†’Rsuch that ^๐‘ก(๐›ฝ0, ๐›ฝ1)โ‰ค๐‘†(๐ผ;๐›ฝ0, ๐›ฝ1)

with probability at most๐›ผ under the null hypothesis across values of๐›ฝ0 and๐›ฝ1. The second

option is to skirt dependencies on the parameter value in the statistic. For example, using ๐‘†๐‘Š as inspiration, we might consider a class of linear statistics

๐‘†(๐›ฝ0, ๐›ฝ1;๐ฝ) =๐›ฝ1๐‘†1(๐ผ)โˆ’๐›ฝ0๐‘†0(๐ผ). (4.3)

Then by considering ๐‘†0 and ๐‘†1 separately, we can reject the null hypothesis if for each value

of ๐›ฝ0, either๐‘†0 is too small or๐‘†1 is too large.

Since the threshold function method is more cumbersome, we defer it to Chapter5. Now, we consider the second approach: two-statistic tests. Let ๐‘ 0,1โˆ’๐›ผ,๐›ฝ and ๐‘ 1,๐›ผ,๐›ฝ denote the ๐›ผ and

1โˆ’๐›ผ quantiles of ๐‘†0 and ๐‘†1 respectively, and let ^๐‘ 0,1โˆ’๐›ผ,๐›ฝ and ^๐‘ 1,๐›ผ,๐›ฝ denote their empirical

counterparts. Thus, we can state Algorithm4.2.1.

Proposition 4.2.1. If the discretization B๐ท with discretization function ๐น๐ท satisfies equa- tion (4.1), then Algorithm 4.2.1 controls the Type I error at a level ๐›ผ.

Algorithm 4.2.1:Two-Statistic Test

Input :Type I error tolerances๐›ผ0 and ๐›ผ1 where๐›ผ0+๐›ผ1 =๐›ผ, approximation

parameters๐œ–,๐›ฟ,๐›พ, and ๐œ‰, observed infection vectorโ„, null graph ๐’ข0,

statistics๐‘†0 and๐‘†1, discretizationB0,๐ท, discretization function ๐น๐ท 1 Define ๐‘sims โ‰ฅ (๏ธ 1 2๐œ–2 +38๐œ– )๏ธ log๐‘๐œ‰.

2 For each ๐›ฝ inB0,๐ท, simulate an infection ๐‘sims times on ๐’ข0 to obtain the

approximate the (๐›ผ0โˆ’๐›พโˆ’๐œ–) quantile ^๐‘ 0,1โˆ’๐›ผ0โˆ’๐›พโˆ’๐œ–,๐›ฝ and the 1โˆ’(๐›ผ1โˆ’๐›พโˆ’๐œ–) quantile ^๐‘ 1,(๐›ผ1โˆ’๐›พโˆ’๐œ–),๐›ฝ.

3 Reject the null hypothesis if for each ๐›ฝ โˆˆB0,๐ท, either๐‘†0 <๐‘ ^0,1โˆ’(๐›ผ0โˆ’๐›พโˆ’๐œ–),๐›ฝ or ๐‘†1>๐‘ ^1,(๐›ผ1โˆ’๐›พโˆ’๐œ–),๐›ฝ.

Now, Proposition 4.2.1is hardly revolutionary in terms of dividing the๐›ผ budget between two separate tests. However, in a simple example, this is shown to lead to a better Type II error than other methods, and we imagine that this is helpful even in cases where the Type II error is difficult to compute exactly.

4.2.2 Multidimensional Weights

In this section, we consider a modification of our previous setup. Here, the edge weights

๐‘ค(๐‘ข, ๐‘ฃ) = (๐‘ค1(๐‘ข, ๐‘ฃ), . . . , ๐‘ค๐‘‘(๐‘ข, ๐‘ฃ)) are vectors inR๐‘‘+. As a result, we have C=R๐‘‘+, and so

we denote our parameter vectors by ๐›ฝ. In our stochastic spreading model, the exponential random variables ๐‘‡๐‘ข๐‘ฃ now have parameter ๐›ฝ|๐‘ค(๐‘ข, ๐‘ฃ). Additionally, in this context ๐‘Š๐‘ก,๐‘ฃ

and ๐‘Š๐‘ก are also vectors in R๐‘‘+. The former is the multidimensional infection weight into

vertex๐‘ฃ at time ๐‘ก, and the latter is the multidimensional weighted cut between the infected and uninfected vertices at time ๐‘ก. We denote the ๐‘–th coordinate of these by๐‘Š๐‘ก,๐‘ฃ๐‘– and๐‘Š๐‘ก๐‘– respectively. Additionally, we refer to the edges where ๐‘ค๐‘–(๐‘ข, ๐‘ฃ)>0 as being edges of type๐‘–. However, there is one additional difficulty in the multidimensional case that does not appear in one dimension. In one dimension, we have let ๐›ฝ =โˆž to enforce spreading over edges when there is an edge between an infected and an uninfected vertex. From a topological view, this is a natural way to obtain a compactC that also maintains the continuity of the parametrizationโ„ณ.

In the multidimensional case, enforcing spreading over edges depends on the relative impor- tance of the different dimensions of the edge weights. Thus, finding a compactification of

C=R๐‘‘+ that preserves the continuity of the parametrizationโ„ณis simply far more difficult

for ๐‘‘ > 1. We give an illustration of this in Figure 4.1. As a result, we shall stick to a discretization for boundedB for๐‘‘ >2 and for a potentially unbounded parameter set when ๐‘‘= 2. Now, we can define the discretization.

Definition 4.2.1. Let ๐’ข be a graph, and let C๐‘…= [0, ๐‘…]๐‘‘. Suppose the edge weights are inR๐‘‘+. Let ๐‘€๐‘– 0 be ๐‘€0๐‘– =๐‘‘(๐‘‘+ 1) ๐‘˜+๐‘ โˆ‘๏ธ ๐‘ก=1 ๐‘Š๐‘ก๐‘–(๐‘ƒ1:๐‘กโˆ’1) ๐‘›+ 1โˆ’๐‘ก

Figure 4.1Discretization regions for๐‘‘= 2. The lightly shaded hexagon is discretized byC๐‘…,๐ท.

The dark square is discretized by ฮ”2

๐ท. Finally, the top left and bottom right trapezoidal regions

are discretized byH1,๐ท andH2,๐ท respectively.

Define ๐‘€1๐‘– to be an integer satisfying

๐‘€1๐‘– โ‰ฅ 1

2๐›ฟ๐‘€

๐‘–

0.

Let ๐‘€2๐‘– be an integer such that

๐‘€2๐‘– โ‰ฅ ๐‘‘(๐‘‘+ 1)(๐‘˜+๐‘)

2๐›ฟ

Let ๐‘1๐‘– be an integer satisfying

๐‘1๐‘– โ‰ฅ (๏ธƒ log1+ 2๐›ฟ ๐‘‘(๐‘‘+1)(๐‘˜+๐‘)๐‘… + log1+ 2๐›ฟ ๐‘‘(๐‘‘+1)(๐‘˜+๐‘) ๐‘€1๐‘– ๐‘€๐‘– 2 )๏ธƒ .

Define a one-dimensional discretization C๐‘…,๐ท,๐‘– to include the following values of ๐›ฝ:

โˆ™ ๐›ฝ๐‘— = ๐‘€๐‘—๐‘– 1 for ๐‘—= 0,1, . . . , ๐‘€๐‘– 2, โˆ™ ๐›ฝ๐‘€๐‘– 2+๐‘— =๐›ฝ๐‘€2๐‘–+๐‘—โˆ’1 (๏ธ 1 +๐‘‘(๐‘‘+1)(2๐›ฟ๐‘˜+๐‘))๏ธ for ๐‘—= 1, . . . , ๐‘1๐‘–.

Then, the multidimensional discretization is C๐‘…,๐ท=C๐‘…,๐ท,1ร— ยท ยท ยท ร—C๐‘…,๐ท,๐‘‘, i.e. the cartesian product of ๐‘‘-univariate discretizations. Finally, the discretization function is

๐น๐‘…,๐ท(๐›ฝ) := (๐น๐‘…,๐ท,1(๐›ฝ1), . . . , ๐น๐‘…,๐ท,๐‘‘(๐›ฝ๐‘‘)) where

Lemma 4.2.1. The set C๐‘…,๐ท with discretization function ๐น๐‘…,๐ท satisfies equation (4.1).

Since Algorithm4.1.1and Algorithm 4.2.1only depend on having a discretization satisfying equation (4.1), we can use them for multidimensional testing.

Now, we consider the 2-dimensional case with large๐›ฝ. There are two kinds of large๐›ฝ: the case in which both ๐›ฝ1 and๐›ฝ2 are large and another in which only one of the two is large.

From a topological perspective, to compactify the upper right quadrant and maintain the continuity of parametrization function, we need to add three line segments at infinity, two of which are almost mirror images of each other.

We start with the former. To differentiate this case, in place of๐›ฝ, we use the parameter๐œ‚

instead, and our process evolves according to

P๐œ‚(๐’ซ๐‘ก=๐‘ฃ|๐’ซ1:๐‘กโˆ’1 =๐‘ƒ1:๐‘กโˆ’1) = โŽง โŽช โŽช โŽจ โŽช โŽช โŽฉ ๐‘Š๐‘ก,๐‘ฃ(๐‘ƒ1:๐‘กโˆ’1)|๐œ‚ ๐‘Š๐‘ก(๐‘ƒ1:๐‘กโˆ’1)|๐œ‚ ๐‘Š๐‘ก(๐‘ƒ1:๐‘กโˆ’1)ฬธ=0 1 ๐‘›+1โˆ’๐‘ก ๐‘Š๐‘ก(๐‘ƒ1:๐‘กโˆ’1) =0, ๐‘ฃ ฬธโˆˆ๐‘ƒ1:๐‘กโˆ’1 0 ๐‘ฃโˆˆ๐‘ƒ1:๐‘กโˆ’1.

Additionally, we only need to consider ๐œ‚ in the 2-simplex ฮ”2:={๏ธ(๐œ‚1, ๐œ‚2)โˆˆ[0,1]2 :๐œ‚1+๐œ‚2 = 1

}๏ธ

. At this point, we can consider the discretization.

Definition 4.2.2. Let๐’ข be a given graph with a weight function๐‘ค. Then, let ๐‘€โˆžbe an integer

satisfying ๐‘€โˆžโ‰ฅ min ๐‘–=1,2max๐‘ƒโˆˆP ๐‘˜+๐‘ โˆ‘๏ธ ๐‘ก=1 ๐‘Š๐‘ก๐‘–(๐’ซ1:๐‘กโˆ’1)

Then, we define the discretization

ฮ”2๐ท := {๏ธ‚(๏ธ‚ ๐‘– ๐‘€โˆž ,๐‘€โˆžโˆ’๐‘– ๐‘€โˆž )๏ธ‚ :๐‘–= 0, . . . , ๐‘€โˆž }๏ธ‚ .

Finally, we define the discretization function

๐นฮ”,๐ท(๐›ฝ) = arg min (๐œ‚1,1โˆ’๐œ‚1)โˆˆฮ”2๐ท โƒ’ โƒ’ โƒ’ โƒ’ ๐œ‚1โˆ’ ๐›ฝ1 ๐›ฝ1+๐›ฝ2 โƒ’ โƒ’ โƒ’ โƒ’ where ๐›ฝ1, ๐›ฝ2 โ‰ฅ 2๐‘›(๐‘˜+๐‘) ๐›ฟ .

Lemma 4.2.2. Let ๐’ข be a graph where all non-zero edge weights are grater than 1, i.e. for

(๐‘ข, ๐‘ฃ)โˆˆ โ„ฐ, ๐‘ค๐‘–(๐‘ข, ๐‘ฃ)>0 for an ๐‘–= 1, . . . , ๐‘‘implies ๐‘ค๐‘–(๐‘ข, ๐‘ฃ)โ‰ฅ1. The discretization ฮ”2๐ท with discretization function ๐นฮ”,๐ท satisfies equation (4.1).

Now, we consider the second type of of large ๐›ฝ. Without loss of generality, assume that ๐›ฝ1 is small and๐›ฝ2 is large. In this case, we use a modified spreading process ๐’ฎ. Here, to

choose the infected vertex๐’ฎ๐‘ก at time๐‘ก, we spread over an edge of type two if we can do so,

and if we cannot, then the procedure evolves according to our univariate spreading model with parameter ๐œ and edge weights๐‘ค1, i.e. the edges of the first type. Now, we define the

necessary discretization.

Definition 4.2.3. Let ๐’ข be a given graph. Let๐‘€๐‘–

0 be ๐‘€0๐‘– = ๐‘˜+๐‘ โˆ‘๏ธ ๐‘ก=1 max ๐‘ƒ1:๐‘กโˆ’1 ๐‘Š๐‘ก๐‘–(๐‘ƒ1:๐‘กโˆ’1) ๐‘›+ 1โˆ’๐‘ก .

Define ๐‘€1๐‘– to be an integer satisfying

๐‘€1๐‘– โ‰ฅ 1

2๐›ฟ๐‘€

๐‘–

0.

Define ๐‘€2๐‘– to be an integer such that

๐‘€2๐‘– โ‰ฅ 2(๐‘˜+๐‘) ๐›ฟ Let ๐‘0 be ๐‘0= (๐‘˜+๐‘) (๏ธ‚ ๐‘›โˆ’๐‘˜+๐‘โˆ’1 2 )๏ธ‚

Let ๐‘1 be an integer satisfying

๐‘1 โ‰ฅ2 (๏ธ‚๐‘˜+๐‘ ๐›ฟ )๏ธ‚(๏ธƒ log๐‘0 ๐›ฟ + log ๐‘€1๐‘– ๐‘€2๐‘– )๏ธƒ .

Then, we define H๐‘–,๐ท to consist of the following values of ๐œ:

โˆ™ ๐œ๐‘— = ๐‘€๐‘—๐‘– 1 for ๐‘— = 0,1, . . . , ๐‘€2๐‘–, โˆ™ ๐œ๐‘€๐‘– 2+๐‘— =๐œ๐‘€ โ‹† ๐‘–+๐‘—โˆ’1exp (๏ธ ๐›ฟ 2(๐‘˜+๐‘) )๏ธ for ๐‘–= 1, . . . , ๐‘1.

Define the constants

๐พ = 2๐‘›(๐‘˜+๐‘) ๐›ฟ and ๐‘„= ๐‘€ ๐‘– 0 ๐›ฟ

in forming H๐‘–,๐ท. Finally, the discretization function is

๐นH,๐‘–,๐ท(๐›ฝ) = max{๐œ โˆˆC๐ท,๐‘…:๐œ โ‰ค๐›ฝ๐‘–} where ๐นH,๐‘–,๐ท is only defined for๐›ฝ satisfying๐พ+๐‘„๐›ฝ๐‘–โ‰ค๐›ฝ๐‘— for ๐‘— ฬธ=๐‘–.

Lemma 4.2.3. The discretization H๐‘–,๐ท with discretization function ๐นH,๐‘–,๐ท satisfies equa- tion (4.1).

Finally, we want to conclude that the three discretizations together comprise a discretization ofC=R2

+. To this end, we need a discretization function of the entire space. For simplicity,

let ๐‘…=๐พ = 2๐‘›(๐‘˜+๐‘)/๐›ฟ, and let๐‘„ be defined as in Definition 4.2.3. Then, we define

๐น๐ท,2(๐›ฝ) := โŽง โŽช โŽช โŽช โŽช โŽช โŽจ โŽช โŽช โŽช โŽช โŽช โŽฉ ๐นฮ”,๐ท(๐›ฝ) ๐‘… โ‰ค๐›ฝ1, ๐›ฝ2 ๐นH,1,๐ท(๐›ฝ) ๐›ฝ1โ‰ค๐‘… and๐พ+๐‘„๐›ฝ1โ‰ค๐›ฝ2 ๐นH,2,๐ท(๐›ฝ) ๐›ฝ2โ‰ค๐‘… and๐พ+๐‘„๐›ฝ2โ‰ค๐›ฝ1 ๐น๐‘…,๐ท(๐›ฝ) otherwise.

Now, we can state the corollary.

Corollary 4.2.1. Let ๐‘… = ๐พ(1 +๐‘„) where ๐พ and ๐‘„ are defined in Definition 4.2.3. The discretization C๐ท = C๐‘…,๐ทโˆชฮ”2๐ท โˆชH1,๐ท โˆชH2,๐ท with discretization function ๐น๐ท,2 satisfies

equation (4.1).

4.2.3 Graph Confidence Sets

While Algorithm 4.1.2 and its two-statistic equivalent are phrased as a algorithms for constructing confidence sets for the parameter๐›ฝ, note that we can use these algorithms to construct confidence sets of pairs (๐’ข, ๐›ฝ). For example, we might be interested in knowing not only if our HIV graph is in an (1โˆ’๐›ผ)-confidence set for a given range of ๐›ฝ, but we might also wish to know whether close variants of the graph are also in the confidence set. One curious detail is the choice of statistic when testing different graphs. Our theory allows for different statistics for each graph, and so for graph๐’ข, it might be reasonable to use a statistic such as ๐‘Š๐’ข. If we make this choice though, note that if we consider๐’ข to be the

empty or complete graphs, then the statistic is constant and so (๐’ข, ๐›ฝ) is in the confidence set for any๐›ฝ and any confidence level. We can think of this as a specific case of centering the confidence set. If we are fairly certain that our confidence set should include a particular graph๐’ขโ‹†, then for testing a graph๐’ข, we can use the statistic such๐‘†

๐’ข =๐‘Š๐’ขโˆ’๐‘Š๐’ขโ‹†. Then,

we see that ๐‘†๐’ขโ‹† is constant, which is the phenomenon we observed earlier for the empty

graph.

It is not entirely clear why this occurs or if there is a good alternative. We could specify a fixed statistic ๐‘†, which would not lead to a constant statistic on any graph but may not capture the structure of the graph being tested. Of course, we could always consider the intersection of two (1โˆ’๐›ผ/2)-confidence sets, but this does not answer the question of how to choose the statistics for each of the confidence sets.

In document Testing Infection Graphs (Page 34-39)

Related documents