In this section, we discuss three extensions. The first of these is an extension to likelihood- based test statistics. Second, we discuss multidimensional weights. Finally, we make a few remarks on graph confidence sets Proofs for all of the extensions are given in Chapter5.
4.2.1 Likelihood-Based Tests
The setup that we have discussed so far has let us consider statistics such as ๐1โ๐๐0
for some constant ๐. However, the most obvious statistic to use is the log-likelihood ratio. Specifically, the log-likelihood ratio is
โ(๐ฝ0, ๐ฝ1;๐ฝ) = log๐ฟ(๐ข1, ๐ฝ1;๐ฝ)โlog๐ฟ(๐ข0, ๐ฝ0;๐ฝ) =โ1(๐ฝ1;๐ฝ)โโ0(๐ฝ0;๐ฝ).
Since the log-likelihood ratio is difficult to compute for our stochastic spreading model, we might also be interested in approximations
๐(๐ฝ0, ๐ฝ1;๐ฝ) =๐1(๐ฝ1;๐ฝ)โ๐0(๐ฝ0;๐ฝ), (4.2)
where ๐๐ is a computable approximation ofโ๐. In particular, we saw in Chapter 3 that
๐๐(๐ฝ0, ๐ฝ1;๐ฝ) =๐ฝ1๐1(๐ผ)โ๐ฝ0๐0(๐ผ)
is the first-order approximation of the log-likelihood ratio when๐= 0.
However, there is a key difference in this case compared to the statistics considered in the preceding section. Here,๐ is a function ofโ and the parameters๐ฝ0and๐ฝ1. So, there are two
choices that we consider to overcome this difficulty. First, instead of finding a threshold ^๐ก to compare๐(๐ผ) against, we find a function ^๐ก:B0รB1โRsuch that ^๐ก(๐ฝ0, ๐ฝ1)โค๐(๐ผ;๐ฝ0, ๐ฝ1)
with probability at most๐ผ under the null hypothesis across values of๐ฝ0 and๐ฝ1. The second
option is to skirt dependencies on the parameter value in the statistic. For example, using ๐๐ as inspiration, we might consider a class of linear statistics
๐(๐ฝ0, ๐ฝ1;๐ฝ) =๐ฝ1๐1(๐ผ)โ๐ฝ0๐0(๐ผ). (4.3)
Then by considering ๐0 and ๐1 separately, we can reject the null hypothesis if for each value
of ๐ฝ0, either๐0 is too small or๐1 is too large.
Since the threshold function method is more cumbersome, we defer it to Chapter5. Now, we consider the second approach: two-statistic tests. Let ๐ 0,1โ๐ผ,๐ฝ and ๐ 1,๐ผ,๐ฝ denote the ๐ผ and
1โ๐ผ quantiles of ๐0 and ๐1 respectively, and let ^๐ 0,1โ๐ผ,๐ฝ and ^๐ 1,๐ผ,๐ฝ denote their empirical
counterparts. Thus, we can state Algorithm4.2.1.
Proposition 4.2.1. If the discretization B๐ท with discretization function ๐น๐ท satisfies equa- tion (4.1), then Algorithm 4.2.1 controls the Type I error at a level ๐ผ.
Algorithm 4.2.1:Two-Statistic Test
Input :Type I error tolerances๐ผ0 and ๐ผ1 where๐ผ0+๐ผ1 =๐ผ, approximation
parameters๐,๐ฟ,๐พ, and ๐, observed infection vectorโ, null graph ๐ข0,
statistics๐0 and๐1, discretizationB0,๐ท, discretization function ๐น๐ท 1 Define ๐sims โฅ (๏ธ 1 2๐2 +38๐ )๏ธ log๐๐.
2 For each ๐ฝ inB0,๐ท, simulate an infection ๐sims times on ๐ข0 to obtain the
approximate the (๐ผ0โ๐พโ๐) quantile ^๐ 0,1โ๐ผ0โ๐พโ๐,๐ฝ and the 1โ(๐ผ1โ๐พโ๐) quantile ^๐ 1,(๐ผ1โ๐พโ๐),๐ฝ.
3 Reject the null hypothesis if for each ๐ฝ โB0,๐ท, either๐0 <๐ ^0,1โ(๐ผ0โ๐พโ๐),๐ฝ or ๐1>๐ ^1,(๐ผ1โ๐พโ๐),๐ฝ.
Now, Proposition 4.2.1is hardly revolutionary in terms of dividing the๐ผ budget between two separate tests. However, in a simple example, this is shown to lead to a better Type II error than other methods, and we imagine that this is helpful even in cases where the Type II error is difficult to compute exactly.
4.2.2 Multidimensional Weights
In this section, we consider a modification of our previous setup. Here, the edge weights
๐ค(๐ข, ๐ฃ) = (๐ค1(๐ข, ๐ฃ), . . . , ๐ค๐(๐ข, ๐ฃ)) are vectors inR๐+. As a result, we have C=R๐+, and so
we denote our parameter vectors by ๐ฝ. In our stochastic spreading model, the exponential random variables ๐๐ข๐ฃ now have parameter ๐ฝ|๐ค(๐ข, ๐ฃ). Additionally, in this context ๐๐ก,๐ฃ
and ๐๐ก are also vectors in R๐+. The former is the multidimensional infection weight into
vertex๐ฃ at time ๐ก, and the latter is the multidimensional weighted cut between the infected and uninfected vertices at time ๐ก. We denote the ๐th coordinate of these by๐๐ก,๐ฃ๐ and๐๐ก๐ respectively. Additionally, we refer to the edges where ๐ค๐(๐ข, ๐ฃ)>0 as being edges of type๐. However, there is one additional difficulty in the multidimensional case that does not appear in one dimension. In one dimension, we have let ๐ฝ =โ to enforce spreading over edges when there is an edge between an infected and an uninfected vertex. From a topological view, this is a natural way to obtain a compactC that also maintains the continuity of the parametrizationโณ.
In the multidimensional case, enforcing spreading over edges depends on the relative impor- tance of the different dimensions of the edge weights. Thus, finding a compactification of
C=R๐+ that preserves the continuity of the parametrizationโณis simply far more difficult
for ๐ > 1. We give an illustration of this in Figure 4.1. As a result, we shall stick to a discretization for boundedB for๐ >2 and for a potentially unbounded parameter set when ๐= 2. Now, we can define the discretization.
Definition 4.2.1. Let ๐ข be a graph, and let C๐ = [0, ๐ ]๐. Suppose the edge weights are inR๐+. Let ๐๐ 0 be ๐0๐ =๐(๐+ 1) ๐+๐ โ๏ธ ๐ก=1 ๐๐ก๐(๐1:๐กโ1) ๐+ 1โ๐ก
Figure 4.1Discretization regions for๐= 2. The lightly shaded hexagon is discretized byC๐ ,๐ท.
The dark square is discretized by ฮ2
๐ท. Finally, the top left and bottom right trapezoidal regions
are discretized byH1,๐ท andH2,๐ท respectively.
Define ๐1๐ to be an integer satisfying
๐1๐ โฅ 1
2๐ฟ๐
๐
0.
Let ๐2๐ be an integer such that
๐2๐ โฅ ๐(๐+ 1)(๐+๐)
2๐ฟ
Let ๐1๐ be an integer satisfying
๐1๐ โฅ (๏ธ log1+ 2๐ฟ ๐(๐+1)(๐+๐)๐ + log1+ 2๐ฟ ๐(๐+1)(๐+๐) ๐1๐ ๐๐ 2 )๏ธ .
Define a one-dimensional discretization C๐ ,๐ท,๐ to include the following values of ๐ฝ:
โ ๐ฝ๐ = ๐๐๐ 1 for ๐= 0,1, . . . , ๐๐ 2, โ ๐ฝ๐๐ 2+๐ =๐ฝ๐2๐+๐โ1 (๏ธ 1 +๐(๐+1)(2๐ฟ๐+๐))๏ธ for ๐= 1, . . . , ๐1๐.
Then, the multidimensional discretization is C๐ ,๐ท=C๐ ,๐ท,1ร ยท ยท ยท รC๐ ,๐ท,๐, i.e. the cartesian product of ๐-univariate discretizations. Finally, the discretization function is
๐น๐ ,๐ท(๐ฝ) := (๐น๐ ,๐ท,1(๐ฝ1), . . . , ๐น๐ ,๐ท,๐(๐ฝ๐)) where
Lemma 4.2.1. The set C๐ ,๐ท with discretization function ๐น๐ ,๐ท satisfies equation (4.1).
Since Algorithm4.1.1and Algorithm 4.2.1only depend on having a discretization satisfying equation (4.1), we can use them for multidimensional testing.
Now, we consider the 2-dimensional case with large๐ฝ. There are two kinds of large๐ฝ: the case in which both ๐ฝ1 and๐ฝ2 are large and another in which only one of the two is large.
From a topological perspective, to compactify the upper right quadrant and maintain the continuity of parametrization function, we need to add three line segments at infinity, two of which are almost mirror images of each other.
We start with the former. To differentiate this case, in place of๐ฝ, we use the parameter๐
instead, and our process evolves according to
P๐(๐ซ๐ก=๐ฃ|๐ซ1:๐กโ1 =๐1:๐กโ1) = โง โช โช โจ โช โช โฉ ๐๐ก,๐ฃ(๐1:๐กโ1)|๐ ๐๐ก(๐1:๐กโ1)|๐ ๐๐ก(๐1:๐กโ1)ฬธ=0 1 ๐+1โ๐ก ๐๐ก(๐1:๐กโ1) =0, ๐ฃ ฬธโ๐1:๐กโ1 0 ๐ฃโ๐1:๐กโ1.
Additionally, we only need to consider ๐ in the 2-simplex ฮ2:={๏ธ(๐1, ๐2)โ[0,1]2 :๐1+๐2 = 1
}๏ธ
. At this point, we can consider the discretization.
Definition 4.2.2. Let๐ข be a given graph with a weight function๐ค. Then, let ๐โbe an integer
satisfying ๐โโฅ min ๐=1,2max๐โP ๐+๐ โ๏ธ ๐ก=1 ๐๐ก๐(๐ซ1:๐กโ1)
Then, we define the discretization
ฮ2๐ท := {๏ธ(๏ธ ๐ ๐โ ,๐โโ๐ ๐โ )๏ธ :๐= 0, . . . , ๐โ }๏ธ .
Finally, we define the discretization function
๐นฮ,๐ท(๐ฝ) = arg min (๐1,1โ๐1)โฮ2๐ท โ โ โ โ ๐1โ ๐ฝ1 ๐ฝ1+๐ฝ2 โ โ โ โ where ๐ฝ1, ๐ฝ2 โฅ 2๐(๐+๐) ๐ฟ .
Lemma 4.2.2. Let ๐ข be a graph where all non-zero edge weights are grater than 1, i.e. for
(๐ข, ๐ฃ)โ โฐ, ๐ค๐(๐ข, ๐ฃ)>0 for an ๐= 1, . . . , ๐implies ๐ค๐(๐ข, ๐ฃ)โฅ1. The discretization ฮ2๐ท with discretization function ๐นฮ,๐ท satisfies equation (4.1).
Now, we consider the second type of of large ๐ฝ. Without loss of generality, assume that ๐ฝ1 is small and๐ฝ2 is large. In this case, we use a modified spreading process ๐ฎ. Here, to
choose the infected vertex๐ฎ๐ก at time๐ก, we spread over an edge of type two if we can do so,
and if we cannot, then the procedure evolves according to our univariate spreading model with parameter ๐ and edge weights๐ค1, i.e. the edges of the first type. Now, we define the
necessary discretization.
Definition 4.2.3. Let ๐ข be a given graph. Let๐๐
0 be ๐0๐ = ๐+๐ โ๏ธ ๐ก=1 max ๐1:๐กโ1 ๐๐ก๐(๐1:๐กโ1) ๐+ 1โ๐ก .
Define ๐1๐ to be an integer satisfying
๐1๐ โฅ 1
2๐ฟ๐
๐
0.
Define ๐2๐ to be an integer such that
๐2๐ โฅ 2(๐+๐) ๐ฟ Let ๐0 be ๐0= (๐+๐) (๏ธ ๐โ๐+๐โ1 2 )๏ธ
Let ๐1 be an integer satisfying
๐1 โฅ2 (๏ธ๐+๐ ๐ฟ )๏ธ(๏ธ log๐0 ๐ฟ + log ๐1๐ ๐2๐ )๏ธ .
Then, we define H๐,๐ท to consist of the following values of ๐:
โ ๐๐ = ๐๐๐ 1 for ๐ = 0,1, . . . , ๐2๐, โ ๐๐๐ 2+๐ =๐๐ โ ๐+๐โ1exp (๏ธ ๐ฟ 2(๐+๐) )๏ธ for ๐= 1, . . . , ๐1.
Define the constants
๐พ = 2๐(๐+๐) ๐ฟ and ๐= ๐ ๐ 0 ๐ฟ
in forming H๐,๐ท. Finally, the discretization function is
๐นH,๐,๐ท(๐ฝ) = max{๐ โC๐ท,๐ :๐ โค๐ฝ๐} where ๐นH,๐,๐ท is only defined for๐ฝ satisfying๐พ+๐๐ฝ๐โค๐ฝ๐ for ๐ ฬธ=๐.
Lemma 4.2.3. The discretization H๐,๐ท with discretization function ๐นH,๐,๐ท satisfies equa- tion (4.1).
Finally, we want to conclude that the three discretizations together comprise a discretization ofC=R2
+. To this end, we need a discretization function of the entire space. For simplicity,
let ๐ =๐พ = 2๐(๐+๐)/๐ฟ, and let๐ be defined as in Definition 4.2.3. Then, we define
๐น๐ท,2(๐ฝ) := โง โช โช โช โช โช โจ โช โช โช โช โช โฉ ๐นฮ,๐ท(๐ฝ) ๐ โค๐ฝ1, ๐ฝ2 ๐นH,1,๐ท(๐ฝ) ๐ฝ1โค๐ and๐พ+๐๐ฝ1โค๐ฝ2 ๐นH,2,๐ท(๐ฝ) ๐ฝ2โค๐ and๐พ+๐๐ฝ2โค๐ฝ1 ๐น๐ ,๐ท(๐ฝ) otherwise.
Now, we can state the corollary.
Corollary 4.2.1. Let ๐ = ๐พ(1 +๐) where ๐พ and ๐ are defined in Definition 4.2.3. The discretization C๐ท = C๐ ,๐ทโชฮ2๐ท โชH1,๐ท โชH2,๐ท with discretization function ๐น๐ท,2 satisfies
equation (4.1).
4.2.3 Graph Confidence Sets
While Algorithm 4.1.2 and its two-statistic equivalent are phrased as a algorithms for constructing confidence sets for the parameter๐ฝ, note that we can use these algorithms to construct confidence sets of pairs (๐ข, ๐ฝ). For example, we might be interested in knowing not only if our HIV graph is in an (1โ๐ผ)-confidence set for a given range of ๐ฝ, but we might also wish to know whether close variants of the graph are also in the confidence set. One curious detail is the choice of statistic when testing different graphs. Our theory allows for different statistics for each graph, and so for graph๐ข, it might be reasonable to use a statistic such as ๐๐ข. If we make this choice though, note that if we consider๐ข to be the
empty or complete graphs, then the statistic is constant and so (๐ข, ๐ฝ) is in the confidence set for any๐ฝ and any confidence level. We can think of this as a specific case of centering the confidence set. If we are fairly certain that our confidence set should include a particular graph๐ขโ, then for testing a graph๐ข, we can use the statistic such๐
๐ข =๐๐ขโ๐๐ขโ. Then,
we see that ๐๐ขโ is constant, which is the phenomenon we observed earlier for the empty
graph.
It is not entirely clear why this occurs or if there is a good alternative. We could specify a fixed statistic ๐, which would not lead to a constant statistic on any graph but may not capture the structure of the graph being tested. Of course, we could always consider the intersection of two (1โ๐ผ/2)-confidence sets, but this does not answer the question of how to choose the statistics for each of the confidence sets.