• No results found

Toward a Confidence Set for

In document Testing Infection Graphs (Page 67-72)

In order to build a confidence set for the entire graph structure, we can use the same simulation method on other graphs. For example, we could consider a graph that we call β€œgraph 25” or 𝒒25, since it is the HIV graph with 25 randomly-chosen edges removed. A

picture is available in Figure6.3.

As discussed in Section 4.2.3, there are a number of different statistics that one might consider. Here, we consider two-sided tests for π‘Šπ’’25, π‘Šπ’’HIV, and π‘Šπ’’25 βˆ’π‘Šπ’’HIV, i.e. the edges within statistic computed with respect to 𝒒25, the edges within statistic computed with respect to the HIV graph, and the difference of these statistics. The results are shown in Figure 6.4.

There are a number of things to observe. First, using the statistics π‘Šπ’’25 andπ‘Šπ’’HIV lead to confidence sets for 𝛽 of approximately [1.148,10] and [1.27347,10] respectively. On the other hand, the difference π‘Šπ’’25 βˆ’π‘Šπ’’HIV leads to the confidence interval [0,10]. This is unsurprising, since the 𝒒25 and 𝒒HIV are not too different.

We can repeat this procedure on a different graph,𝒒100, which has 100 edges removed from the HIV graph uniformly at random. An image may be found in Figure6.5, and note that

𝒒100 is a subgraph of 𝒒25. The results of the analysis are shown in Figure6.6.

The major difference for 𝒒100 is that no value of 𝛽 is included in the confidence set for the difference statistic π‘Šπ’’100 βˆ’π‘Šπ’’HIV. This is likely due to there being many edges in 𝒒HIV between infected vertices that do not exist when simulating the distribution using 𝒒100. In

Figure 6.3The graph𝒒25 used in computing the joint confidence set of (𝒒25, 𝛽). This graph was formed from the HIV graph by removing 25 edges uniformly at random. As in Figure2.1, black vertices are infected, white vertices are uninfected, and gray vertices are censored.

0.0 0.2 0.4 0.0 2.5 5.0 7.5 10.0 Ξ² pβˆ’v alue W Difference G HIV

Figure 6.4The 𝑝-values as a function of 𝛽 for the HIV graph using the two-statistic test with different test statistics. The statistics areπ‘Šπ’’25,π‘Šπ’’HIV, andπ‘Šπ’’25βˆ’π‘Šπ’’HIV, and the𝑝-values for

these are given by the black line, the light gray line, and the dark graph line respectively.

short, for the values of 𝛽 from [0,10], we have found a graph that is not in the confidence set of graphs given by the observed infection.

Figure 6.5The graph𝒒100used in computing the joint confidence set of (𝒒100, 𝛽). This graph was formed from the HIV graph by removing 100 edges uniformly at random. As in Figure2.1, black vertices are infected, white vertices are uninfected, and gray vertices are censored.

0.0 0.2 0.4 0.0 2.5 5.0 7.5 10.0 Ξ² pβˆ’v alue W Difference G HIV

Figure 6.6The 𝑝-values as a function of 𝛽 for the HIV graph using the two-statistic test with different test statistics. The statistics areπ‘Šπ’’100,π‘Šπ’’HIV, and π‘Šπ’’100βˆ’π‘Šπ’’HIV, and the 𝑝-values

for these are given by the black line, the light gray line, and the dark graph line respectively.

set for this particular infection. This provides a very general method of graph testing at the expense of requiring many simulations.

Part III

Permutation

7.

Theory

In this chapter, we cover the primary contribution of this dissertation: tests based on permutation. Note that we restrict our inquiry to the case where the edges are unweighted, or equivalently𝑀(𝑒, 𝑣) = 1.

7.1

Permutation-Invariant Statistics

A natural statistic to consider for the purpose of graph testing is the likelihood ratio. For the stochastic spreading model, the likelihood ratio is often difficult to compute and depends on𝛽 in a nontrivial manner, making the theoretical derivations somewhat challenging. Our main focus will be on a class of statistics that are invariant under a group of permutations, which allow us to perform permutation testing based on symmetries in the graph sets. We first introduce some terminology regarding permutations and group actions, and then introduce a class of invariant statistics that will be central to our analysis.

7.1.1 Permutations and Group Actions

Recall that agraph automorphism 𝒒= (𝒱,β„°) is an elementπœ‘of the permutation group 𝑆𝑛

such that (𝑒, 𝑣)∈ β„° if and only if (πœ‘(𝑒), πœ‘(𝑣))∈ β„°. For simple hypotheses, we denote the automorphism groups of 𝒒0 and 𝒒1 by Ξ 0 = Aut(𝒒0) and Ξ 1= Aut(𝒒1), respectively. We also need to define theaction of a permutation on vertices, graphs, and infections. The action of a permutationπœ‹ on a vertex𝑒 is simply the imageπœ‹(𝑒). This is easily extended to tuples and subsets of vertices by applyingπœ‹ to the underlying vertices. A specific example is the action on edges of the graph:

πœ‹β„° ={(πœ‹(𝑒), πœ‹(𝑣)) : (𝑒, 𝑣)∈ β„°}. The action of πœ‹ on a graph 𝒒= (𝒱,β„°) is then defined to be

πœ‹π’’:= (πœ‹π’±, πœ‹β„°) = (𝒱, πœ‹β„°).

Another natural extension is to define the action of a set of permutations on a set of graphs: Ξ G={πœ‹π’’ :πœ‹βˆˆΞ  and𝒒 ∈G}.

If G𝑖 = 𝑆𝑛{𝒒𝑖}, we say that hypothesis 𝑖 corresponds to a hypothesis of a particular

graphtopology, since all node labelings are included in the set. We also define the action Ξ Ξ˜π‘– = Ξ G𝑖×𝐴𝑖. Finally, we define the action of a permutation πœ‹ on an infection𝐽:

πœ‹π½ :=(οΈπ½πœ‹βˆ’1(1), . . . , π½πœ‹βˆ’1(𝑛)

)︁

.

In other words, the infection status of the image vertex πœ‹(𝑒) is the infection status of 𝑒 under𝐽.

7.1.2 Invariant Statistics

The theory presented in our paper applies to the following class of statistics:

Definition 7.1.1. SupposeΞ  is a subgroup of 𝑆𝑛. A statistic 𝑆 is Ξ -invariant if𝑆(𝐽) =𝑆(πœ‹π½) for any𝐽 ∈Iπ‘˜,𝑐 and πœ‹βˆˆΞ .

In our permutation test, we will compute the edges-within statistic with respect to the graph

𝒒1 appearing in the alternative hypothesis in the case of a simple test, so we reject 𝐻0 when

π‘Š1(𝐽) :=π‘Šπ’’1(𝐽) exceeds a certain threshold. We derive the invariance of the statistic π‘Š under the permutation group Ξ  = Aut(𝒒) in Chapter 8.

In document Testing Infection Graphs (Page 67-72)

Related documents