• No results found

Skeleton Discovery

5.2 The Mixture Approach

5.2.2 Skeleton Discovery

The first stage of constraint-based causal discovery involves skeleton discovery. The F2CI algorithm will use FCI’s skeleton discovery procedure on PXm. I therefore analyze the properties of FCI’s skeleton discovery procedure when using PXm.

Denote the possible d-separating set of Oi at t ∈ S(f(T)) as P DS(Oti). We have

Ojt ∈P DS(Oti) if and only if there exists a pathπt betweenOti and Otj such that, for every subpath hOt

m, Otl, Othi of πt, Olt is a collider on the subpath or hOtm, Olt, Ohti is a triangle.

Next, denote the possible d-separating set of Oi asP DS(Oi), where Oj ∈P DS(Oi) if and

only if Ot

j ∈P DS(Oit) at some t ∈S(f(T)).

Now recall that PC’s skeleton discovery and v-structure orientation procedures discover a superset ofP DS(Oit),∀iusingPXt under d-separation faithfulness with respect to PXt. I want to show that PC’s skeleton discovery and v-structure orientation procedures will also discover a superset of P DS(Oi),∀iusing PXm under d-separation faithfulness with respect toPXt,∀t∈S(f(T)).

Consider the graph Ge constructed by running PC’s skeleton discovery and v-structure orientation procedures over O with a CI oracle. Let Ok ∈ P DS^(Oi) if and only if there

exists a path eπ in Ge between Oi and Ok such that, for every subpath hOm, Ol, Ohi of eπ,

Ol is a collider on the subpath or hOm, Ol, Ohi is a triangle according to PC’s skeleton

discovery and v-structure orientation procedures. The following lemma shows that we can compute a superset ofP DS(Oi),∀iusing PC’s skeleton discovery and v-structure orientation

procedures:

Lemma 5. Under mixture faithfulness and d-separation faithfulness with respect toPXt,∀t∈

S(f(T)), PC’s skeleton discovery and v-structure orientation procedures will discover a su- perset of P DS(Oi),∀i with a CI oracle; i.e., P DS(Oi)⊆P DS^(Oi),∀i.

Proof. I need to show that, for any arbitraryOk ∈P DS(Oi), we also haveOk∈P DS^(Oi).

This means that whenever ∃t ∈ S(f(T)) with a path πkt between Oit and Otk such that, for every subpath hOt

m, Olt, Othi of πt, Olt is a collider on the subpath or hOtm, Otl, Othi is a

triangle, then there also exists a path πek inGe between Oi and Ok such that Ol is a collider onhOm, Ol, Ohi orhOm, Ol, Ohi is a triangle.

I first show that all adjacencies on πt

k are oneπk. Suppose not. LetO

t

p∗—∗Oqt denote an

arbitrary adjacency present in πkt such that Op and Oq are not adjacent in eπk. Then, there exists a conditional independenceOp ⊥⊥Oq|(W,S) whenOtp 6⊥⊥Oqt|(Wt,St) by d-separation

faithfulness. But this violates the mixture faithfulness assumption via the contrapositive of Lemma 4. Hence, every adjacency on πt

k is also on eπk.

I next show that ifOtl is a collider on the subpathhOmt , Olt, Othionπkt, thenOlis either (1)

a collider on the unshielded subpath hOm, Ol, Ohior (2) hOm, Ol, Ohi is a triangle. Suppose

first that Ol is on the unshielded subpath, so that Om and Oh are non-adjacent in Ge; in other words, Om ⊥⊥Oj|(W,S) for some W ⊆O\ {Oi, Oj}. We already know that Om and

Ol as well as Ol and Oh are adjacent in Ge by the previous paragraph. Now assume that

Ol ∈W for a contradiction. If Olt is a collider on the subpath hOmt , Olt, Ohti on πkt, then we

may invoke Lemma 2.4 in [Colombo et al., 2012] to conclude that there exists an inducing path between Ot

m and Olt that is into Olt and there exists an inducing path betweenOth and

Olt that is into Olt. We may now invoke Lemma 2.5 in [Colombo et al., 2012] with the two inducing paths to conclude thatOt

m andOht must be d-connected given{Wt,St}, so we have

by applying the contrapositive of Lemma4; we have arrived at a contradiction. We therefore conclude that Ol 6∈ W, so Ol must be a collider on the unshielded subpath in Ge. Finally, suppose that Ol is on the shielded subpath, so that Om and Oh are adjacent in Ge. Then (2) holds because all three vertices in hOm, Ol, Ohi must be adjacent again by the previous

paragraph.

Recall that P DS(Ot

i) and P DS(Ojt) are sufficiently large sets for determining the ex-

istence of an inducing path between Oit and Otj. Thus, we may claim that P DS^(Oi) and

^

P DS(Oj) are also sufficiently large for determining the existence of an inducing path be-

tween Oit and Ojt for any t∈S(f(T)) by invoking Lemma 5.

We may therefore consider the following modified edge interpretations for the skeleton: • The presence of an edge between two vertices Oi and Oj implies that Oi 6⊥⊥ Oj|(W,S)

for all W ⊆P DS^(Oi) and for all W ⊆ P DS^(Oj). Thus, there may exist an inducing

path between Ot

i and Otj at some t∈S(f(T)) in this case by Lemma1.

• The absence of an edge between two vertices Oi and Oj implies that Oi ⊥⊥ Oj|(W,S)

for some W ⊆O\ {Oi, Oj}. Thus, there cannot exist an inducing path betweenOit and

Otj at any t∈S(f(T)) in this case by Lemma4.

In other words, I can use FCI’s skeleton discovery procedure equipped with a CI oracle over

PXm to rule out the existence of inducing paths.