Proof of Theorem 10.2 - arxiv: v1 [cs.cc] 2 Jan 2020

Throughout the proof, we use the notations and concepts introduced in Section7. We fix a problem D ∈ NP^dy and construct an algorithm-family A for Theorem 10.2. In more details, we define a function m : N⁺ → N⁺ of the form m(n) = O(2^polylog(n)), and show that for every n ∈ N⁺, there exists an algorithm An that solves Dn with Time_A(n) = O(polylog(n)) and Space_A(n) = O(poly(n)), using the oracleO_m(n)^′ for First-DT^dy. To ease notation, henceforth we write m instead of m(n).

10.3.1 High-level Strategy

Suppose that the algorithm An is given the instance-sequence (I₀, . . . , I_k) of Dn. Since D ∈ NP^dy, there exists a verifier Vn for Dn as per Definition 7.3. At a high-level, the algorithm An uses the verifierVn as a subroutine and works as follows.

Preprocessing step. At step t = 0,An does the following:

1. Call the verifier subroutineVn with input in_Vn(t_Vn) = I0 at step t_Vn = 0. Then Vn returns an ordered pair (x₀, y₀).

2. Call the oracle Om^′ in a certain way.

3. OutputAn(in_An(0)) = x0.

Update step. Subsequently, at each step t > 0,An does the following:

1. Call the oracleOm^′ (several times) in a certain way, and use the outputs of these calls to come up with a reward maximizing proof π_tas per Definition 7.3.

2. Call the verifier subroutine Vn with input in_V_n(t_V_n) = ((I_t−1, I_t), π_t) at step t_V_n = t. The verifier Vn returns an ordered pair (x_t, y_t).

3. Call the oracle Om^′ (several times) in a certain way.

4. OutputAn(in_A_n(0→ t)) = xt.

As the verifier Vn is given a reward-maximizing proof-sequence as inputs, Definition 7.3 implies that the algorithmAn solves Dn:

Lemma 10.4. For every t∈ {0, . . . , k}, we have An(in_A_n(0→ t)) = D(It).

Proof. As An calls Vn exactly once at each step, we always have t_V_n = t. Recall Definition 7.3, observe that if Vn is always given a reward-maximizing proof at each step, i.e. (π₁, . . . , π_k) is a reward-maximizing proof-sequence w.r.t. (I₀, . . . , I_k), then x_t = D(ItVn) for all t_Vn ∈ {0, . . . , k}.

Since the algorithmAnoutputs x_t at every step t, the lemma holds.

It now remains to specify Item2in the preprocessing step, and Item1and Item3in the update step. These steps are key to constructing the reward-maximizing proof-sequence forVn. We specify these steps respectively in the subsections below.

10.3.2 Initializing the Oracle

We specify Item 2 in the preprocessing step of An as follows: An calls Om^′ by giving the initial input I^′ where I^′= (mem_I^′,TI^′) is an instance of First-DT^dy as defined below.

• We set memI^′ = mem_V_n(0) where mem_V_n(0) is just the memory state of Vn after Item 1 in the preprocessing step of An (i.e. afterVn returns).

• We set TI^′ =TVⁿ, where the collection TVⁿ of decision-trees is defined as follows. Recall that the input ofVnat step t is of the form ((I_t−1, It), πt) consisting of the instance-update (I_t−1, It) and the proof π_t, respectively. Let T_V_n be the decision tree of Vn. For each possible proof π∈ {0, 1}^polylog(n), let T_π be the decision tree obtained from T_V_n by “fixing” the “proof-part"

of the input be to π. More specifically, consider every read node u∈TVⁿ whose index points to some i-th bit π[i] of π (in the proof-part of the input).¹⁶ Let p(u), r(u), l(u) be the parent, right child, and left child of u, respectively. To construct T_π, we remove u and if π[i] = 0, then add an edge (p(u), l(u)), and remove the subtree rooted at r(u). Else, if π[i] = 1, then add (p(u), r(u)) and remove the subtree rooted at l(u). We setTVⁿ ={Tπ | π ∈ {0, 1}^polylog(n)}.

• We now define the ranks r(v). Consider any proof π ∈ {0, 1}^polylog(n) and the correspond-ing decision tree T_π ∈ TVⁿ. Definition 7.3 guarantees that when given any instance-update (I_t−1, I_t) and the proof π as input, the verifier Vn outputs some ordered pair (x, y) where

x∈ {0, 1} and y ∈ {0, 1}^polylog(n). This has the following important implication.

16Recall the formal description of a decision tree from Section6.2.1.

– Consider any leaf node v in the decision tree T_π. We can associate an ordered pair (x_v, y_v) with this leaf-node v, where x_v ∈ {0, 1} and yv∈ {0, 1}^polylog(n), such that whenever the decision-tree T_π follows the root-to-leaf execution path ending at the node v, it writes (x_v, y_v) in the output memory. Note that the ordered pair (x_v, y_v) does not depend on the contents of the memory the decision tree T_π operates on.¹⁷ We define the rank of a leaf-node v in T_π to be: r(v) = y_v.

In order to prove that I^′ is indeed an instance of First-DT^dy, it remains to show the following:

Lemma 10.5. All the decision trees T_π ∈ TI^′ are shallow.

Proof. Note that there are 2^polylog(n) many decision trees in the collection TI^′, one for each bit string π ∈ {0, 1}^polylog(n). Furthermore, since the verifierVn has O(polylog(n)) update time, each decision tree T_π ∈ TI^′ has depth O(polylog(n)). Hence, the depth of each tree T_π ∈ TI^′ is at most O(poly log|TI^′|), which implies that all the trees in TI^′ are shallow according to our definition.

10.3.3 Constructing the Reward-Maximizing Proof

We maintain the following two invariants in the beginning of every step t > 0.

1. The sequence of proofs (π₁, . . . , π_t−1) that the verifier Vn received so far is the reward-maximizing proof-sequence w.r.t. (I0, . . . , I_t−1).

2. The First-DT^dy instance I^′ = (mem_I^′,TI^′) maintained byO^′mis such that mem_I^′ = mem_V_n(t− 1) where mem_V_n(t− 1) is the memory state of Vnafter finishing the step t− 1.

We now describe Item 1 and Item 3 in the update step of An. Specifically, we show how to call Om^′ several times to keep the invariant. After receiving the input in_An(t) = (I_t−1, It) at step t, the precise description of Item 1in the update step of An is as follows:

• Write (It−1, It) in the input part memⁱⁿ_V_n of mem_Vn.

• Make a sequence of calls to O^′m to update mem_I^′ so that mem_I^′ = mem_V_n.

At this point, let T_π_t ∈ TI^′ =TVⁿ be the output of the oracleOm^′ . We claim the following:

Claim 10.6. The proof πt is the reward-maximizing proof at step t.

Proof. From Section 10.1 and Section 10.3.2, it follows that the answer fromOm^′ is P^′(I^′) = T_π_t, where T_π_t is the decision tree T ∈ TVⁿ which maximizes the rank r(v_T^∗).

By the invariant, it holds that the sequence of proofs (π1, . . . , π_t−1) that the verifierVn received so far is the reward-maximizing proof-sequence w.r.t. (I₀, . . . , I_t−1). Moreover, recall that (I_t−1, I_t) is just written the input part memⁱⁿ_V_n of mem_V_n. This implies that π_tis the proof which maximizes y_t (the second part of the output (xt, yt) of the verifier Vn when it is given ((I_t−1, It), πt) as input in

step t). Hence, we conclude that π_t is the desired reward-maximizing proof at step t.

Finally, Item 3 in the update step ofAn does the following:

• Make a sequence of calls to O^′m to update mem_I^′ so that mem_I^′ = mem_V_n. Clearly, this ensures that the invariant is maintained.

17The contents of the memory determines which root-to-leaf path becomes the execution-path followed by Tπ, and every root-to-leaf path is associated with an ordered pair (xv, yv) where v is the end leaf-node of the concerned path.

10.3.4 Analyzing Update Time, Space Complexity and Blow-up Size of An

We are now ready to prove Theorem 10.2. The theorem holds since An solves Dn (Lemma 10.4) and has small blow-up size (Lemma 10.7) and update time and space complexity (Lemma10.8).

Lemma 10.7. It takes m = O(2^polylog(n)) bits to encode the First-DT^dy instance I^′. In other words, the algorithm An has blow-up size BlowUp_A(n) = O2^polylog(n).

Proof. From the proof of Lemma 10.5, we deduced that there are O2^polylog(n) decision trees in the collection TI^′. Furthermore, each tree T_π ∈ TI^′ has depth at most O(polylog(n)), which implies that each tree T_π ∈ TI^′ contains at most O2^polylog(n) nodes. Thus, there are at most O2^polylog(n)× O2^polylog(n)= O2^polylog(n)many nodes over the collection of trees TI^′. This also implies that the memory mem_I^′ contains at most O2^polylog(n)bits, for the number of bits in memI^′ can w.l.o.g. be assumed to be upper bounded by the number of nodes inTI^′ (otherwise, there will be some bits in mem_I^′ that are not accessible to any tree Tπ ∈ TI^′). We therefore conclude that the running instance I^′ = (mem_I^′,TI^′) that the algorithm An asks the oracle Om^′ to maintain an answer to can be encoded using m = O2^polylog(n) bits.

Lemma 10.8. The algorithm An has update time O(polylog(n)) and space complexity O(poly(n)).

Proof. We analyze the time of Anin the update steps. In Item1, the algorithmAnwrites (I_t−1, I_t) in the input-part memⁱⁿ_V_n. Since (I_t−1, It) is specified using O(polylog(n)) bits (see Assumption6.7), the algorithmAnmakes at most O(polylog(n)) many calls toO^′to update mem_I^′ = mem_V_n. In Item2, the algorithm An calls Vn which takes O(polylog(n)) time. Hence, the call to Vn changes at most O(polylog(n)) bits in mem_V_n. Accordingly, in Item3, the algorithmAnmakes at most O(polylog(n)) many calls to update mem_I^′ = mem_V_n. Thus, in total the algorithm An takes O(polylog(n)) time.

Finally, the space complexity of An is given by: Space_A(n) = O (Space_V(n)) = O (poly(n)).

The last equality follows from Definition 7.3.

In document arxiv: v1 [cs.cc] 2 Jan 2020 (Page 40-43)