Leonid A. Levin Boston University β
Ramarathnam Venkatesan Microsoft Research β‘
Abstract
NP-complete problems should be hard on some instances but those may be extremely rare.
On generic instances many such problems, especially related to random graphs, have been proven easy.
We show the intractability of random instances of a graph coloring problem: this graph problem is hard on average unless all NP problem under all samplable (i.e., generatable in polynomial time) distributions are easy. Worst case reductions use special gadgets and typically map instances into a negligible fraction of possible outputs. Ours must output nearly random graphs and avoid any super-polynomial distortion of probabilities. This poses significant technical difficulty.
1 Introduction
Many NP-complete problems are easy for random inputs: see, e.g., [Karp 76]. Reductions π΄ β€ π π΅ between NP problems preserve only the worst case hardness: some instances of π΅ β π (π΄) are at least as hard as those of π΄. [Johnson 84], [Gurevich 91] give a good account of some of the issues in average case analysis of NP-problems. For instance, the Hamilton path problem takes linear average time [Angluin, Valiant 79], [Gurevich, Shelah 87]. In such cases, NP-completeness may be misleading as an evidence of hardness, say for cryptography. (Many Crypto applications are based on pseudorandomness, pioneered in [Blum, Micali 82], [Goldreich, Goldwasser, Micali 86], [Yao 82], those in turn require hard on average one-way functions.) Such a disappointment has happened with the Knapsack problem: [Shamir 82], [Lagarias, Odlyzko 83].
Another misinterpretation may cause premature abandoning of the search for algorithms that are efficient on all but extremely rare and peculiar instances. Some such algorithms are neat and simple, e.g., the graph isomorphism algorithm in [Babai, Erdos, Selkow 80]. Some problems (decoding of linear codes, versions of graph coloring and independent sets, etc.), however, have eluded such βaverage case attacksβ. So, one needs stronger hardness results related to βtypicalβ or βaverageβ instances of the problem. Such hardness would be sensitive to the choice of a particular NP-complete problem and its input distribution.
As an evidence that a problem with a particular distribution is βhard on averageβ, we use a no- tion analogous to NP-completeness: our problem has no fast on average solution, unless every NP prob- lem under every samplable distribution has one. In that case there is no chance to generate, in poly- nomial time, hard NP instances and the P=?NP question becomes academic. Other examples (Tiling, Matrix Decomposition, etc.) are given in [Levin 86], [Gurevich 90]. These papers show that some NP problems with uniform probability distributions are average case complete. This concept was further analyzed in [Ben-David, Chor, Goldreich, Luby 89], [Impagliazzo, Levin 90], [Venkatesan, Rajagopalan 92], [Blass, Gurevich 95], [Wang 95], [Ajtai 96], [Micciancio, Regev 04], and other works.
In typical NP-completeness proofs π΄ β€ π π΅, the reduction π of π΄ to π΅ uses special structures (gadgets) occurring only in a negligible fraction of instances of π΅ except under strange distributions such as the one generated by π itself. Thus, the βhard instancesβ of π΅, may be extremely rare, and the problem may be easy on average under the distributions of interest. The need to avoid concentrating outputs to negligible sets presents a significant difficulty in designing π . Our π must output uniformly (within a polynomial factor) random graphs and only use gadgets available in them.
*
An abstract of a 20-color version of this result appears in [Venkatesan, Levin 88].
This work (and updates in http://arxiv.org/abs/cs/0112001 ) was supported by NSF grant CCF-1049505.
β
Department of Computer Science, 111 Cummington Mall, Boston MA 02215. http://www.cs.bu.edu/fac/lnd/
β‘
One Microsoft Way, Redmond WA 98052, [email protected]
1
Below, we show the first such simple problem on graphs. This hardness result is to be contrasted with previous works that used random graphs only to point out that many NP-complete problems are easy on average. Moreover, as [Gurevich 87] has shown, average case completeness of a random graph problem is unlikely (unless DEXP=NEXP) without introducing randomizing reductions as we do here.
To motivate our problem, we restate the classical edge-coloring problem (where all edges incident on a node must have distinct colors) in terms of 3-graphs, i.e., 3-node induced subgraphs with induced colors and unlabeled nodes: Given a simple graph and π (unbounded in general) colors, edge-color it so that no 3-graph contains two edges of the same color. We generalize this notion by allowing the list πΆ β² of permissible types of 3-graphs to be arbitrary. As πΆ β² poses only local restrictions, we specify an additional global parameter:
the number π of edges to be left blank. Set πΆ = (πΆ β² , π). This formalism has notable power to express many graph problems, even using only one color and blank. One example is the matching problem on a 2π node graph: πΆ β² contains 3-graphs with at most one blank edge each. Another is a similarly restated problem of finding π-node clique in given graph: πΆ β² restricts blanks to be self-loops, all pairwise connected. However, our results below have no direct bearing on the complexity of these classical problems on random instances.
Our graphs are directed (digraphs). We will color some of their edges and loops with 3 colors, leaving π edges blank. So, each of the 9 edges of a 3-graph has 5 options: to be colored red, green, yellow, left blank, or be absent. πΆ is taken by picking at random a number π β [1, π 2 ] and a subset πΆ β² of all 5 9 possible 3-graphs.
A πΆ-colored graph is a graph so edge-colored that it has π blank edges and all its 3-graphs are in πΆ β² . In the above examples a random πΆ β² is correct with a constant probability (if π = π(1)); π is with probability 1/π 2 . Now, our problem admits a simple statement: Given a random digraph πΊ and a randomly chosen πΆ, πΆ-color πΊ. Using a randomized reduction we show the problem is complete in the average case; thus, if this problem turns out easy on average, complexity-based cryptography will be impossible. Without requiring π blanks, the problem is trivial, but even with only one color, say red, (and blanks) no polynomial time algorithm is known: πΆ can restrict all blank edges to be self-loops on a tournament.
A number of papers argued average hardness of problems by reducing to them specific, obviously not proven NP-complete, but presumed hard worst case problems. For example, solving a noticeable fraction of instances would allow solving all instances for random self-reducible problems such as taking square roots modulo a composite, or discrete logarithm 1 These examples randomize only arguments but not the modulus.
[Ajtai 96] randomizes the entire instance for a problem: Given π, π and a random matrix π΄, solve π΄π₯ β‘ 0 (mod π), βπ₯β=π. It is shown at least as hard as the worst case versions of finding (up to a polynomial approximation factor) Shortest Vector or the Closest Vector or a basis with the smallest orthogonality defect in lattices. The approximation factor is what separates these problem from NP-hardness making such reductions (and analysis of lattice based cryptosystems) possible. 2 But self-reducible or Ajtaiβs problems may turn out to be easy on average while some similar problems would still be hard. In fact, these mentioned problems, if hard, provide one-way functions, a very desirable tool similar to hard on average problems but not yet known to follow from the existence of the latter. And even from existence of some one-way function, the hardness of the mentioned problems does not follow. But if any problem is hard on average, ours is, too.
2 Basic Concepts and Claims.
It is essential to consider reductions to (or hardness of) problems with specific distributions. If π΄ is hard on average under some distribution, and π΄ β€ π π΅, then so is π΅ under the induced distribution of π βs outputs.
But the latter may not correspond to any distribution of interest. It is interesting, to show the reducibility under some common distributions, e.g., uniform distribution on graphs. To preserve their average case complexity, our reductions must map typical instances of one problem into typical instances of another. The definitions of distributions and averaging run times involve subtleties, some discussed in this section.
Our strings and logarithms are binary; π={0, 1} * ; βπ₯β is the bit-length of π₯; (π₯, π¦) is a string representing a pair of strings π₯, π¦. Algorithmsβ average run times may be considered on inputs with probability distributions on {0, 1} π , e.g., the family π π ({π₯}) = 2 βπ . This may be artificial sometimes: the code lengths of two strings
1
over πΊπΉ (π) (find π₯ from (π, πβZ
*π, π
π₯)) or one elliptic curve or all ([Jao Miller Venkatesan 09]) elliptic curves of the same order mod π. The best known algorithms (see [Lenstra, Lenstra 91]) run in conjectured time ππ₯π((βπβ(log βπβ)
2)
1/3).
2
See, e.g., [Aharonov, Regev 05]. Analogous rounding problems in non-abelian discrete groups have tighter P-time approxi-
mation limits: any factor depending only on ambient dimensions would imply P=NP; see [Begelfor, Miller, Venkatesan 15].
may be the same in some binary encodings of instances but differ in others. Instead, distributions may be over the entire {0, 1} * , such as the uniform one π({π₯}) = π(π+1) 2
βπ. But for tighter analysis on specific lengths, distributions π π may be conditioned to {0, 1} π . We say a property holds for almost every (a.e.) π₯ if it fails with probability π π = π(1). We may also consider uniform distribution on {0, 1} N and use a cut-off algorithm πΆ : {0, 1} N β {0, 1} * selecting a finite prefix of a stream of bits after rejecting all shorter prefixes.
These probabilities differ by βπ₯β factors, but our completeness results are only up to polynomial factors in probability and, thus, in average running time. This is similar to the worst case reduction theories, which typically ignore polynomial factors in run times. So, we use π for any of these types of uniform distributions when unambiguous. Our distributions may sum to <1 allowing processes a chance to fail in producing outputs. π (π) is the output distribution of π on π-distributed inputs; Β― Β― π (π) is samplable if π runs in polynomial time (P-time) with respect to (w.r.t.) output lengths. We use asymptotic notations Ξ, π, π, βΌ.
2.1 Random Inversion Problems and Reductions.
Our version of NP is constructive: the witnesses must be explicitly produced. It is convenient to refer to them in the inversion form: for an algorithm π with βπ€β = βπ (π€)β π(1) , given an instance π₯, find a witness π€ β π β1 (π₯). E.g., π ((π, π)) may output the graph π, if π is its Hamiltonian cycle (or else an error). A random inversion NP-problem (RIP) is a pair (π, π ) where π is computable in P-time and the distribution π of the instances is samplable. The choice of a particular distribution π is significant: different distributions may concentrate on different subsets of instances thus specifying completely unrelated problems. RIPs require actually finding witnesses for instances, so our reductions are pairs (π , π) of P-time algorithms 3 to map both. π is (π, π )-injective if π (π₯)ΜΈ=π (π¦) for all π₯, π¦ in π ({0, 1} * ), when π₯ΜΈ=π¦, π({π₯, π¦}) > 0. π π is π ({0, 1} * ) excluding π₯ with π({π₯}) = 0. A reduction of a RIP (π, π ) to (π, π) is a pair (π , π) such that:
1. π is (π, π )-injective and π (π π ) β π({0, 1} * ), i.e., π maps solvable instances to distinct solvable ones;
2. π (π (π(π€)))=π(π€), i.e., π solves π -instances π₯ given a π-witness for π (π₯);
3. Β― π (π)({π¦}) β€ π({π¦})βπ¦β π(1) , i.e., π maps π-typical instances into π-typical ones.
The reductions are closed under composition and any fast on average algorithm for (π, π) yields a fast on average algorithm for (π, π ). We consider two generalizations. A padded RIP (π, π) is a modification of a RIP (π, π β² ) where π((π€, πΌ)) = (π β² (π€), πΌ). The distribution π is only on π¦=π β² (π€), combining all πΌ.
(Padding (π, π) above with π βs input circumvents the injection requirement.) A randomization (π, π ) of (π β² , π β² ), is a RIP where the padding πΌ is also random. It includes a βcut-offβ P-time algorithm πΆ(π₯, πΌ) that selects the padding length. The digits of πΌ are chosen at random but πΌ is restricted to sets π π₯ of measure π(π π₯ ) = βπ₯β βπ(1) (in our case actually βΌ 1). Then π({(π₯, πΌ)}) is π β² ({π₯})/2 βπΌβ for πΌ β π π₯ , or 0 for πΌ / β π π₯ . Remark 1. π π₯ consists of those βhelpfulβ strings πΌ which allow π to output instances π¦ whose witnesses can be transformed by π into witnesses for π₯, if any. π π₯ is not required to be decidable in P-time, so this artificial restriction breaks samplability of π. But π π₯ has a polynomial probability, so a randomized algorithm can try several πΌ, shrinking exponentially the probability of failure for solvable instances. Some algorithms inverting π give also negative answers βno inverses exist.β Unlike inverses, they are not verifiable and so can be used in reductions only if their chance exceeds by βπ₯β βπ)1) a known π(π π₯ ) bound.
Definition 1. A RIP (π, π) is complete if every RIP has a randomization reducible to it (or to its padding).
The average complexity of (π, π ) is an upper bound π (π) for the time needed to find π€βπ β1 (π₯) expressed in terms of π(π₯)=βπ₯βπ(π₯), where the instanceβs βrarenessβ or βexceptionalityβ π has βοΈ π({π₯})π(π₯) < β.
Reductions and randomizations preserve average complexity (up to a polynomial) and completeness. Note that complete problems must have many easy instances, since both hard and easy problems are reducible to them. Note also that if two distributions π and π β² are such that π({π₯})/π β² ({π₯}) = βπ₯β π(1) , then π (π₯) = π₯ reduces (π, π ) to (π β² , π ): the polynomial factors in distributions are absorbed by the definition.
3
The P-time requirement is essential for π only on average over π₯. This allows a possibility to generalize completeness to
problems, whose worst-case versions are not NP-complete. Also π may be not a samplable distribution, only dominated by one.
2.2 Main Result
Let π’ be a digraph with edges (including, loops, i.e., self-loops) colored red, green, or yellow, or left blank.
Blank edges play a different role than colored ones in that we limit the number of colored (non-blank) edges.
A spot in π’ is a 3-node subgraph with induced edges and colors, and the nodes unlabeled (i.e., replaced by 1, 2, 3). So there are 5 9 distinct spots. The coloration πΆ(π’) is the set πΆ β² of all spots in π’ and the number π of its blank edges; πΊ=π’ is obtained from π’ by removing the colors. As noted in the introduction, the classical edge coloring problem restricts the set of spots so that all edges in a spot have different colors.
We choose a random restriction instead. We similarly have 2-node spots called links.
Graph Coloration Problem:
Invert the function π (π’) = (π’, πΆ(π’)), i.e., color the edges of a given graph to achieve the given coloration.
Uniform distribution: π({πΊ}) = π(π+1) 2
βπ2, π({πΆ β² }) = 5 β9 , π({π}) = π β2 for all πΆ β² , π, and π-node graphs πΊ.
Theorem 1. Graph Coloration is a complete random inversion problem.
We could use as well a monotone variation, accepting also any smaller πΆ β² and/or larger π. The function to invert would then map (π’, πΆ β² , π) to (π’, πΆ β² , π), if πΆ(π’) = (πΆ β² , π), πΆ β² β πΆ β² , π β₯ π, to error otherwise.
2.3 Outline of the paper
The rest of the paper is devoted to proving this theorem. First, section 3 proves several lemmas on random graphs to show they are likely to have the structures needed for our reductions. (Some of these properties require tighter bounds than the literature offers so we must go into some computations.) One is the likely existence of a unique (log π)-node tournament. Connection patterns to this tournament are used to order the graphβs nodes. Another property to mention is a bound on probability of existence of matchings that allow embeddings of our constant degree templates into the random graph.
Section 4 defines a restricted type of Tiling Problem RRTP which retains its completeness and which we reduce to our graph coloring. Limited to 3-color 3-node patterns we must carefully select a universal Turing machine, bit-split its tokens, and economically represent them in the tiling grid. We use [Ikeno 58] UTM and manipulate its spare capacities to accommodate non-deterministic choices, border effects, etc.
Section 5 presents our reduction. It generates roughly uniformly random graphs with an embedded instance of our random tiling problem. We generate a graph at random, except for several features imposed on it. These features are likely to exist on their own, but may be hard to discover (e.g., it is not known how to find a (log π)-node tournament in polynomial time). Some features are imposed for convenience, to reduce to π(1) the probability of unsuitable graphs. Among these features are counts of certain types of nodes to match their expected value, the distinct patterns of nodesβ connection to the tournament, etc.
One feature is of a different nature: the connections of sequential nodes in a certain chain (Input Chain) must reflect the input bits of the tiling instance. This feature appears in random graphs only because the instance of the Tiling Problem is itself randomly chosen. The chain is the sequence of unlooped nodes connected to all nodes of tournament, and ordered according to the directions of these connections.
Section 6 describes the required coloring patterns of all 3-node induced subgraphs and proves the equiv- alence of solving the Tiling Problem to the graph coloring it is reduced to. The tournament π is marked with blank loops; its two-way connections to looped nodes are allowed to include βfreeβ blank edges (the only kind not limited to π(π) number by color patterns). The limit on the non-blank edges then assures the log π size of π , which renders it unique. We then transform a solved Tiling instance into a template colored graph and use a section 3 lemma to embed this graph into the graph produced by the reduction.
The coloring is forced to mark a grid on all nodes connected to the whole tournament. Its edges to π are colored to demonstrate the correct ordering of the input chain. The colors of the grid edges must display the tiling solution. This used to be the trickiest part of the construction but is now much simpler.
3 Random Graph Lemmas
We describe the structures needed for the reductions and prove that they exist on a polynomial fraction of
graphs. Such questions, when restricted to properties of almost all graphs, are addressed in the theory of
random graphs (see [BollobΒ΄ as 01]).
Transitive tournament (thereafter simply tournaments) are complete acyclic, except for loops at each node, digraphs. E π (π ) stands for the expected value of the random variable π w.r.t. the distribution π. We omit π when the distribution is understood. Unless stated otherwise, our random graphs have edges drawn independently with probability 1/2. In this section we consider distributions on graphs with a given (not chosen at random) number of nodes. π βΌ π means π (π) = (1 + π(1))π(π).
Remark 2. We use the following facts on probability and approximations:
1. (a) π! = (οΈ π π
)οΈ π β
2ππ + π π , where π π β [π/3, π 2 β2π] β [1, 1.11].
(b) For π = π( β π), (οΈ π
π )οΈ βΌ π π!
π, (c) Let π = π( β
π) integers be chosen from {1, . . . , π} randomly and independently.
The probability that all are distinct is Ξ πβ1 π=1 (1β π π ) = π βΞ(π
2/π) βΌ 1.
2. Assume that the random variable π = π with probability π π > 0, π β {1, . . . , π }.
Let π¦ π be the number of occurrences of π in π independent Bernoulli trials of π . Then, from the above item 1a and the central limit theorem:
(a) ππ π =E(π¦ π ); π π 2 = Variance(π¦ π ) = ππ π (1 β π π )
(b) If π₯ 3 = π(π π ), then βπ¦ π β ππ π β β₯ π π π₯ with probability π βπ₯
2/2 /Ξ(π₯ + 1) (c) Let π π be integers with βοΈ
π π π = π, π‘ π be ππ π
ππ
β1, and βοΈ
π ππ π π‘ 2 π = π(1).
Then Prob [π¦ π = π π , π = 1, . . . , π ] = β
π/π((2ππ/π + 1.11) π /2 ).
Proof of item 2(c): The number of ways in which the event [π¦ π = π π , π := 1, . . . , π ] can occur is π!/ βοΈ
πβ€π (π π !) and each of these has probability βοΈ π
π=1 π π π
π. Thus the required probability is π!
π
βοΈ
π=1
π π π
ππ π ! = π π
π
βοΈ
π=1
(οΈ π π π π
)οΈ π
πβοΈ
2ππ + π π
βοΈ π
π=1 (2ππ π + π π
π) Now, π π βοΈ π
π=1
(οΈ π
π
π
π)οΈ π
π= π βπ( βοΈ
ππ
π(1+π‘
π) log(1+π‘
π)) , which we will now see is 1/π(1). Indeed, log(1+π‘) β€ π‘, so βοΈ
π ππ π (1+π‘ π ) log(1+π‘ π ) β€ βοΈ
π ππ π π‘ π + βοΈ
π ππ π π‘ 2 π = 0 + π(1), by the assumptions.
Lemma 1. Let π₯ π be the number of π-node tournaments in a random π-node digraph, π = πβ log π, |π|=π(π), π‘ = 2 βππ . Then E(π₯ π ) βΌ π‘; E(π π 2 ) βΌ π‘ + π‘ 2 .
Proof: We use the second moment method. First, note that π₯ 2 π = βοΈ
π π π , where π π is the number of ordered pairs of π-node tournaments sharing πβπ nodes. By linearity of expectation, E(π₯ 2 π ) = βοΈ
π E(π π ). There are (οΈ π
π )οΈ ways to choose the positions of shared nodes in each tournament in a pair of π-node tournaments and (οΈ π
π+π )οΈ(π + π)! ways to map this structure in our graph. The probability of each pair to form the tournaments is 2 β(π
2βπ
2+2ππ) . Using item 1b of the Remark 2 we get (οΈ π
π+π )οΈ(π + π)! βΌ π π+π and substituting π = 2 πβπ below, E(π π ) βΌ 2 π
2βπ
2β2ππ (οΈπ
π )οΈ 2
π π+π = (οΈπ π
)οΈ 2
2 βπ(π+π) 2 π(πβπ) . Now, E(π₯ π ) = E(π 0 ) βΌ π‘, E(π π ) βΌ π‘ 2 , and E(π₯ 2 π ) = βοΈ π
π=0 E(π π ) βΌ π‘ + π‘ 2 + βοΈ πβ1 π=1 E(π π ).
It remains to show that βοΈ πβ1
π=1 E(π π ) = π(π‘+π‘ 2 ). Note that for 0<π<π, 2 βπ(π+π) β€ 2 β|π| (π‘+π‘ 2 ). So, E(π π )/(π‘+π‘ 2 ) β€ (οΈπ
π )οΈ 2
2 π(πβπ) β€ 2 βπ((πβπ)β2 log π) for 0<π<π/2
2 β(πβπ)(πβ2 log π) for π/2β€π<π. β€ 2π 2 /2 π = π(1/π).
Let 0 < π + 1/π = π(π) and π π be the probability distribution of a β2 πβπ β-node graph πΊ generated as
follows: First we choose randomly a sequence of π nodes of πΊ and force a tournament π on them. Then, we
choose at random the set of all other edges (π, π) ΜΈβ π (π ) 2 . The following corollary implies that a random
graph on β2 πβπ β nodes has a unique π-node tournament with a polynomial probability if and only if π = π(1).
Corollary 1. 1. For those πΊ with unique π-node tournaments, π π ({πΊ})/π({πΊ}) βΌ 2 ππ . 2. π π -almost every πΊ has only one π-node tournament.
Proof: First note that π π -probability of all such pairs (π, πΊ) is the same, and the total number of such pairs is βοΈ
πΊ π₯ π (πΊ) = E π (π₯ π (πΊ))/π(πΊ). Thus, π π ({πΊ})/π({πΊ}) = π₯ π /E π (π₯ π ), which is βΌ 2 ππ , if π₯ π =1. Then, the expected number of tournaments
E π
π(π₯ π ) = βοΈ
πΊ
π₯ π (πΊ)π π ({πΊ}) = βοΈ
πΊ
π₯ π (πΊ) π₯ π (πΊ)π({πΊ})
E π (π₯ π ) = E π (π₯ 2 π )
E π (π₯ π ) βΌ 2 βππ + 2 β2ππ
2 βππ = 1 + 2 βππ βΌ 1.
In other words, the expected number of tournaments other than the forced one is π(1).
Next is a lemma on matchings in random graphs. We need it to show that a.e. πΊ contains any π(1)- degree graph (e.g., Hamilton path, grids) as a spanning subgraph. We follow the method of [BollobΒ΄ as 01], Lemma 7.12, which proves the βΌ 1 probability of existence of matchings. But we need a tighter bound:
Lemma 2. Let πΊ be a bipartite undirected graph with edges in π 0 Γ π 1 chosen independently with probability π/π, π = |π π |. Then, for large enough π, the probability π that πΊ has no perfect matching is < 2π/π π . Proof: With no complete matching, πΊ has an (π+1)-node independent set πΏ 0 βͺπΏ 1 , πΏ π βπ π . Let πΏ be a smallest such πΏ π in πΊ. Then π
def= |πΏ|β1 < π/2, and no node has exactly one edge to πΏ. Let π π be the probability that πΊ has such an πΏ in π 0 . Then π < 2 βοΈ (πβ1)/2
0 π π . Assume π<π<π π /2 (else Lemma is trivial).
A node is isolated with probability π = (1βπ/π) π < π βπβπ
2/2π < π βπ /(1+π 2 /2π). Then
π π <
(οΈ π π+1
)οΈ(οΈπ π
)οΈ(οΈπ+1 2
)οΈ π (οΈ π π
)οΈ 2π (οΈ
1β π π
)οΈ (π+1)(πβπ)
<
(οΈ ππ π+1
)οΈ π+1 (οΈ ππ π
)οΈ π (οΈ π(π+1) 2
)οΈ π (οΈ π π
)οΈ 2π
π (π+1)(πβπ)/π ,
since (οΈπ π )οΈ
< (οΈ ππ π
)οΈ π
. For 2 β€ π < π/2, β
def= 1β 1 π β π+1
π β₯ 1 2 β 3
π , the above is (οΈ π 2 π 2 π β 2
)οΈ π π 2 ππ
(π+1) = π(π 2 π).
By inclusion-exclusion, π 0 < ππβ (οΈ π 2 )οΈπ 2 + (οΈ π
3 )οΈπ 3 < ππ(1βπ π 3 ). Also, π 1 < (οΈ π
2 )οΈπ( π π ) 2 π 2β2/π < π 2 π 2 π 2β2/π . So, π < 2π 0 +2π 1 +π(π 2 π)π < 2ππ βπ (1βππ/3+π 2 π 1β2/π /2+π(ππ))/(1+π 2 /2π) < 2ππ βπ since βππ/3+π 2 π 1β2/π /2 +π(ππ) < π 2 /2π, or equivalently, π π + π π (2/3βπ(1))π > π 1β2/π π, as π₯+π¦ β₯ 2 β
π₯π¦ and π < β log π.
Lemma 3. Equitable Colouring ([Hajnal, Szemeredi 70]) Any undirected graph of degree < π can be partitioned into π independent sets whose sizes differ at most by one.
See [BollobΒ΄ as 04] for a proof. [Kierstead, Kostochka 08] give a P-time algorithm but we need here only the existence of the partition. Next we use this lemma to find embeddings in random digraphs of some structures we need in our reduction. We first describe these embeddings.
Write π» @ πΊ if π» is an induced subgraph of πΊ. We call nodes π’ and π£ connected by a single edge if exactly one of the two directed edges (π’, π£), (π£, π’) exists, by a double edge if both do, and non-adjacent or disconnected if nether does. The degree deg(π’) of a node π’ is the number of adjacent nodes, and deg(πΊ) of a graph is the maximum degree of its nodes. A graph di-embedding π : π (πΊ) Λβ π (π») is a bijection isomorphic on each connected 2-node subgraph of πΊ. Note, disconnected nodes can be mapped to any nodes in π», to connected, too. We call compatible graphs with the same number of looped and unlooped nodes.
Lemma 4. (Embedding Lemma) Let π»@πΉ , π := deg(πΉ ), and π : =βπ (πΉ β π»)β = 4 π π 3 /π(1).
Consider digraphs πΊ, compatible with πΉ , having π» @ πΊ and random otherwise.
For a.e. such graph, there exists a di-embedding π of πΉ into πΊ which is an identity on π».
We will use this lemma for a graph with π = π(1): a grid with some additional edges. Now we want to
find this grid in the random digraph by constructing a suitable π. We reduce the task of constructing π to
that of finding perfect matchings in a sequence of random bipartite undirected graphs: using the lemma 3
we partition πΉ β π» suitably into some independent sets πΌ π and extend π on each πΌ π one by one.
Proof: Initialization: First, we partition πΉ β π» into independent sets πΌ π , π β€ π 2 + 1, such that βπΌ π β β₯ π β² := βπ/π 2 β, and each πΌ π consists of either only looped nodes or only unlooped nodes. Let πΉ β² be a graph obtained by adding edges between nodes in πΉ β π» that share a neighbor in π». Clearly deg(πΉ β² ) < π 2 . Now, πΉ β² can be partitioned into independent sets using the lemma 3 as needed. Next, we split π (πΊ β π») into disjoint π π , compatible with βπΌ π β. (The latter condition can be met since πΊ, πΉ are compatible.) Set πΎ = π».
Below πΎ will denote the set of nodes on which π is defined (i.e., di-embedding is found).
Repeat the following steps for π := 1, 2, . . ., until the extension of π is completed.
Extension Step(j): Call π£βπ π a candidate for π’βπΌ π if setting π(π’) : =π£ will extend π as a di-embedding over πΎ βͺ {π’}. For this, π£ must satisfy at most 2π connectivity constraints specifying which of the directed edges are present. Now we construct an undirected bipartite graph πΊ β² π by connecting every node in πΌ π to all of its candidates in π π . Then, with a high probability (over πΊ), we find a perfect matching that maps πΌ π to π π satisfying the connectivity requirements. This way we extend π over πΎ βͺ πΌ π and update πΎ to πΎ βͺ πΌ π .
Analysis of Extension Step (j): In πΊ β² π the probability of an edge {π’, π£} is 4 βdeg(π’) . Also the edges are chosen independently: the conditional probability of π£ being a candidate for π’ is the same, given the set of all the edges from π£ to Ξ(πΌ π β {π’}) β© πΎ, since Ξ(πΌ π β {π’}) and Ξ({π’}) have no nodes in common in πΊ β² π βͺ πΎ, which is guaranteed by construction of πΉ β² in the initialization and partitioning steps. The existence of matching is a monotone property i.e., cannot be broken by adding edges. So, it suffices to estimate its probability by decreasing the chance of all edges in πΊ β² π to the uniform 4 βπ . Now we come to the overall success probability (over πΊ) of finding π. Put π/π β² = 4 βπ . Then, using lemma 2, the probability that one of the π 2 matching steps fails is π(π 2 π/π 2 )π βπ = π(π)/π π
β²/4
π= π(π β(
π2 4ππβlog π) ) = π(1).
4 Turing Machines and Tilings.
Our proof uses the completeness of the Random Tiling Problem (RTP). Any samplable Random Inver- sion Problem (RIP) reduces to a problem with P-time computable distribution [Impagliazzo, Levin 90], i.e.
with the measure of intervals [0, π₯] β N computable in time βπ₯β π(1) . [Levin 86] proved RTP to be complete for such problems using deterministic reductions. A tile is a square with with letter-marked corners. A tiling of an π Γ π square, involves covering it with π 2 tiles, so that the letters on the touching corners of adjacent tiles match. Our RIP will be inverting a function π― that, given a tiled square input, returns the set of tiles (called legal) used in it, and its lowest (floor) row of tiles.
Problem: Invert π― , i.e., given a set of legal tiles and a floor row, extend it into a tiled square.
Distribution: Uniform: choose randomly π with probability π(π+1) 1 , the set of legal tiles from all possible 26 4 tiles, and a legal tile at the lower left corner; choose each successive tile of the floor row with equal probability from the legal tiles matching the previously chosen.
Our graphs have only 3 colors, so we need careful restrictions on letters in the complete RTP we use.
These symbols have four fields: a trit π β {0, 1, *} and three bits π, π , π β . We refer to π β {π, π} as the direction: leftward or rightward, to π β β {π, π} as the previous direction, to π as priming (and picture π=1 by priming the trit (e.g., * β¦β * β² ). In no tile lower π differs from π β in the corner above it, or π = π is to the left of π. Thus each row will consist of two segments: rightward pointing segment at the left and leftward pointing one at the right. In tiles with π = π on the lower left corner, the two right corner symbols must coincide. If lower right corner has π =π, the left corner symbols coincide. Two tiles cannot agree on both lower symbols but differ on both upper ones. The left segment of the floor row must have π=1, the right segment β π=0, π ΜΈ=*. The top and floor symbols carry a special flag and must agree so that the entire tiled square can be wrapped into a cylinder. The cylinder has constant symbols at each end; it is further wrapped into a torus by a ring of the special wall tiles made by repeating this pair of symbols.
We use just one particular tile set and it meets the above restrictions. Any tile set has a constant
probability among all tile sets, so the density requirements of the reductions are not affected. Note that
tiling an π Γ π square is easily reduced to tiling a larger 2π Γ π rectangle with a prime π β [π, 2π]. Our
argument will use only such rectangles. We reduce such restricted form of RTP (called RRTP) to Graph
Coloration Problem. Thus, we need to see that RRTP restrictions are compatible with [Levin 86] reductions
of non-deterministic computations by a universal Turing Machine (TM) to tiling.
4.1 Completeness of the RRTP
Any RIP can be stated as accepting random instances by a run of a given non-deterministic P-time TM.
All TMs can be simulated by a universal TM with a polynomial time overhead and some special short (logarithmic) input prefix. We will now describe such a universal TM of [Ikeno 58].
Turing Machines work on a tape consisting of cells. One cell contains the TMβs head state π, the others β its tape symbols π. A cell content at a given step is called an event. The space-time table of events (i.e., a rectangular grid where π-th row contains tape contents at π-th step) describes the history of the computation.
At each step the head acts on one of its two adjacent cells. This cell and the headβs are active, others β idle.
The tape symbols and the headβs state specify the direction π to the headβs cell or to the active tape cell respectively. The only adjacent cells with facing directions are those active. Idle cells do not change content at the step. Active cells do, performing a transition. Exactly one of them changes direction. That one carries the head at the next step. Figure 1 pictures the space-time history of a transition. Its box containing the state π 1 points toward π 1 (left or right as reflected in cases (a) and (b) respectively). In case (a), the right move merely causes the head to flip its direction; otherwise, the head switches places with the other active cell of the current row. Left moves work similarly.
a:
π π 2 π 2 π β² π β²β²
β β β β β
π π 1 π 1 π β² π β²β²
β β β β β
b:
π π 2 π 2 π β² π β²β²
β β β β β
π π 1 π 1 π β² π β²β²
β β β β β
Figure 1: Space-Time History for a Right Move (π 1 , π 1 ) β (π 2 , (π 2 , π πΌπΊπ»π ))
A binary TM has {0, 1} tape alphabet. To convert a regular TM into a binary one π we must deal with its lack of a blank symbol that usually denotes the end of input. For this we precede its input π₯ with padding: input length π = βπ₯β as a binary string preceded by a string of 2βπβ zeros. This padding initiates a pair of counters that monitor the distance to both ends of the used part of π βs tape (initially the input).
Mβs head moving on the tape keeps these counters updated and pulls them along. When the right end of the used tape is reached, any subsequent characters are treated as blanks. M and its counter are initialized so that the head never approaches the tape ends. π may have a βwrite 0 or 1β nondeterministic command.
It is used to guess the solution of the instance, and also to restore the initial content of the tape at the successful end of the computation (to meet the βwrap aroundβ requirement of RRTP).
We describe below the [Ikeno 58] TM π that simulates any such binary π . π has 6 leftward and 5 rightward head states and 6 tape symbols. They are represented by the same fields π, π (plus the directions π , π β ) as used in RRTP. Head states differ from tape symbol by π ΜΈ= π β , i.e., changed from its previous value.
The tape of π simulating π consists of a program segment π followed by the tape of π segment π . At each step, π marks the current cell it is working on, goes all the way left to program segment, decides on the next step to perform, and returns to the current cell to do so. The tape trits in π never change. In π , they mirror the current tape bits of π except that the π βs active tape symbol may be replaced by a *. This
* and the priming bits π are the only non-π data π uses. The simulation starts with the whole π primed, followed by the head, then by π β {0, 1} * .
The transition table on the right has one command (originally one of the βhaltβ commands denoted by =) modified to a choice of entering state π΄ or π΅. This simulates π βs non-deterministic command. The tableβs columns are indexed by tape symbols π, the rows by states (rightward directed β by uppercase letters, leftward β by lowercase). The entries reflect the resulting state (only if changed), the trit of the new π (if changed), and its bit (always, even if unchanged, so in the transition from (π΄, 1 β² ) the symbol 1 β² gets unprimed, while from (π, 1 β² ) it stays primed). The (π·, 0) command is unused. We turn it into the βwall tileβ to make a column of such tiles on the cylinder merging the left border of π·βs with the right one of 0s.
1β 0β *β 1 0 *
A f f e0
B F F e1
f,F c b* a* F
c = F Eβ β β
a bβ F Eβ β β β
b aβ D β β β
d β β D β β
D eβ dβ β
E β β eβ = β β
e B A A/B β β β
π comes to its βchoose π΄ or π΅ stateβ command (* β² , π) if the digit that π is to write in the Ikenoβs representation of π βs command is replaced with a *. Then π has a choice to act as if it was 0 or 1. We modify the initial state to be (* β² , π), too; the effect of the non-determinism of π βs first transition to π΄/π΅ disappears in 3 steps: easy to see by tracing them. Afterwards, the simulation proceeds in a regular way.
5 The Reduction Algorithm R
The reduction π described below produces a graph πΊ from any RRTP instance π supplemented with a random π(βπβ 7 )-bit πΌ. π pairs πΊ with a coloration: standard πΆ β² and π described in the next section.
We show that πΆ-colorings of such πΊ exist for any solvable π, and all of them easily yield solutions for π.
Remark 3. Recall that the reduction involves a pair of algorithms π and π such that:
(Randomized) tiling instance π = (π, πΌ) ββ graph coloration instance (πΊ, πΆ) π Solution π€ β² of tiling π ββ πΆ-colored πΊ π
To assure solvability of π (π ), section 6 transforms a tiling solution to a colored graph template πΊ β² which is further di-embedded into a suitably colored πΊ. This can be seen as a (not necessarily P-time) reverse of π:
Tiling solution π€ β² π
β1