Efficient Removal Lemmas for Matrices

(1)

Noga Alon

∗1

and Omri Ben-Eliezer

2

1 Departments of Mathematics and Computer Science, Tel Aviv University, Tel Aviv, Israel

[email protected]

2 Department of Computer Science, Tel Aviv University, Tel Aviv, Israel [email protected]

Abstract

The authors and Fischer recently proved that any hereditary property of two-dimensional matrices (where the row and column order is not ignored) over a finite alphabet is testable with a constant number of queries, by establishing the following (ordered) matrix removal lemma: For any finite alphabet Σ, any hereditary property P of matrices over Σ, and any > 0, there exists fP() such that for any matrix M over Σ that is -far from satisfying P, most of thefP()×fP() submatrices ofM do not satisfyP. Here being-far fromP means that one needs to modify at least an-fraction of the entries ofM to make it satisfyP.

However, in the above general removal lemma,fP() grows very fast as a function of−1_{, even} whenP is characterized by a single forbidden submatrix. In this work we establish much more efficient removal lemmas for several special cases of the above problem. In particular, we show the following: For any fixeds×tbinary matrixAand any >0 there existsδ >0 polynomial in

, such that for any binary matrixM in which less than aδ-fraction of thes×tsubmatrices are equal toA, there exists a set of less than an-fraction of the entries ofM that intersects every

A-copy inM.

We generalize the work of Alon, Fischer and Newman [SICOMP’07] and make progress to-wards proving one of their conjectures. The proofs combine their efficient conditional regularity lemma for matrices with additional combinatorial and probabilistic ideas.

1998 ACM Subject Classification F.2.2 Nonnumerical Algorithms and Problems

Keywords and phrases Property Testing, Removal Lemma, Matrix Regularity Lemma, Binary Matrix

Digital Object Identifier 10.4230/LIPIcs.APPROX/RANDOM.2017.25

1 Introduction

Removal lemmas are structural combinatorial results that relate thedensity of “forbidden” substructures in a given large structureS with thedistance of S from not containing any of the forbidden substructures, stating that if S contains a small number of forbidden substructures, then one can make S free of such substructures by making only a small number of modifications in it. Removal lemmas are closely related to many problems in Extremal Combinatorics, and have direct implications in Property Testing and other areas of Mathematics and Computer Science, such as Number Theory and Discrete Geometry.

The first known removal lemma has been the celebrated (non-induced) graph removal lemma, established by Rusza and Szemerédi [24] (see also [3, 4]). This fundamental result in

∗ _{Research supported in part by a USA-Israeli BSF grant 2012/107, by an ISF grant 620/13 and by the}

Israeli I-Core program.

licensed under Creative Commons License CC-BY

(2)

(APPROX/RAN-Graph Theory states that for any fixed graphH on hvertices and any >0 there exists

δ >0, such that for any graphGonnvertices that contains at leastn2_{copies of}_H _{that are} pairwise edge-disjoint, the total number ofH-copies inGis at leastδnh_{. Many extensions}

and strengthenings of the graph removal lemma have been obtained, as is described in more detail in Section 2.

In this work, we consider removal lemmas for two-dimensionalmatrices(with row and column order) over a finite alphabet. For simplicity, the results are generally stated for square matrices, but are easily generalizable to non-square matrices. Some of the results also hold for matrices in more than two dimensions.

The notation below is given for two-dimensional matrices, but carries over naturally to other combinatorial structures, such as graphs and multi-dimensional matrices.

Anm×nmatrixM over the alphabet Γ is viewed here as a functionM : [m]×[n]→Γ, and the row and column order is dictated by the natural order on their indices. Any matrix that can be obtained from a matrixM by deleting some of its rows and columns (while preserving the row and column order) is considered asubmatrix of M. We say thatM is binary if the alphabet is Γ ={0,1} andternary if Γ ={0,1,2}. Amatrix property P over Γ is simply a collection of matricesM : [m]×[n]→Γ. A matrix is -far fromP if one needs to change at least an-fraction of its entries to get a matrix that satisfiesP. A propertyP

ishereditaryif it is closed under taking submatrices, that is, ifM ∈ P then any submatrix

M0 ofM satisfiesM0∈ P. For any familyF of matrices over Γ, the property ofF-freeness, denoted by PF, consists of all matrices over Γ that do not contain a submatrix fromF. Observe thatP is hereditary if and only if it is characterized by some familyF of forbidden submatrices, i.e.P =PF.

While the investigation of graph removal lemmas has been quite extensive, as described in Section 2 below, the first known removal lemma for ordered graph-like two-dimensional structures, and specifically for (row and column ordered) matrices, was only obtained very recently by the authors and Fischer [2].

ITheorem 1([2]). Fix a finite alphabetΓ. For any hereditary propertyP of matrices over Γand any >0there existsfP()satisfying the following. If a matrix M is -far fromP then at least a 2/3-fraction of thefP()×fP()submatrices of M do not satisfy P.

However, even whenP is characterized by a single forbidden submatrix, the upper bound on

fP() guaranteed by the removal lemma in [2] is very large; in fact, it is at least as large as a wowzer (tower of towers) type function of. On the other hand, a lower bound of Fischer and Rozenberg [15] implies that one cannot hope for a polynomial dependence offP() in

−1 in general (for the non-binary case), even whenP is characterized by a single forbidden submatrix.

Thus, it is natural to ask for which hereditary matrix propertiesP there exist removal lemmas with more reasonable upper bounds on fP(), and specifically, to identify large families of propertiesP for whichfP() ispolynomial in−1. In this work we focus on this question, mainly for matrices over a binary alphabet.

A natural motivation for the investigation of removal lemmas comes fromproperty testing. This active field of study in computer science, initiated by Rubinfeld and Sudan [23] (see [18] for the graph case), is dedicated to finding fast algorithms to distinguish between objects that satisfy a certain property and objects that are far from satisfying this property; these algorithms are called testers. An-tester for a matrix propertyP is a (probabilistic) algorithm that is given query access to the entries of the input matrixM, and is required to distinguish, with error probability at most 1/3, between the case thatM satisfiesP and the

(3)

case thatM is-far fromP. If the tester always answers correctly whenM satisfiesP, we say that the tester has a one-sided error. We say thatP is testable if there is a one-sided error tester forP that makes a constant number of queries (that depends only onP and

but not on the size of the input). Furthermore,P iseasily testableif the number of queries is polynomial in−1. Clearly, any hereditary property of matrices is testable by Theorem 1, while any propertyP for whichfP() is shown to be polynomial in −1 is easily testable.

1.1 Background and main results

The results here are stated and proved for squaren×nmatrices, but can be generalized to non-square matrices in a straightforward manner. Our first main result is an efficientweak removal lemmafor binary matrices.

ITheorem 2. If ann×nbinary matrixM containsn2 _{pairwise-disjoint copies of an}_s_×_t binary matrix A, then the total number of A-copies inM is at least δns+t_{, where} _δ−1 _is polynomial in −1.

Here a set of pairwise-disjointA-copies inM is a set ofs×tsubmatrices ofM, all equal to

A, such that any entry ofM is contained in at most one of the submatrices.

Theorem 2 is an analogue for binary matrices of the non-induced graph removal lemma. However, in the graph removal lemma,δ−1 is not polynomial in−1 in general, in contrast to the situation in Theorem 2.

Alon, Fischer and Newman [5] proved an efficient induced removal lemma for a certain type of finite families F of binary matrices. A family F of matrices, or equivalently, a hereditary matrix propertyPF, isclosed under row (column) permutations if for anyA∈ F, any matrix created by permuting the rows (columns respectively) of Ais inF. F isclosed under permutations if it is closed under row permutations and under column permutations.

ITheorem 3 ([5]). Let F be a finite family of binary matrices that is closed under per-mutations. For any >0there existsδ >0, whereδ−1 _{is polynomial in}−1_{, such that any}

n×nbinary matrix that is-far fromF-freeness containsδns+t _{copies of some}_s_×_t_matrix A∈ F.

The main consequence of Theorem 3 is an efficient induced removal lemma for bipartite graphs. Indeed, when representing a bipartite graph by its (bi-)adjacency matrix, a forbidden subgraph H is represented by the family F of all matrices that correspond to bipartite graphs isomorphic toH. Note thatF is indeed closed under permutations in this case. Thus, any hereditary bipartite graph property characterized by a finite set of forbidden induced subgraphs is easily testable.

The problem of understanding whether the statement of Theorem 3 holds forany finite family F of binary matrices, was raised in [5] and is still open. Only recently in [2] it was shown that the statement holds if we ignore the polynomial dependence, as stated in Theorem 1.

IProblem 4. Is it true that for any fixed finite family F of binary matrices and any >0, there existsδ >0 withδ−1 polynomial in −1, such that anyn×nbinary matrix M that is

-far from F-freeness containsδns+t copies of somes×t matrixA∈ F?

Theorem 2 implies that to settle Problem 4 it is enough to show the following. Fix a finite familyF of binary matrices. Then for any >0 there exists τ >0, withτ−1polynomial in

−1, such that anyn×nbinary matrix that is-far fromF-freeness containsτ n2 _pairwise disjoint copies of matrices fromF.

(4)

Our second main result makes progress towards solving Problem 4 by generalizing the statement of Theorem 3 to any familyF of binary matrices that is closed under row (or column) permutations. From now on we only state the results for families that are closed under row permutations, but analogous results hold for families closed under column permutations.

ITheorem 5. LetF be a finite family of binary matrices that is closed under row permuta-tions. For any >0there existsδ >0, whereδ−1 _{is polynomial in}−1_{, such that any} _n_×_n binary matrix that is-far fromF-freeness containsδns+t_{copies of some}_s_×_t_matrix_A_{∈ F}_.

ICorollary 6. Any hereditary property of binary matrices that is characterized by a finite forbidden family closed under row permutations is easily testable.

Our proof of Theorem 5 is somewhat simpler than the original proof of Theorem 3. One of the main tools in the proofs of Theorems 2 and 5 is an efficient conditional regularity lemma for matrices developed in [5] (see also [20]). In the proof of Theorem 5 we only use a simpler form of the lemma, which is also easier to prove. The statement of the lemma and the proofs of Theorems 2, 5 appear in Section 4.

Besides the above two main results, we also describe a simpler variant of the construction of Fischer and Rozenberg [15], showing that for ternary matrices, the dependence between the parameters is not polynomial in general. We further suggest a way to tackle the weak removal lemma (i.e. the analogue of Theorem 2, without the polynomial dependence) in high dimensional matrices over arbitrary alphabets, by reducing it to an equivalent problem that looks more accessible. For more details, see Section 5.

2 Related work

Removal lemmas have been studied extensively in the context of graphs. The non-induced graph removal lemma (which was stated in the beginning of Section 1) has been one of the first applications of the celebrated Szemerédi graph regularity lemma [26]. The induced graph removal lemma, established in [4] by proving a stronger version of the graph regularity lemma, is a similar result considering induced subgraphs. It states that for any finite family

F of graphs and any >0 there existsδ=δ(F, )>0 with the following property. If an

n-vertex graphGis-far fromF-freeness, then it contains at leastδnv(F)_{induced copies of} someF ∈ F. Herev(F) denotes the number of vertices in F, andGis said to be (induced)

F-free if no induced subgraph ofGis isomorphic to a graph fromF.

The induced graph removal lemma was later extended to infinite families [9], stating the following. For any finite or infinite familyF of graphs and any >0 there existsfF() with the following property. If ann-vertex graphGis-far fromF-freeness, then with probability at least 2/3, a random induced subgraph ofGonfF() vertices contains a graph fromF. Note that whenF is finite, the statement of the infinite induced removal lemma is indeed equivalent to that of the finite version of the induced removal lemma.

The graph removal lemma was also extended to hypergraphs [19, 22, 21, 27]. See [13] for many more useful variants, quantitative strengthenings and extensions of the graph removal lemma.

Very recently, the authors and Fischer [2] generalized the (finite and infinite) induced graph removal lemma by obtaining anorder-preserving version of it, and also showed that the same type of proof can be used to obtain a removal lemma for two-dimensionalmatrices (with row and column order) over a finite alphabet; this it Theorem 1 above.

However, even for the non-induced graph removal lemma where the forbidden subgraph is a triangle, the best known general upper bound forδ−1in terms of−1is of tower-type [16, 12].

(5)

On the other hand, the best known lower bound for the dependence is super-polynomial but sub-exponential, and builds on a construction of Behrend [10]. See [1] for more details. Understanding the “right” dependence of δ−1 in−1, even for the simple case where the forbidden graphH is a triangle, is considered an important and difficult open problem.

In view of the above discussion, a lot of effort has been dedicated to the problem of characterizing the hereditary graph propertiesP for whichfP() is polynomial in −1, i.e., the easily testable graph properties. See the recent work of Gishboliner and Shapira [17]; for other previous works on this subject, see, e.g., [1, 8, 6]. Our work also falls under this category, but for (ordered) matrices instead of graphs; it is the first work of this type for ordered two-dimensional graph-like structures.

We finish by mentioning several other relevant removal lemma type results. Removal lemmas for vectors (i.e. one dimensional matrices where the order is important) are generally easier to obtain; in particular, a removal lemma for vectors over a fixed finite alphabet can be derived from a removal lemma for regular languages proved in [7]. A removal lemma for partially ordered sets with a grid-like structure, which can be seen as a generalization of the removal lemma for vectors, can be deduced from a result of Fischer and Newman in [14], where they mention that this problem for submatrices is more complicated and not understood. Recently, Ben-Eliezer, Korman and Reichman [11] obtained a removal lemma for patterns in multi-dimensional matrices. A pattern must be taken fromconsecutive locations, whereas in our case the rows and columns of a submatrix need not be consecutive. The case of patterns behaves very differently than that of submatrices, and in particular, in the removal lemma for patterns the parameters are linearly related (for any alphabet size) unlike the case of submatrices (in which, for alphabets of 3 letters or more, the relation cannot be polynomial).

3 Notation

Here we give some more notation that will be useful throughout the rest of the paper. We give the notation for rows but the notation for columns is equivalent. LetM : [m]×[n]→Γ be anm×nmatrix. For two rows in M whose indices inI arer < r0_{, we say that row}_r_is smaller than row r0 and rowr0 islarger than rowr. Thepredecessor of rowrinM is the largest row ¯rin M smaller thanr. In this case we say that ris thesuccessor of ¯r.

Let S be the submatrix of M on {r1, . . . , rs} × {c1, . . . , ct} where r1 < . . . < rs and

c1< . . . < ct. For i= 1, . . . , s, thei-row-index ofS inM isri; For two submatricesS, S0 of the same dimensions and withi-row-indicesri, r0_i respectively we say thatS isi-row-smaller thanS0 ifri< ri0 andi-row-bigger ifri > r0i.

Let X ={x1, . . . , xs−1} ⊆[m] with 0< x1< . . . < xs−1< mandY ={y1, . . . , yt−1} ⊆ [n] with 0< y1< . . . < yt−1< nbe subsets of indices. The submatrixS isrow-separated by

X ifri≤xi< ri+1 for anyi= 1, . . . , s−1,column-separated byY ifcj≤yj< cj+1 for any

j= 1, . . . , t−1 andseparated byX×Y if it is row separated byX and column separated byY. The elements ofX, Y are calledrow separators,column separators respectively.

3.1 Folding and unfoldable matrices

A matrix is unfoldableif no two neighboring rows in it are equal and no two neighboring columns in it are equal. Thefoldingof a matrixAis the unique matrix ˜Agenerated fromA

by deleting any row ofAthat is equal to its predecessor, and then deleting any column of the resulting matrix that is equal to its predecessor. Note that ˜A is unfoldable.

(6)

ILemma 7. Fix ans×t matrixA and letA˜ be itss0×t0 folding. For any >0 there exist

n0, δ >0, where n0 and δ−1 are polynomial in −1, such that for any n≥n0, any n×n matrixM that contains ns0+t0 _{copies of}_A_˜ _{also contains} _δns+t_{copies of}_A_.

Lemma 7 implies that generally, to prove removal lemma type results for finite families, it is enough to only consider families of unfoldable matrices. The proof follows immediately from the following lemma.

ILemma 8. LetA be ans×tfixed matrix and let A0 be ans0×t matrix created fromA

by deleting rows that are equal to their predecessors in A. Then for any >0there exist

n1=n1(A, )>0andτ=τ(A, )>0, where n1 andτ−1 are polynomial in −1, such that for anyn≥n1, anyn×n matrixM that containsns0+t_{copies of} _A0 _{also contains}_{τ n}s+t

copies ofA.

Proof of Lemma 8. LetT be the family of alln×tsubmatricesS ofM containing at least

ns0_/_{2 copies of} _A0_{. Any} _S _∈ _T _has n s0

≤ns0 _s0_×_t _{submatrices, so the number of}_A0 copies in submatrices fromT is at most|T|ns0. On the other hand, there are n_t≤ntn×t

submatrices of M so the number of A0 copies in n×t submatrices not in T is less than

ns0+t_/_{2. Hence the total number of} _A0 _{copies in submatrices from} _T _{is at least}_ns0+t_/_2,

implying that|T| ≥nt/2.

Observe that anyS∈T contains a collectionA(S) ofn/2s0 pairwise disjoint copies of

A0. To show this, we follow a greedy approach, starting with a collectionBof allA0-copies inS and with empty A. As long asBis not empty, we arbitrarily choose a copyC∈ B of

A0, add C toA and delete allA0-copies intersecting C (including itself) fromB. In each step, the number of deleted copies is at mosts0ns0−1_{, so the number of steps is at least}

ns0_/₂_s0_ns0−1₌_n/₂_s0_.

Letδ=/5ss0 and takeS∈T. Assuming thatnis large enough, pick disjoint collections

A1, . . . ,As⊆ A(S), each of size at leastδn, so that allA0-copies inAiarei-row-smaller than

allA0-copies inAi+1 for any 1≤i≤s−1. Then there areδsnscopies ofAinS: Eachs×t submatrix ofS whosei-th row is taken as thei-th row of a matrix fromAi is equal toA.

Therefore, the total number ofA-copies inM is at least|T|δs_ns_≥_δs_ns+t_/_{2, as desired.}

J

4 Proofs for the binary case

This section is dedicated to the proof of our main results in the binary domain: Theorem 2 and Theorem 5. As a general remark for the proofs in this section, We may and will assume that a square matrixM is sufficiently large (given >0), by which we mean thatM is an

n×nmatrix withn≥n0 for a suitablen0>0 that is polynomial in−1.

One of the main tools in the proofs of this section is a conditional regularity lemma for matrices due to Alon, Fischer and Newman [5]. We describe a simpler version of the lemma (this is Lemma 9 below) along with another useful result from their paper (Lemma 10 below).

Combining these results together yields the original version of the conditional regularity lemma used in the original proof of Theorem 3 in [5]. It is worth to note that even though Theorem 5 generalizes Theorem 3, for its proof we only need the simpler Lemma 9 and not the original regularity lemma, whose proof requires significantly more work. Lemma 10 is only used in the proof of Theorem 2.

We start with some definitions. A (δ, r)-row-clusteringof ann×nmatrixM is a partition of the set of rows ofM intor+ 1clusters R0, . . . , Rrsuch that theerror cluster R0 satisfies

|R0| ≤δnand for anyi= 1, . . . , r, every two rows inRi differ in at mostδnentries. That is, for everye, e0 ∈Ri, one can make row e equal toe0 by modifying at most δn entries.

(7)

A (δ, r)-column-clustering is defined analogously on the set of columns of M. The first conditional regularity lemma states the following.

ILemma 9([5]). Let k be a fixed positive integer and letδ >0be a small real. For every

n×n binary matrixM withn >(k/δ)O(k)_{, either}_M _admits₍_{δ, r}_{)-clusterings for both the} rows and the columns with r ≤(k/δ)O(k)_{, or for every} _k_×_k _{binary matrix} _A_{, at least a} (δ/k)O(k2) fraction of thek×ksubmatrices of M are copies of A.

Let R be a set of rows and let C be a set of columns in an n×n matrixM. Theblock

R×Cis the submatrix of M onR×C. A blockB isδ-homogeneouswithvalue bif there exists b ∈ {0,1} such that at least a 1−δ fraction of the entries ofB are equal to b. A (δ, r)-partition of M is a couple (R,C) where R={R1, . . . , Rr} is a partition of the set of

rows and C ={C1, . . . , Cr} is a partition of the set of columns of M, such that all but a

δ-fraction of the entries of M lie in blocks Ri×Cj that areδ-homogeneous. The second result that we need from [5], relating clusterings and partitions of a matrix, is as follows.

ILemma 10([5]). Letδ >0. If a square binary matrixM has(δ2_/₁₆_{, r}_{)-clusterings}_R_,_C of the rows and the columns respectively then (R,C)is a(δ, r+ 1)-partition ofM.

For the proofs of the above lemmas see [5]. We continue to the proof of Theorem 2. The following lemma is a crucial part of the proof.

I Lemma 11. Fix an s×t matrix A. For any > 0 there exists τ > 0, where τ−1 is polynomial in −1, such that anyn×n matrixM containingn2 _{pairwise-disjoint copies of}

A either containsτ ns+t_{copies of any} _s_×_t _{matrix, or there exist subsets of indices}_{X, Y} _of

sizes s−1, t−1respectively such thatM containsτ n2 _{pairwise disjoint copies of}_A _{that are} separated byX×Y.

Before providing the full proof of Lemma 11, we present a sketch of the proof. Clearly, whenever we apply Lemma 9 throughout the proof, we may assume that the outcome is thatM has suitable row and column clusterings, as the other possible outcome of Lemma 9 finishes the proof immediately. The main idea of the proof is to gradually find row separators, and then column separators, while maintaining a large set of pairwise disjoint copies ofA

that conform to these separators. This is done inductively (first for the rows, and then for the columns). The inductive step is described in what follows.

Assume we currently havej−1≥0 row-separators, and a setAof many pairwise disjoint

A-copies that have their firstj rows separated by these row-separators. We take a clustering of the rows ofM, and consider a cluster in which many rows are “good”, in the sense that they contain thej-th row of many of the disjointA-copies fromA. We put ourj-th separator as the medial row among the good rows. Next, we consider a matching of pairs (r1, r2) of good rows, where in each such pairr1 lies before thej-th separator andr2lies after the j-th separator. Observe that all good rows lie after the (j−1)-th separator.

If we take all pairwise-disjoint A-copies fromAwhosej-th row isr2, and “shift” their

j-th row to ber1, then most of them will still beA-copies (as rowsr1andr2are very similar, since they are in the same row cluster). This process creates a set A0 _{of many pairwise} disjointA-copies whosei-th row lies between separatorsi−1 and ifor anyi≤j, and the (j+ 1)-th row lies after separatorj. This finishes the inductive step.

We now continue to the full proof of Lemma 11.

Proof of Lemma 11. Let >0 and letM be a large enoughn×nbinary matrix containing a collectionU0 ofn2 pairwise disjointA-copies.

(8)

We prove the following claim by induction oni, fori= 0,1, . . . , s−1: there existτi, δi

with τ_i−1, δ−1_i polynomial in −1 such that either M contains τins+t _{copies of any}_s_×_t

matrix or there exist 0 =x0< x1< . . . < xi and a setUi ofδin2 pairwise disjoint A-copies

inM whose j-th row is bigger then xj−1 and no bigger than xj for any 1≤ j ≤ i, and the (i+ 1)-th row is bigger thanxi. The base casei = 0 is trivial withδ0=. Suppose now that i ≥ 1 and that x0, . . . , xi−1, δi−1 and Ui−1 are already determined. Applying Lemma 9 onM with parametersk= max{s, t} andδi−1/4, eitherM containsτins+tcopies of anys×tmatrix with τ_i−1polynomial in−1 and we are done, orM has a (δi−1/4, ri )-row-clusteringRi of M for ri polynomial in δi−1−1 and so in−1. The number of rows of

M that contain thei-th row of at leastδi−1n/2 of the A-copies inUi−1is at leastδi−1n/2, since the number ofA-copies inUi−1 whosei-th row is not taken from such a row ofM is less thatn·δi−1n/2 =δi−1n2/2. LetRi be a row cluster that contains at leastδi−1n/2ri

such rows. Note that all of these rows are bigger thanxi−1.Take subclustersR1i, R

2

i ofRi,

each containing at leastbδi−1n/4ric ≥δi−1n/5ri such rows (the inequality holds fornlarge enough) where each row inR1_i is smaller than each row inR2_i. Takexi to be the row index of the biggest row inR1

i.

Take arbitrarilyδi−1n/5ri couples of rows (r, r0) where r∈R2i andr0 ∈R

1

i and every

row participates in at most one couple. Let (r, r0) be such a couple. There exist δi−1n/2

s×tsubmatrices ofM that areA-copies fromUi−1 and whosei-th row isr. Moreover, for anyj < ithe j-th row of each of these submatrices lies betweenxj−1 (non-inclusive) and

xj (inclusive). Sincerandr0 differ in at most δi−1n/4 entries, there are at leastδi−1n/4 such submatricesT that satisfy the following: If we modifyT by taking itsi-th row to be r0

instead ofr,T remains anA-copy. Moreover, after the modification, thei-th row ofT is in

R1_i and is therefore no bigger thanxi, whereas the (i+ 1)-th row ofT is bigger than thei-th row ofT before the modification which is bigger thanxi, as needed. For every couple (r, r0) we can produceδi−1n/4 pairwise disjoint copies of Awhosej-th row is betweenxj−1 andxj

for anyj≥iand the (i+ 1)-th row is afterxi. There areδi−1n/5ri such couples (r, r0), and in total we get a setUi ofδin2copies ofAwith the desired structure forδi=δi2−1/20riwhere δ_i−1 is polynomial inδ_i−1₋₁and so in −1. Note that the copies inUi are pairwise disjoint. In the end of the process there is a setU =Us ofδsn2 pairwise disjoint copies ofA whose rows

are separated byX={x1, . . . , xs−1}. A feature that is useful in what follows is that each copy inU has exactly the same set of columns (as a submatrix ofM) as one of the original copies ofU0.

Now we apply the same process as above but in columns instead of rows, starting with theδsn2 copies in U. In the end of the process, we obtain that for some ˆτt,ˆδt such that ˆ

τ_t−1 and ˆδ−1_t are polynomial inδ_s−1 and so in−1, eitherM contains ˆτtns+t _{copies of any} s×tmatrix, or there exists a set ˆU of ˆδtn2pairwise disjoint copies ofA whose columns are

separated by a set of indicesY of size t−1. Moreover, by the above feature, each of the copies in ˆU has the same set of rows as some copy ofAfromU, so each copy has its rows separated byX. HenceX×Y separates all copies in ˆU. Takingτ= min{τt,ˆ δtˆ} finishes the

proof. _J

Next we show how Theorem 2 follows from Lemma 11. The idea of the proof is to show, using Lemmas 10 and 11, that there is a partition ofM with blocksRi×Cj (for 1≤i≤s, 1≤j≤t) satisfying the following.

All row clustersRi and all column clustersCj are large enough.

All rows ofRi (Cj) lie before all rows (columns) ofRi+1 (Cj+1 respectively) for any i andj.

Ri×Cj is almost homogeneous, and its “popular” value isAij.

(9)

We now complete the proof of Theorem 2.

Proof of Theorem 2. LetAbe ans×tbinary matrix and letk= max{s, t}. Let >0 and letM be a large enoughn×nbinary matrix that containsn2_{pairwise disjoint}_A_-copies. Lemma 11 implies that either M contains τ ns+t _{copies of}_A _where _τ−1 _{is polynomial in}

−1 (in this case we are done), or M contains at least τ n2 pairwise disjoint copies of A

separated by X×Y for suitable index subsets X, Y. By Lemma 9 we get that either M

has (τ2_/₁₂₈_{, r}_{)-clusterings of the rows and the columns where}_r_{is polynomial in}_τ−1 _and so in−1, or at least a ζ= (τ2_/₁₂₈_k₎O(k2) _{fraction of the}_s_×_t _{submatrices are}_A_{; in the} second case we are done. Suppose then thatM has (τ2_/₁₂₈_{, r}_{)-clusterings}_R_,_C_{of the rows,} columns respectively. The next step is to create refinements of the clusterings. Write the elements ofX asx1< . . . < xs−1 and letx0= 0, xs=n. Partition each R∈ Rintosparts

where thei-th part fori= 1, . . . , sconsists of all rows in Rwith index at leastxi−1 and less thanxi. Each such part is also aτ2_/_{128-cluster. Now separate each}_C_{∈ C} _into_t_{parts in a} similar fashion. This creates (τ2_/₁₂₈_,₍_r_{+ 1)}_k_{)-clusterings}_R0_,_C0 _{of the rows and the columns} respectively (where some of the clusters might be empty). By Lemma 10,P = (R0_,_C0_{) is a} (τ /4, r0)-partition ofM wherer0= (r+ 1)k+ 1, and each block of the partition has all of its entries between two neighboring row separators from X and between two neighboring column separators fromY.

There are at mostτ n2_/_{4 entries of}_M _{that lie in non-}_{τ /}_{4-homogeneous blocks of}_P _{and at} mostτ n2/4 entries of M that lie inτ /4-homogeneous blocks of P but do not agree with the value of the block. Therefore, the number of entries as above is no more thanτ n2_/_{2, and so} there exists a set ofτ n2_/_{2 pairwise disjoint copies of}_A_in_M _{separated by}_X_×_Y _{in which all} the entries come fromτ /4-homogeneous blocks and agree with the value of the block in which they lie. Hence there exist sets of rowsR1, . . . , Rs∈ R0 and sets of columns C1, . . . , Ct∈ C0

and a collectionAofτ n2_/₂₍_r0₎2k _{pairwise disjoint} _A_{-copies separated by}_X_×_Y _{such that}

for any 1 ≤ i ≤ s, 1 ≤ j ≤ t, the block Ri×Cj is τ /4-homogeneous, has value A(i, j),

lies between row separatorsxi−1andxi and between column separatorsyj−1andyj, and contains the (i, j) entry of anyA-copy inA. This implies that|Ri|,|Cj| ≥τ n/2(r0)2k _{for any}

1≤i≤sand 1≤j≤t, So there are (τ /2(r0₎2k₎s+t_ns+t _s_×_t_{submatrices of}_M _{whose (}_{i, j}₎

entry lies inRi×Cj for any i, j. Picking such a submatrixS at random, the probability that S(i, j) 6= A(i, j) for a specific couple i, j is at most τ /4; thus S is equal to A with probability at least 1−stτ /4>1/2 for small enoughτ. Hence the number ofA-copies inM

is at least(τ /2(r0)2k₎s+t_ns+t_/_2.

J

Next we give the proof of Theorem 5. For the proof, recall the definition of an unfoldable matrix and a folding of a matrix from Section 3. A family of matrices is unfoldable if all matrices in it are unfoldable. The folding of a finite family F of matrices is the set

˜

F={A˜:A∈ F }of the foldings of the matrices inF. Observe that ˜F is unfoldable for any family F. Note that if F is closed under (row) permutations then ˜F is also closed under (row) permutations.

We start with a short sketch of the proof, before turning to the full proof: As before, we may assume that our matrixM has a row clustering with suitable parameters. We may also assume that the forbidden family is unfoldable. Consider a submatrixQofM that contains exactly one “representative” row from any large enough row cluster. The crucial idea is that ifQdoes not contain many A-copies, thenM is close toF-freeness. Indeed, one can modify all rows inM to be equal to rows fromQwithout making many entry modifications, and after this modification, it is possible to eliminate allF-copies inM (without creating new

(10)

does not contain manyF-copies then the number of such columns is small. Since the above statement is true for any possible choice ofQ, we conclude that ifM is-far fromF-freeness then it must contain manyA-copies.

Proof of Theorem 5. It is enough to prove the statement of the theorem only forunfoldable families that are closed under row permutations. Indeed, suppose that Theorem 5 is true for all unfoldable families that are closed under row permutations. LetF be a family of binary matrices that is closed under row permutations and let ˜F be its folding. Then for any > 0 there exists ˜δ >0 such that any square binary matrix M which is -far from

˜

F-freeness contains ˜δns0+t0 copies of somes0×t0 matrix B∈F˜, where ˜δ−1is polynomial in

−1. Thus, provided thatM is large enough (i.e. that it is ann×nmatrix wheren≥n0 for a suitable choice ofn0polynomial in−1), we can apply Lemma 7 to get thatM also containsδns+t _{copies of the matrix} _A_{∈ F} _{whose folding is} _B_{, for a small enough} _{δ >} ₀

whereδ−1is polynomial in−1.

Therefore, suppose thatF is an unfoldable finite family of binary matrices that is closed under row permutations. Letkbe the maximal row or column dimension of a matrix fromF. Let >0 and apply Lemma 9 with parametersk and/6. Let M be a large enoughn×n

matrix withn >(k/)O(k), then eitherM containsδ2n2k copies of anyk×kmatrix, where

δ₂−1is polynomial in−1, orM has an (/6, r)-clustering of the rows withrpolynomial in−1. In the first case we are done, so suppose thatM has an (/6, r)-clusteringR={R0, . . . , Rr}

of the rows whereR0 is the error cluster.

Suppose thatM is-far fromF-freeness. We say that a clusterR6=R0inRislargeif it contains at leastn/6rrows. Note that the total number of entries that do not lie in large clusters is at mostn/6 +n/6 =n/3. Pick an arbitrary rowr(R) from every large cluster

R∈ Rand denote byQthe submatrix ofM created by these rows. LetA(Q) be a collection of pairwise disjoint copies of matrices fromF inQthat has the maximal possible number of copies. Suppose to the contrary that|A| ≤n/3kand letC be the set of all columns ofM

that intersect a copy fromA, thenC contains no more thann/3 columns. We can modify

M to make itF-free as follows: First modify every row that lies in a large clusterR∈ Rto be equal tor(R). Then pick some rowrofQand modify all rows that are not contained in large clusters to be equal tor. Finally do the following: As long asCis not empty, pick a columnc∈C that has a neighbor (predecessor or successor) not inCand modifyc to be equal to its neighbor, and then removecfrom C.

It is not hard to see that sinceF is unfoldable and closed under row permutations, after these modificationsM isF-free. Indeed, after the first and the second steps, all rows ofM

are equal to rows fromQ; the order of the rows does not matter sinceF is closed under row permutations. Now each time that we modify a columnc∈C in the third step, all copies of matrices fromF that intersect it are destroyed and no new copies are created. By the maximality ofA, any copy of a matrix fromF in the originalQintersected some column from

C, so we are done. The number of entry modifications needed in the first, second, third step respectively is at mostn2/6,n2/3, n2/3 and thus by making only 5n2/6 modifications of entries ofM we can make itF-free, contradicting the fact that M is-far fromF-freeness. Let Q be any matrix of representatives of the large row clusters as above. Then Q

contains a collectionAofn/3kpairwise disjoint copies of matrices fromF. In particular, there exist a certains×nsubmatrixT ofQand ans×tmatrixA(Q)∈ F such that at least

n/3k|F |rs _{of the copies in}_A_are_A_{-copies that lie in}_T_{. The following elementary removal}

(11)

IObservation 12. Fix ans×t matrixA. For any >0 there existsδ >0 such that if an

s×n matrixT containsn pairwise-disjointA-copies, then the total number of A-copies in

T is at leastδnt_{, with}_δ−1 _{polynomial in}−1_.

Proof. Let >0 and letT be a large enoughs×nmatrix containingnpairwise disjoint copies ofA. We constructtdisjoint subcollectionsA1, . . . ,AtofA, each of sizen/2t≤ bn/tc,

such that for anyi < j, all copies inAi arei-column-smaller than all copies inAj. This is

done by the following process fori= 1, . . . , t: takeAi to be the set of then/2t i-smallest

copies inAand delete these copies from A. Now observe that anys×t submatrix ofT that takes itsi-th column (fori= 1, . . . , t) as thei-th column of some copy fromAi is equal toA.

There are (n/2t)tsuch submatrices among all n_t≤nt _s_×_t _{submatrices of} _T_{, and so} _T

contains (/2t)t_nt_A_-copies.

J

Observation 12 implies that forQandA(Q) as above,Qcontainsγns+t _A_{-copies where} γ−1 is polynomial in (/3k|F |rs)−1 and so in−1. Finally we show thatM containsδns+t

copies of someA∈ F whereδ−1 is polynomial inγ−1 and so in−1, finishing the proof of the Theorem. For any large clusterR∈ RletR0 be some subcluster that contains exactly

bn/6rc > 0 rows. Let R0 ₌ _{_R0 _: _R _{∈ R}_{is large}_} _{and note that an} _α_{-fraction of the}

k×ksubmatricesS ofM have all of their rows in subclusters fromR0 _{with no subcluster} containing more than one row of S, whereα−1 is polynomial in −1. Let S be a random

k×ksubmatrix of M. Conditioning on the event thatS satisfies the above property, we can assume thatS is chosen in the following way: First a randomQis created by picking uniformly at random one representative from everyR0∈ R0_{, and then}_S _{is taken as a random}

k×ksubmatrix ofQ. LetA=A(Q) be defined as above. The probability thatS contains a copy of A is at least γ. That is, a random k×ksubmatrix S of M contains a copy of a matrix fromF with probability at least αγ, so there exists an s×t matrixA∈ F that is contained in a randomly chosen suchS with probability at leastαγ/|F |, soM contains

αγ n_s n t

/|F |k2k _{copies of some}_A_{∈ F}_{: To see this, observe that we can choose a random} s×t submatrixS0 _of_M _{by first picking a random}_k_×_k_submatrix_S _{and then picking an}

s×t random submatrixS0 ofS. The event thatS contains a copy ofAhas probability at leastαγ/|F |, and conditioned on this event,S0 is equal to Awith probability at leastk−2k_,

as the number ofs×t submatrices ofS is at most sktk≤k2k_{. The proof is concluded by}

taking a suitableδ=δ()>0 that satisfiesδns+t_≤_αγ n s

n

t

/|F |k2k _{for large enough values}

ofn. Note that indeedδ−1 is polynomial in−1. _J

5 Multi-dimensional matrices over arbitrary alphabets

As opposed to the polynomial dependence in the above results on binary matrices, Fischer and Rozenberg [15] showed that in analogous results for ternary matrices, as well as binary three-dimensional matrices, the dependence is super-polynomial in general. The proof builds on a construction of Behrend [10]. For the ternary case, it gives the following.

ITheorem 13 ([15]). There exists a (finite) familyF of2×2 binary matrices that is closed under permutations and satisfies the following. For any small enough >0, there exists an arbitrarily largen×nternary matrixM that containsn2 _{pairwise-disjoint copies of matrices} fromF, yet the total number of submatrices fromF inM is no more than−clogn4 where

c >0 is an absolute constant.

Theorem 13 implies that an analogue of Theorem 2 with polynomial dependence cannot be obtained when the alphabet is bigger than binary, even whenF is a small finite family

(12)

that is closed under permutations. In Subsection 5.1 we describe another construction that establishes Theorem 13, which is slightly simpler than the original construction in [15].

In what follows, we focus on the problem of finding a “weak” removal lemma analogous to Theorem 2 for matrices in more than two dimensions over an arbitrary alphabet. Here we do not try to optimize the dependence between the parameters, but rather to show that such a removal lemma exists. Note that in two dimensions this removal lemma follows from Theorem 1, but our results here suggest a direction to prove a weak high dimensional removal lemma without trying to generalize the heavy machinery used in [2] to the high dimensional setting. Our main result here states that this problem is equivalent in some sense to the problem of showing that if a hypermatrixM contains many pairwise-disjoint copies of a hypermatrixA, then it contains a “wide” copy ofA; more details are given later. In what follows, we use the termd-matrix to refer to a matrix inddimensions. An (n, d)-matrix is a

d-matrix whose dimensions are n× · · · ×n.

A weak removal lemma for families ofd-matrices that are closed under permutations follows easily from the hypergraph removal lemma [19, 22, 21, 27] using a suitable construction.

IProposition 14. Let Γbe an arbitrary alphabet and letF be a finite family ofd-matrices over Γthat is closed under permutations (in alldcoordinates). For any >0there exists

δ > 0such that the following holds. If an (n, d)-matrix M over Γ contains nd _pairwise

disjoint copies ofd-matrices fromF, thenM containsδns1+···+sd _{copies of some}_s

1×. . .×sd matrixA∈ F.

Note that Theorem 13 implies that the dependence ofδ−1 on−1in Proposition 14 cannot be polynomial. The question whether the statement of Proposition 14 holds for any finite family F is open for d-matrices withd > 2. Here we state the question in the following equivalent but simpler form.

IProblem 15. Let d >2be an integer. Is it true that for any alphabet Γ,s1×. . .×sd matrix A over Γ and > 0there exists δ > 0, such that for any (n, d)-matrix M over Γ containingnd pairwise-disjoint copies of A, the total number ofA-copies inM is at least

δns1+···+sd_?

Note that Theorem 2 settles the two-dimensional binary case of Problem 15 with δ−1

polynomial in−1, and Theorem 1 settles the two-dimensional case over any alphabet. On the other hand,δ−1 cannot be polynomial in−1 if|Γ|>2 ord >2.

Our main theorem in this domain shows that Problem 15 is equivalent to another statement that looks more accessible. We need the following definition to describe it. Let

M : [n1]×. . .×[nd]→Γ and letS be the submatrix ofM on the indices {r11, . . . , r1s1} ×

. . .× {rd1, . . . , rdsd}wherer

i j< r

i

j+1 for any 1≤i≤d, 1≤j≤si−1. The(i, j)-width ofS (for 1≤i≤dand 1≤j≤si−1) is (ri

j+1−rij)/ni.

ITheorem 16. The following statements are equivalent for anyd≥2.

1. For any alphabetΓ,s1×. . .×sd matrix Aover Γand >0 there existsδ >0 such that

for any (n, d)-matrix M that contains nd pairwise disjoint copies ofA, the total number ofA-copies inM is at leastδns1+···+sd_.

2. For any alphabet Γ, s1×. . .×sd matrix A over Γand > 0 there exists δ > 0 such that for any (n, d)-matrix M that contains nd _{pairwise disjoint copies of} _A_{, and any}

1≤i≤d,1≤j≤si, there exists anA-copy in M whose (i, j)-width is at least δ.

The proofs of the statements here are given, for simplicity, only for two dimensional matrices, but they translate directly to higher dimensions. The only major difference in

(13)

the high dimensional case is the use of the hypergraph removal lemma instead of the graph removal lemma. Due to space considerations, the proof of Theorem 16 is relegated to Appendix A, and here we only give the proof of Proposition 14.

Some definitions are required for the proof of Proposition 14. Ans×t reordering σis a permutation of [s]×[t] that is a Cartesian product of two permutationsσ1: [s]→[s] and

σ2: [t]→[t]. Given ans×t matrixA, the s×tmatrixσ(A) is the result of the following procedure: First reorder the rows ofAaccording to the permutationσ1and then reorder the columns of the resulting matrix according to the permutationσ2.

Proof of Proposition 14. Let k(F) denote the largest row or column dimension of matrices fromF. Let >0 and letM be ann×nmatrix over Γ that contains n2 pairwise-disjoint copies of matrices fromF. In particular, there is ans×tmatrixA∈ F such thatM contains

n2_/_{|F |}_{pairwise-disjoint copies of}_A_.

We construct an (s+t)-partite graphGon (s+t)nvertices as follows: There aresrow partsR1, . . . , Rs andtcolumn partsT1, . . . , Tt, each containingnvertices. The vertices of Ri (Ci) are labeledri₁, . . . , r_ni (ci₁, . . . , ci_n respectively). Two verticesra_i andrb_j (orca_i andcb_j) witha6=b are connected by an edge iffi6=j. ra

i andcbj are connected iffM(i, j) =A(a, b).

We now show that there exists a bijection between copies of Ks+t in G and couples

(S, σ) whereS is ans×t submatrix ofM andσ is ans×treordering such thatσ(S) =A. Indeed, take the following mapping: A couple (S, σ), whereS is the submatrix of M on

{a1, . . . , as} × {b1, . . . , bt}witha1< . . . < asandb1< . . . < btandσ=σ1×σ2, is mapped to the induced subgraph ofGon{rσ1(1)

a1 , . . . , r σ1(s) as , c σ2(1) b1 , . . . , c σ2(t) bt }.

It is not hard to see that (S, σ) is mapped to a copy ofKs+tif and only ifσ(S) is equal

toA. On the other hand, every copy ofKs+tinGhas exactly one vertex in each row part

and in each column part, and there exists a unique couple (S, σ) mapped to it.

There existn2_/_{|F |}_{pairwise-disjoint}_A_{-copies in} _M _{that are mapped (with the identity} reordering) to edge-disjoint copies ofKs+t inG. By the graph removal lemma, there exists δ >0 such that at least a δ-fraction of the subgraphs of G on s+t vertices are cliques. Therefore, at least aδ-fraction of the possible couples (S, σ) (where S is ans×tsubmatrix ofM andσis ans×t reordering) satisfyσ(S) =A, concluding the proof. _J The proof of Theorem 16 is given in Appendix A.

5.1 Lower bound

In this subsection we give an alternative constructive proof of Theorem 13. Our main tool is the following result in additive number theory from [1], based on a construction of Behrend [10].

I Lemma 17 ([1, 10]). For every positive integer m there exists a subset X ⊆ [m] =

{1, . . . , m} with no non-trivial solution to the equation x1+x2+x3= 3x4, whereX is of size at least

|X| ≥ m

e20√logm. (1)

Proof of Theorem 13. Consider the familyF={A, B} where

A= 1 0 0 1 , B= 0 1 1 0 ,

and observe thatF is closed under permutations. Letmbe a positive integer divisible by 10 and letX⊆[m/10] be a subset with no non-trivial solution to the equationx1+x2+x3= 3x4

(14)

that is of maximal size. We construct the followingm×m ternary matrix M. For any 1≤i≤m/5 and anyx∈X we put a copy ofAinM as follows:

M(i, i+x) =M(m/2 +i+ 2x, m/2 +i+ 3x) = 1

M(i, m/2 +i+ 3x) =M(m/2 +i+ 2x, i+x) = 0.

We set all other entries ofM to 2. LetAbe the collection ofq=m|X|/5≥m2_/₅₀_e20√logm

pairwise disjoint copies ofAinM that are created as above. Note that allA-copies inM

are separated by{n/2} × {n/2}, where there are two opposite quarters (with respect to the separation) that do not contain the entry 0 and the two other opposite quarters do not contain 1. Hence, everyA-copy must contain one entry from each quarter, andM does not contain copies ofB. The main observation is that all of theA-copies inM are actually copies fromA, soM contains exactly q A-copies.

To see this, suppose that the rows of an A-copy in M are i and j+n/2 for some 1≤i, j ≤ n/2, then there exist x1, x2, x3, x4 ∈ X such that the entries of the copy were taken from locations (i, i+x1),(i, m/2 +i+ 3x2),(m/2 +j, j−x3),(m/2 +j, m/2 +j+x4) inM and so we havei+x1=j−x3 andi+ 3x2=j+x4. Reordering these two equations we get that 3x2=x1+x3+x4, implying thatx1=x2=x3=x4 andj=i+ 2x1, so the aboveA-copy is indeed inA.

Letnbe an arbitrarily large positive integer divisible bym. GivenM as above, we create ann×n‘blowup’ matrixNas follows: For any 1≤i, j≤n,N(i, j) =M(bim/nc,bjm/nc). N

can also be seen as the result of replacing any entryeinMwith ann/m×n/mmatrix of entries equal toe. The total number ofA-copies inN is exactly (n/m)4_q₌_n4_|_X_|_/₅_m3_{, whereas} the maximum number of pairwise disjointA-copies inN is exactly (n/m)2_q₌_n2_|_X_|_/₅_m_. Assuming that >0 is small enough and pickingmto be the smallest integer divisible by 10 and larger thanclog _{for a suitable absolute constant}_{c >}_{0 gives that}_|_X_|_/₅_{m >}_{, but the}

number ofA-copies inN is at mostn4_|_X_|_/₅_m3_≤_n4_/m2_<−clog_n4 _{as needed.}

J

6 Concluding remarks

Generally, understanding property testing seems to be easier for objects that are highly symmetric. A good example of this phenomenon is the problem of testing properties of (ordered) one-dimensional binary vectors. There are some results on this subject, but it is far from being well understood. On the other hand, the binary vector propertiesP that are invariant under permutations of the entries (these are the properties in which for any vector

vthat satisfies P, any permutation of the entries ofv also satisfiesP) are merely those that depend only on the length and the Hamming weight of a vector. This makes the task of testing these properties trivial.

A central example of the symmetry phenomenon is the well investigated subject of property testing in (unordered) graphs, that considers only properties of functions from

[n] 2

to{0,1} that are invariant under permutations of [n₂]

induced by permutations on [n]. That is, if a labeled graphGsatisfies some graph property, then any relabeling of its vertices results in a graph that also satisfies this property. Indeed, the proof of the only known general result on testing properties ofordered graphs(here the functions are generally not invariant under permutations), given in [2], is substantially more complicated than the proof of its unordered analogue. See [25] for further discussion on the role of symmetries in property testing.

In general, matrices (with row and column order) do not have any symmetries. Therefore, the above reasoning suggests that proving results on the testability of matrix properties is

(15)

likely to be harder than proving similar results on properties of matrices where only the rows are ordered (such properties are invariant under permutations of the columns), which might be harder in turn than proving the same results for properties of matrices without row and column orders, i.e. bipartite graphs, as these properties are invariant under permutations of both the rows and the columns.

Theorem 2 is a weak removal lemma for binary matrices with row and column order, while Theorem 3 is an induced removal lemma for binary matrices without row and column order, and our generalization of it, Theorem 5, is an induced removal lemma for binary matrices with a row order but without a column order. It will be very interesting to settle Problem 4, that asks whether a polynomial induced removal lemma exists for binary matrices with row and column orders.

It will be interesting to expand our knowledge of matrices in higher dimensions and of ordered combinatorial objects in general. Proposition 14 is a non-induced removal lemma for (multi-dimensional) matrices without row and column orders. It will be interesting to get results of this type for less symmetric objects, ultimately for ordered multi-dimensional matrices. We believe that providing a direct solution (that does not go through Theorem 1) for the following seemingly innocent problem is of interest, and might help providing techniques to help settling Problem 15 in general. In what follows, theheight of a 2×2 submatrixS in ann×nmatrixM is the difference between the indices of the rows ofS inM, divided byn.

IProblem 18. LetA=

0 1 2 3

and suppose that an n×nmatrix containsn2 _pairwise disjoint copies ofA. Show (without relying on Theorem 1) that there existsδ=δ()such thatM contains an A-copy with height at leastδ.

The three dimensional analogue of this problem is obviously also of interest. Here Theorem 1 cannot be applied, so currently we do not know whether such aδ=δ() that depends only on

exists. Solving the three-dimensional analogue will settle Problem 15 when the forbidden hypermatrix has dimensions 2×2×2, and the techniques might lead to settling Problem 15 in its most general form.

As a final remark, in the results in whichδ−1 is polynomial in−1 we have not tried to obtain tight bounds on the dependence, and it may be interesting to do so.

Acknowledgements. The authors wish to thank the anonymous reviewers for useful feed-back.

References

1 N. Alon. Testing subgraphs in large graphs. Random Structures and Algorithms, 21:359– 370, 2002.

2 N. Alon, O. Ben-Eliezer, and E. Fischer. Testing hereditary properties of ordered graphs and matrices. arXiv, 1704:02367, 2017.

3 N. Alon, R. A. Duke, H. Lefmann, V. Rödl, and R. Yuster. The algorithmic aspects of the regularity lemma. Journal of Algorithms, 16:80–109, 1994.

4 N. Alon, E. Fischer, M. Krivelevich, and M. Szegedy. Efficient testing of large graphs. Combinatorica, 20:451–476, 2000.

5 N. Alon, E. Fischer, and I. Newman. Efficient testing of bipartite graphs for forbidden induced subgraphs. SIAM J. Comput., 37:959–976, 2007.

6 N. Alon and J. Fox. Easily testable graph properties.Combin. Probab. Comput, 24:646–657, 2015.