On
Monotonicity
Testing
Boolean
Isoperimetric
type
Theorems
Subhash
Khot
Dor
Minzer
Property Testing
•
subset
P
of universe
(graphs, function, etc.)
Property:
•
accepts
an object in
P
;
•
rejects
if
-
far
from
P
.
(one-sided)
tester
•
Specific to each property.
Hamming
Distance
P
P
far
close
BF Monotonicity Testing
Universe: Boolean Functions
partial order on Boolean cube
P={f: f Monotone}
• P monotone functions.
Distance from monotonicity
Normalized hamming:
fraction of inputs
f,g differ on
Normalized hamming:
fraction of inputs
f,g differ on
}
1
,
0
{
}
1
,
0
{
:
n
f
i
i
y
x
i
y
x
)
(
)
(
x
f
y
f
y
x
)
,
(
min
)
(
f
f
g
P
g
Partial Timeline
1998 2002 2013 2014 2015
Introduced
by GGLRS
Introduced
by GGLRS
Lower bound
FLNRRS
Lower bound
FLNRRS
o(n)
algorithm CS
o(n)
algorithm CS
Improved
algorithm
CST
Improved
algorithm
CST
Tight
algorithm
Tight
algorithm
All proposed algorithms
•
Pair Testers
•
Non Adaptive
All proposed algorithms
One-sided Pair Tester
PickQuery xf(x)y ,
f(y)
Reject if non-mono. detected
One-Sided, Non-adaptive Pair-tester
•
Accept
Reject
w.h.p
f
P
( f) >
Edge Tester[GGLRS]
Pick xR{0, 1}nand iR[n] Query f(x), f(xi)
Reject if non-mono. detected
Thm[GGLRS]:
•
detection probability of edge-tester
(f)/n
Denote x
ifor x with
coordinate i flipped
Denote
x
ifor
x
with
Coming Up
Lower Bound
•
in the middle Section
•
0/1 outside.
random anti-dictator
•
Picks
x<y
points
•
(x, y)
O(
n)
non-adaptive pair-tester
•
w/probability
O(1/
n)
Detects violation
n
Lower Bound[FLNRRS]
negligible
negligible
negligible
negligible
1
0
Hamming
Weight
Hamming
Weight
0n 1n
Simplistic
:
detection prob. for
n
-
distance is
n
edge-tester’s?
NO
!?
Simplistic
:
detection prob. for
n
-
distance is
n
edge-tester’s?
NO
!?
Pick
x<y
,
(x, y) =
n
Query
f(x)
,
f(y)
Reject
if non-mono. detected
n
c
x
n
2
i
x
x
k even.
majority on a
unless a is balanced: i anti dictator on b.
k even.
majority on a
unless a is balanced:
i anti dictator on b.
Tiebreaker function
detection prob
n
detection of edge
tester?
No!
Detection occurs only if
both points balanced in
a
distance
n/k
k/2 > 0 k/2 < 1 k/2 = 1-xi
set k=n/2
gives
(f)/n
or
(f)
2/
n
set
k=n/2
gives
(f)/n
or
(f)
2/
n
Pick
x
y
,
(x, y) =
n
Query
f(x)
,
f(y)
Reject
if non-mono. detected
n k
k
x
x
x
x
1,...,
,
1,...,
Previous Work
•
Testing is possible in
O(n/
)
queries.
•
Can one do better?
GGLRS
:
•
An affirmative answer by [CS 13],
possible in
O(n
7/8
-3/2)
•
Later improved by [CST14] O(n
5/6
-4).
Improve
:
•
with
O(n
1/2
-2)
queries.
•
Optimal up to logarithmic factors by
the lower bound of [FLNRRS].
:
Our Tester
Pick lR [½log(n)] Pick xR {0, 1}n
w/prob. ½ pick S{i| xi=0} of size 2l
o/w S{i|xi=1} of same size
Query f(x),
f(xS)
Reject if a non-mono. detected.
Theorem:
• detection prob. of our tester (f)2/n
Denote x
Sfor x with
coordinates S flipped
Denote
x
Sfor
x
with
coordinates
S
flipped
Proof’s
Road Map
G
f
-
negative edges
Event
R
Subgraph
G
’
Event
R
’
Then:
Estimate
R
’
Issuess:
-
Persistency
-
overcounting
Issuess:
-
Persistency
-
overcounting
U’ regular
D’ bounded degree
Large
U’
regular
D’
bounded degree
Analysis
Hypercube graph
• Vertices: {0,1}n
• Edges x↔xi
A Boolean function
• 0/1 labeling of vertexes
Subgraph
G
-• only monotonicity violating edges (red) • remove isolated vertices. • is a bipartite graph
1
1
0
0
0
0
0
0
1
1
0
0
1
1
1
1
1
1
Pick lR
[½log(n/log n)] Pick xR {0, 1}n
w/prob. xi=0} of size ½ pick S2l{i| o/w S{i|xi=1} of same size
Query f(x),
f(xS)
Event R
•
in
D
labeled
1
;
in
U
labeled
0
G
-Bipartite
•
x<y, (x, y)
G
-•
y<z
Consider
(x, y, z)
•
tester picks
x, z
•
f(z)=0
Event
R
:
1 1 0 0 0 0 0 0 1 1 1 1 0 0 0 0 1 1 0 0 D U
x
y
z
Pick lR [½log(n/log n)]
Pick xR {0, 1}n
w/prob. ½ pick S{i|xi=0} of size 2l
o/w S{i|xi=1} of same size
Query f(x), f(xS)
Reject if a violation is found.
} : ) ( , 0 ) ( , : ) ,
{(x z x D f z y x x y z
R
vs
•
Thm
•
detection charged to
(x, y)
times
#edges
in
G
-Proof
•
f(z)
f(y)
? (
persistency
)
•
z
might be counted
multiple times
(for many
y
’s)
Issues!
1 1 0 0 0 0 0 0 1 1 1 1 0 0 0 0 1 1 0 0x
y
z
Pick lR [½log(n/log n)]
Pick xR {0, 1}n
w/prob. ½ pick S{i|xi=0} of size 2l
o/w S{i|xi=1} of same size
Query f(x), f(xS)
Reject if a violation is found.
Solution
:
Move to Sub-Graph G’ s.t
these issues are negligible
.
Solution
:
Move to Sub-Graph
G’
s.t
these issues are negligible
.
)
/
)
(
(
~
)
Sub-graph
•
subgraph
G’=
D’
U’, E’
of
G
-and
d
s.t.
•
U’
-regular:
deg(y)=d
for every
y
U’
•
D’
-bound:
deg(x)
2d
for every
x
D’
•
Size: let
=
|U’|/2
nLemma(main):
•
main technical difficulty of this work
Proof is the
•
R’
•
Let us prove that
Assuming this
•
Set
=
n/log n
and estimate detection
probability conditioned on distance =
Proof:
How?
Consider all possible d’s
(powers of 2)
How?
Consider all possible
d
’s
(powers of 2)
) ) ( (
~ 2
2d f
) / ) ( ( ~ ) '
Pr(R f 2 n
} : ) ( , 0 ) ( , : ) ,
{(x z x D f z y x x y z
R'{(x,z):xD' ,f(z) 0 , y (x) :x y z}
A Bound On Total Influence
Let
• I-=n . Fraction negative edges
• I+=n . Fraction positive edges
• I=n . Fraction influential edges
Lemma
• If I-/I+ < 1-δ then I=Oδ(n) • One line proof.
Corollary
• negative and the positive influence must be
very close for the total influence to be large.
We may therefore
assume
I<O(
n)
.
O/w edge-tester
detects
non-monotonicity w/prob.
(
n)
!
We may therefore
assume
I<O(
n)
.
O/w edge-tester
detects
non-monotonicity w/prob.
(
n)
Non-Persistency a Non-Issue
•
y
is called (
τ-
1
)-non-persistent if
where
z
is a random point
τ-
1
steps above
y
Definition
•
Let
be fraction of (
τ-
1
)-non-persistent
inputs. Then
Lemma
•
Most vertices in
U’
are presistent!
Corrolary
Due to regularity:
Almost all edges!
Due to regularity:
Almost all edges!
Previous lemma
Previous lemma
31
)) ( )
( (
Prz f z f y
)
(
)
(
)
(
o
n
I
n
o
n
I
n n /log
•
Is a non-Issue
•
Probability
z
is above
two
y
’s is negligible
Over-counting
•
Simple calculation
(inclusion-exclusion).
The rest
Overcoming Over-counting
•subgraph G’=D’U’, E’ ofG- and d s.t.
•U’-regular: Deg(y)=d for every yU’
•D’-bound: Deg(x)2d for every xD’
•Size: let = |U’|/2n
Lemma(main):
• main technical difficulty of this work
Proof is the
•R’
•Let us prove that
Assuming this
•Set =n/log n and estimate detection probability conditioned on distance =
Proof:
)) ( (
~ 2
Boolean
Isoperimetric
type
Isoperimetric Inequalities
(
Directed
)
Sensitivity )Directed( Sensitivity
•
For x such that f(x)=1
•
If(x) – The number of sensitive (influential) edges touching x
.
•
I
-f(x) - The number of negatively sensitive (influential) edges touching x
•
For f(x)=0, both are defined to be 0
.
Well known
Well known
•
I var(f)
GGLRS: directed parallel
GGLRS: directed parallel
•
I- (f)
1
1
1
1
1
1
0
0
0
0 00
0
In CS13
In CS13
• I
-f Γ-f,matching =Ω(ε(f)2)
A Directed Version
A Directed Version
•
Margulis inequality
If Γf =Ω(var(f)2)
Directed version
Directed version
•
Talagrand generalization
of Margulis
?
1
1
0
0
0
0
0
0
1
1
1
1
0
0
0
0
1
1 D
U
Directed – Undirected Analogy
Maximal matching in G
-Maximal matching in
G
-Fraction of vertices
with non-zero
sensitivity
Fraction of vertices
with non-zero
Isoperimetric Inequalities
•
For every Boolean function
f
Thm[Talagrand 93’]
Thm[Talagrand 93’]
•
var(f)
for Boolean
f
is roughly the size of
the minority value
•
Inductive step: split according to a
variable, account for the difference
between those functions
.
proof
proof
•
we prove a directed parallel
For our purposes
For our p
urposes
If(x) – The number of sensitive(influential)
edges touching x
.
If(x) – The number of sensitive(influential)
edges touching x . 1 1 1 1 1 1 0 0 0
0 00
0 0
))
(var(
)
)
(
(
I
x
f
E
f
))
(
(
)
)
(
(
I
x
f
Isoperimetric Inequalities
Remarks
•
proof is very different from Talagrand’s
•
Why: (f) misbehaves in induction
•
Switching
•
Results in a function on
the same number of bits
•
Monotonize on
i
Switch(i)
[
GGLRS
]
•
Monotonize on a set of
indices
•
Not commutative
Sequential
applicatio
n
•
Ρ
a permutation,
S
ρapplies
switches in the order of
ρ
•
γ(f) = E
ρ(Δ(f,S
ρ°f))
Distance
by
Switches
1 0 1 0i
i
)}
(
),
(
max{
)
1
,
)(
(
x
f
i
x
f
x
x
f
S
i i
Distance By Switches
Lemma(Fatal, Ron)
•
For every
f
:
ε(f) ≤ γ(f) ≤
2
ε(f)
Proof
• Left side clear. For the right side observe that for every f,g & i: Δ(Si°g,Si°f)≤ Δ(g,f)
• Let g be the closest monotone, then
• Δ(g,Sρ°f) = Δ(Sρ°g,Sρ°f) ≤ Δ(g,f) = ε(f)
• By the triangle inequality
Overview of Proof
•
Analyze the simpler case in
which every variable has only
negative or positive influence
Pure
Functions
•
Reduce the general case to
pure functions using an
operator called
split
.
Reduce
•
On the splitting order
.
Throw
Pure (Unate) functions
•
f
is
pure
if each variable has either positive or
negative (not both) influence on
f
Definition
•
Let
A
be the set of positively influential variables:
p
z=E
y(f(y,x
A=z))
•
Define
g
to be constant
0/1
on restriction
z
according to
p
z.Why is it simple for pure functions?
•
g
is monotone (active only on positive variables)
•
f
is
O(ε(f))
close to
g
(proof uses FKG inequality)
Claims
} 2 / 1 {
1
)
,
Pure functions
•
ε(f)≈Δ(g,f)=E
z
(Δ(g(y,x
A=z),f(y,x
A=z)))
•
The last term is
≈E
z
(var(f(y,x
A=z)))
Observe
•
last inequality is Talagrand’s!
Finishing the proof
))
(
(
))]
,
(
(var(
[
]
)
(
[
]
)
(
[
]
)
(
[
) , ( , ) , ( ,f
z
x
f
E
y
I
E
y
I
E
x
I
E
A z z x f y z z x f y z f x A A
} 2 / 1 {1
)
,
Open Questions
Adaptivity
•
Chen et al showed a matching
lower bound for two sided testers
•
Can adaptivity help?(current
lower bound logarithmic)
Isoperime
tric
•
No dependence on the dimension
in Talagrand’s theorem
.
•
Is the dependency on n necessary
in the directed version