• No results found

Monotonicity.ppsx

N/A
N/A
Protected

Academic year: 2020

Share "Monotonicity.ppsx"

Copied!
34
0
0

Loading.... (view fulltext now)

Full text

(1)

On

Monotonicity

Testing

Boolean

Isoperimetric

type

Theorems

Subhash

Khot

Dor

Minzer

(2)

Property Testing

subset

P

of universe

(graphs, function, etc.)

Property:

accepts

an object in

P

;

rejects

if

-

far

from

P

.

(one-sided)

tester

Specific to each property.

Hamming

Distance

P

P

far

close

(3)

BF Monotonicity Testing

Universe: Boolean Functions

partial order on Boolean cube

P={f: f Monotone}

• P monotone functions.

Distance from monotonicity

Normalized hamming:

fraction of inputs

f,g differ on

Normalized hamming:

fraction of inputs

f,g differ on

}

1

,

0

{

}

1

,

0

{

:

n

f

i

i

y

x

i

y

x

)

(

)

(

x

f

y

f

y

x

)

,

(

min

)

(

f

f

g

P

g

(4)

Partial Timeline

1998 2002 2013 2014 2015

Introduced

by GGLRS

Introduced

by GGLRS

Lower bound

FLNRRS

Lower bound

FLNRRS

o(n)

algorithm CS

o(n)

algorithm CS

Improved

algorithm

CST

Improved

algorithm

CST

Tight

algorithm

Tight

algorithm

All proposed algorithms

Pair Testers

Non Adaptive

All proposed algorithms

(5)

One-sided Pair Tester

PickQuery xf(x)y ,

f(y)

Reject if non-mono. detected

One-Sided, Non-adaptive Pair-tester

Accept

Reject

w.h.p

f

P

( f) >

(6)

Edge Tester[GGLRS]

Pick xR{0, 1}nand iR[n] Query f(x), f(xi)

Reject if non-mono. detected

Thm[GGLRS]:

detection probability of edge-tester

(f)/n

Denote x

i

for x with

coordinate i flipped

Denote

x

i

for

x

with

(7)

Coming Up

Lower Bound

(8)

in the middle Section

0/1 outside.

random anti-dictator

Picks

x<y

points

(x, y)

O(

n)

non-adaptive pair-tester

w/probability

O(1/

n)

Detects violation

n

Lower Bound[FLNRRS]

negligible

negligible

negligible

negligible

1

0

Hamming

Weight

Hamming

Weight

0n 1n

Simplistic

:

detection prob. for

n

-

distance is

n

edge-tester’s?

NO

!?

Simplistic

:

detection prob. for

n

-

distance is

n

edge-tester’s?

NO

!?

Pick

x<y

,

(x, y) =

n

Query

f(x)

,

f(y)

Reject

if non-mono. detected

n

c

x

n

2

i

x

x

(9)

k even.

majority on a

unless a is balanced: i anti dictator on b.

k even.

majority on a

unless a is balanced:

i anti dictator on b.

Tiebreaker function

detection prob

n

detection of edge

tester?

No!

Detection occurs only if

both points balanced in

a

distance

n/k

k/2 > 0 k/2 < 1 k/2 = 1-xi

set k=n/2

gives

(f)/n

or

(f)

2

/

n

set

k=n/2

gives

(f)/n

or

(f)

2

/

n

Pick

x

y

,

(x, y) =

n

Query

f(x)

,

f(y)

Reject

if non-mono. detected

n k

k

x

x

x

x

1

,...,

,

1

,...,

(10)

Previous Work

Testing is possible in

O(n/

)

queries.

Can one do better?

GGLRS

:

An affirmative answer by [CS 13],

possible in

O(n

7/8

-3/2

)

Later improved by [CST14] O(n

5/6

-4

).

Improve

:

with

O(n

1/2

-2

)

queries.

Optimal up to logarithmic factors by

the lower bound of [FLNRRS].

:

(11)

Our Tester

Pick lR [½log(n)] Pick xR {0, 1}n

w/prob. ½ pick S{i| xi=0} of size 2l

o/w S{i|xi=1} of same size

Query f(x),

f(xS)

Reject if a non-mono. detected.

Theorem:

• detection prob. of our tester (f)2/n

Denote x

S

for x with

coordinates S flipped

Denote

x

S

for

x

with

coordinates

S

flipped

(12)

Proof’s

Road Map

G

f

-

negative edges

Event

R

Subgraph

G

Event

R

Then:

Estimate

R

Issuess:

-

Persistency

-

overcounting

Issuess:

-

Persistency

-

overcounting

U’ regular

D’ bounded degree

Large

U’

regular

D’

bounded degree

(13)

Analysis

Hypercube graph

• Vertices: {0,1}n

• Edges xxi

A Boolean function

• 0/1 labeling of vertexes

Subgraph

G

-• only monotonicity violating edges (red)remove isolated vertices.is a bipartite graph

1

1

0

0

0

0

0

0

1

1

0

0

1

1

1

1

1

1

Pick lR

[½log(n/log n)] Pick xR {0, 1}n

w/prob. xi=0} of size ½ pick S2l{i| o/w S{i|xi=1} of same size

Query f(x),

f(xS)

(14)

Event R

in

D

labeled

1

;

in

U

labeled

0

G

-

Bipartite

x<y, (x, y)

G

-•

y<z

Consider

(x, y, z)

tester picks

x, z

f(z)=0

Event

R

:

1 1 0 0 0 0 0 0 1 1 1 1 0 0 0 0 1 1 0 0 D U

x

y

z

Pick lR [½log(n/log n)]

Pick xR {0, 1}n

w/prob. ½ pick S{i|xi=0} of size 2l

o/w S{i|xi=1} of same size

Query f(x), f(xS)

Reject if a violation is found.

} : ) ( , 0 ) ( , : ) ,

{(x z x D f z y x x y z

(15)

R

vs

Thm

detection charged to

(x, y)

times

#edges

in

G

-Proof

f(z)

f(y)

? (

persistency

)

z

might be counted

multiple times

(for many

y

’s)

Issues!

1 1 0 0 0 0 0 0 1 1 1 1 0 0 0 0 1 1 0 0

x

y

z

Pick lR [½log(n/log n)]

Pick xR {0, 1}n

w/prob. ½ pick S{i|xi=0} of size 2l

o/w S{i|xi=1} of same size

Query f(x), f(xS)

Reject if a violation is found.

Solution

:

Move to Sub-Graph G’ s.t

these issues are negligible

.

Solution

:

Move to Sub-Graph

G’

s.t

these issues are negligible

.

)

/

)

(

(

~

)

(16)

Sub-graph

subgraph

G’=

D’

U’, E’

of

G

-

and

d

s.t.

U’

-regular:

deg(y)=d

for every

y

U’

D’

-bound:

deg(x)

2d

for every

x

D’

Size: let

=

|U’|/2

n

Lemma(main):

main technical difficulty of this work

Proof is the

R’

Let us prove that

Assuming this

Set

=



n/log n

and estimate detection

probability conditioned on distance =

Proof:

How?

Consider all possible d’s

(powers of 2)

How?

Consider all possible

d

’s

(powers of 2)

) ) ( (

~ 2

2d f

   ) / ) ( ( ~ ) '

Pr(R f 2 n

} : ) ( , 0 ) ( , : ) ,

{(x z x D f z y x x y z

R'{(x,z):xD' ,f(z) 0 , y (x) :x y z}

(17)

A Bound On Total Influence

Let

I-=n . Fraction negative edges

• I+=n . Fraction positive edges

• I=n . Fraction influential edges

Lemma

• If I-/I+ < 1-δ then I=Oδ(n) • One line proof.

Corollary

negative and the positive influence must be

very close for the total influence to be large.

We may therefore

assume

I<O(

n)

.

O/w edge-tester

detects

non-monotonicity w/prob.

(

n)

!

We may therefore

assume

I<O(

n)

.

O/w edge-tester

detects

non-monotonicity w/prob.

(

n)

(18)

Non-Persistency a Non-Issue

y

is called (

τ-

1

)-non-persistent if

where

z

is a random point

τ-

1

steps above

y

Definition

Let

be fraction of (

τ-

1

)-non-persistent

inputs. Then

Lemma

Most vertices in

U’

are presistent!

Corrolary

Due to regularity:

Almost all edges!

Due to regularity:

Almost all edges!

Previous lemma

Previous lemma

3

1

)) ( )

( (

Prz f zf y

)

(

)

(

)

(

o

n

I

n

o

n

I

n n /log 

(19)

Is a non-Issue

Probability

z

is above

two

y

’s is negligible

Over-counting

Simple calculation

(inclusion-exclusion).

The rest

Overcoming Over-counting

•subgraph G’=D’U’, E’ ofG- and d s.t.

•U’-regular: Deg(y)=d for every yU’

•D’-bound: Deg(x)2d for every xD’

•Size: let  = |U’|/2n

Lemma(main):

main technical difficulty of this work

Proof is the

R’

•Let us prove that

Assuming this

•Set =n/log n and estimate detection probability conditioned on distance =

Proof:

)) ( (

~ 2

(20)

Boolean

Isoperimetric

type

(21)

Isoperimetric Inequalities

(

Directed

)

Sensitivity )Directed( Sensitivity

For x such that f(x)=1

If(x) – The number of sensitive (influential) edges touching x

.

I

-f(x) - The number of negatively sensitive (influential) edges touching x

For f(x)=0, both are defined to be 0

.

Well known

Well known

I  var(f)

GGLRS: directed parallel

GGLRS: directed parallel

I- (f)

1

1

1

1

1

1

0

0

0

0 00

0

(22)

In CS13

In CS13

• I

-f Γ-f,matching =Ω(ε(f)2)

A Directed Version

A Directed Version

Margulis inequality

If Γf =Ω(var(f)2)

Directed version

Directed version

Talagrand generalization

of Margulis

?

1

1

0

0

0

0

0

0

1

1

1

1

0

0

0

0

1

1 D

U

Directed – Undirected Analogy

Maximal matching in G

-Maximal matching in

G

-Fraction of vertices

with non-zero

sensitivity

Fraction of vertices

with non-zero

(23)

Isoperimetric Inequalities

For every Boolean function

f

Thm[Talagrand 93’]

Thm[Talagrand 93’]

var(f)

for Boolean

f

is roughly the size of

the minority value

Inductive step: split according to a

variable, account for the difference

between those functions

.

proof

proof

we prove a directed parallel

For our purposes

For our p

urposes

If(x) – The number of sensitive(influential)

edges touching x

.

If(x) – The number of sensitive(influential)

edges touching x . 1 1 1 1 1 1 0 0 0

0 00

0 0

))

(var(

)

)

(

(

I

x

f

E

f

))

(

(

)

)

(

(

I

x

f

(24)

Isoperimetric Inequalities

Remarks

proof is very different from Talagrand’s

Why: (f) misbehaves in induction

(25)

Switching

Results in a function on

the same number of bits

Monotonize on

i

Switch(i)

[

GGLRS

]

Monotonize on a set of

indices

Not commutative

Sequential

applicatio

n

Ρ

a permutation,

S

ρ

applies

switches in the order of

ρ

γ(f) = E

ρ

(Δ(f,S

ρ°

f))

Distance

by

Switches

1 0 1 0

i

i

)}

(

),

(

max{

)

1

,

)(

(

x

f

i

x

f

x

x

f

S

i i

(26)

Distance By Switches

Lemma(Fatal, Ron)

For every

f

:

ε(f) ≤ γ(f) ≤

2

ε(f)

Proof

• Left side clear. For the right side observe that for every f,g & i: Δ(Sg,Sf)≤ Δ(g,f)

• Let g be the closest monotone, then

• Δ(g,Sρ°f) = Δ(Sρ°g,Sρ°f) ≤ Δ(g,f) = ε(f)

• By the triangle inequality

(27)

Overview of Proof

Analyze the simpler case in

which every variable has only

negative or positive influence

Pure

Functions

Reduce the general case to

pure functions using an

operator called

split

.

Reduce

On the splitting order

.

Throw

(28)

Pure (Unate) functions

f

is

pure

if each variable has either positive or

negative (not both) influence on

f

Definition

Let

A

be the set of positively influential variables:

p

z

=E

y

(f(y,x

A

=z))

Define

g

to be constant

0/1

on restriction

z

according to

p

z.

Why is it simple for pure functions?

g

is monotone (active only on positive variables)

f

is

O(ε(f))

close to

g

(proof uses FKG inequality)

Claims

} 2 / 1 {

1

)

,

(29)

Pure functions

ε(f)≈Δ(g,f)=E

z

(Δ(g(y,x

A

=z),f(y,x

A

=z)))

The last term is

≈E

z

(var(f(y,x

A

=z)))

Observe

last inequality is Talagrand’s!

Finishing the proof

))

(

(

))]

,

(

(var(

[

]

)

(

[

]

)

(

[

]

)

(

[

) , ( , ) , ( ,

f

z

x

f

E

y

I

E

y

I

E

x

I

E

A z z x f y z z x f y z f x A A

      } 2 / 1 {

1

)

,

(30)

Open Questions

Adaptivity

Chen et al showed a matching

lower bound for two sided testers

Can adaptivity help?(current

lower bound logarithmic)

Isoperime

tric

No dependence on the dimension

in Talagrand’s theorem

.

Is the dependency on n necessary

in the directed version

(31)

References

Related documents

Organizational Communication, Group and Organizational Behavior, Human Resource Development, Issues in Management, Leadership 1997 to date Adjunct Professor of

LuAnn Volkmer, ARNP-C, Pediatric Care Mental Health Specialist, Blank Children’s Hospital Adolescent Clinic Stephen Wolfe, MD, Professor Emeritus, Department of Family

For grants you have received for work outside the submitted work, you should disclose support ONLY from entities that could be perceived to be affected financially by the

ONS Office for National Statistics PDL Previously Developed Land QUELS Quality of Employment Land Supply R&amp;D Research and Development RES Regional Economic Strategy RHS

It is therefore of particular interest for applied research which approach is more adequate when analyzing discrete choice models: the standard conditional logit model or more

The C p distributions for δ f = 20 ◦ presented in figure 4(e) and δ f = 40 ◦ presented in figure 4(f) obtained along the centerline (y/s=0.5) of the model indicate that the zero

Internal images of a person affected by a loss of vision change with time depending on the time of occurrence of the defect, its progression, but also psycho-social

Gender relations affect household security, family well-being, planning, production and many other aspects of life (Bravo-Baumann, 2000). In the light of this