• No results found

18.175: Lecture 33 Ergodic theory

N/A
N/A
Protected

Academic year: 2021

Share "18.175: Lecture 33 Ergodic theory"

Copied!
29
0
0

Loading.... (view fulltext now)

Full text

(1)

18.175: Lecture 33 Ergodic theory

Scott Sheffield

MIT

(2)

Outline

Setup

Birkhoff’s ergodic theorem

(3)

Outline

Setup

Birkhoff’s ergodic theorem

(4)

Motivating problem

I Consider independent bond percolation on Z2 with some fixed parameter p > 1/2. Look at some simulations.

I Let Ω be the set of maps from the edges of Z2 to {0, 1}, F the usual product σ-algebra, and P = Pp the probability measure.

I Now consider an n × n box centered at 0 and ask: what fraction of the points in that box belong to an infinite clusters? Does this fraction converge to a limit (in some sense: in probability, or maybe almost surely) as n → ∞?

I Let Cx = 1x ∈infinitecluster. If the Cx were independent or each other, then this would just be a law of large numbers question. But the Cx are not independent of each other — far from it.

I We don’t have independence. We have translation invariance instead. Is that good enough?

I More general: Cx distributed in some translation invariant way, EC0< ∞. Is mean of Cx (on large box) nearly constant?

(5)

Motivating problem

I Consider independent bond percolation on Z2 with some fixed parameter p > 1/2. Look at some simulations.

I Let Ω be the set of maps from the edges of Z2 to {0, 1}, F the usual product σ-algebra, and P = Pp the probability measure.

I Now consider an n × n box centered at 0 and ask: what fraction of the points in that box belong to an infinite clusters? Does this fraction converge to a limit (in some sense: in probability, or maybe almost surely) as n → ∞?

I Let Cx = 1x ∈infinitecluster. If the Cx were independent or each other, then this would just be a law of large numbers question. But the Cx are not independent of each other — far from it.

I We don’t have independence. We have translation invariance instead. Is that good enough?

I More general: Cx distributed in some translation invariant way, EC0< ∞. Is mean of Cx (on large box) nearly constant?

(6)

Motivating problem

I Consider independent bond percolation on Z2 with some fixed parameter p > 1/2. Look at some simulations.

I Let Ω be the set of maps from the edges of Z2 to {0, 1}, F the usual product σ-algebra, and P = Pp the probability measure.

I Now consider an n × n box centered at 0 and ask: what fraction of the points in that box belong to an infinite clusters? Does this fraction converge to a limit (in some sense: in probability, or maybe almost surely) as n → ∞?

I Let Cx = 1x ∈infinitecluster. If the Cx were independent or each other, then this would just be a law of large numbers question. But the Cx are not independent of each other — far from it.

I We don’t have independence. We have translation invariance instead. Is that good enough?

I More general: Cx distributed in some translation invariant way, EC0< ∞. Is mean of Cx (on large box) nearly constant?

(7)

Motivating problem

I Consider independent bond percolation on Z2 with some fixed parameter p > 1/2. Look at some simulations.

I Let Ω be the set of maps from the edges of Z2 to {0, 1}, F the usual product σ-algebra, and P = Pp the probability measure.

I Now consider an n × n box centered at 0 and ask: what fraction of the points in that box belong to an infinite clusters? Does this fraction converge to a limit (in some sense: in probability, or maybe almost surely) as n → ∞?

I Let Cx = 1x ∈infinitecluster. If the Cx were independent or each other, then this would just be a law of large numbers question.

But the Cx are not independent of each other — far from it.

I We don’t have independence. We have translation invariance instead. Is that good enough?

I More general: Cx distributed in some translation invariant way, EC0< ∞. Is mean of Cx (on large box) nearly constant?

(8)

Motivating problem

I Consider independent bond percolation on Z2 with some fixed parameter p > 1/2. Look at some simulations.

I Let Ω be the set of maps from the edges of Z2 to {0, 1}, F the usual product σ-algebra, and P = Pp the probability measure.

I Now consider an n × n box centered at 0 and ask: what fraction of the points in that box belong to an infinite clusters? Does this fraction converge to a limit (in some sense: in probability, or maybe almost surely) as n → ∞?

I Let Cx = 1x ∈infinitecluster. If the Cx were independent or each other, then this would just be a law of large numbers question.

But the Cx are not independent of each other — far from it.

I We don’t have independence. We have translation invariance instead. Is that good enough?

I More general: Cx distributed in some translation invariant way, EC0< ∞. Is mean of Cx (on large box) nearly constant?

(9)

Motivating problem

I Consider independent bond percolation on Z2 with some fixed parameter p > 1/2. Look at some simulations.

I Let Ω be the set of maps from the edges of Z2 to {0, 1}, F the usual product σ-algebra, and P = Pp the probability measure.

I Now consider an n × n box centered at 0 and ask: what fraction of the points in that box belong to an infinite clusters? Does this fraction converge to a limit (in some sense: in probability, or maybe almost surely) as n → ∞?

I Let Cx = 1x ∈infinitecluster. If the Cx were independent or each other, then this would just be a law of large numbers question.

But the Cx are not independent of each other — far from it.

I We don’t have independence. We have translation invariance instead. Is that good enough?

I More general: Cx distributed in some translation invariant way, EC0< ∞. Is mean of Cx (on large box) nearly constant?

(10)

Rephrasing problem

I Let θx be the translation of the Z2 that moves 0 to x . Each θx induces a measure-preserving translation of Ω. Then Cx(ω) = C0−x(ω)). So summing up the Cx values is the same as summing up the C0x(ω)) value over a range of x .

I The group of translations is generated by a one-step vertical and a one-step horizontal translation. Refer to the

corresponding (commuting, P-preserving) maps on Ω as φ1 and φ2.

I We’re interested in averaging C0j1φk2ω) over a range of (j , k) pairs.

I Let’s simplify matters still further and consider the one-dimensional problem. In this case, we have a random variable X and we study empirical averages of the form

N−1

N

X

n=1

X (φnω).

(11)

Rephrasing problem

I Let θx be the translation of the Z2 that moves 0 to x . Each θx induces a measure-preserving translation of Ω. Then Cx(ω) = C0−x(ω)). So summing up the Cx values is the same as summing up the C0x(ω)) value over a range of x .

I The group of translations is generated by a one-step vertical and a one-step horizontal translation. Refer to the

corresponding (commuting, P-preserving) maps on Ω as φ1

and φ2.

I We’re interested in averaging C0j1φk2ω) over a range of (j , k) pairs.

I Let’s simplify matters still further and consider the one-dimensional problem. In this case, we have a random variable X and we study empirical averages of the form

N−1

N

X

n=1

X (φnω).

(12)

Rephrasing problem

I Let θx be the translation of the Z2 that moves 0 to x . Each θx induces a measure-preserving translation of Ω. Then Cx(ω) = C0−x(ω)). So summing up the Cx values is the same as summing up the C0x(ω)) value over a range of x .

I The group of translations is generated by a one-step vertical and a one-step horizontal translation. Refer to the

corresponding (commuting, P-preserving) maps on Ω as φ1

and φ2.

I We’re interested in averaging C0j1φk2ω) over a range of (j , k) pairs.

I Let’s simplify matters still further and consider the one-dimensional problem. In this case, we have a random variable X and we study empirical averages of the form

N−1

N

X

n=1

X (φnω).

(13)

Rephrasing problem

I Let θx be the translation of the Z2 that moves 0 to x . Each θx induces a measure-preserving translation of Ω. Then Cx(ω) = C0−x(ω)). So summing up the Cx values is the same as summing up the C0x(ω)) value over a range of x .

I The group of translations is generated by a one-step vertical and a one-step horizontal translation. Refer to the

corresponding (commuting, P-preserving) maps on Ω as φ1

and φ2.

I We’re interested in averaging C0j1φk2ω) over a range of (j , k) pairs.

I Let’s simplify matters still further and consider the one-dimensional problem. In this case, we have a random variable X and we study empirical averages of the form

N−1

N

X

n=1

X (φnω).

(14)

Examples: stationary X

j

sequences

I Could take Xj i.i.d.

I Or Xn could be a Markov chain, with each individual Xj

distributed according to a stationary distribution π.

I Rotations of the circle. Say X0 is uniform in [0, 1] and generally Xj = X0+ αj modulo 1.

I If X0, X1, . . . is stationary and g : R{0,1,...}→ R is measurable, then Yk = g (Xk, Xk+1, . . .) is stationary.

I Bernoulli shift. X0, X1, . . . are i.i.d. and Yk =P

j =1Xk+j2−j.

I Can constructed two-sided (Z-indexed) stationary sequence from one-sided stationary sequence by Kolmogorov extension.

I What if Xi are i.i.d. tosses of a p-coin, where p is itself random?

(15)

Examples: stationary X

j

sequences

I Could take Xj i.i.d.

I Or Xn could be a Markov chain, with each individual Xj distributed according to a stationary distribution π.

I Rotations of the circle. Say X0 is uniform in [0, 1] and generally Xj = X0+ αj modulo 1.

I If X0, X1, . . . is stationary and g : R{0,1,...}→ R is measurable, then Yk = g (Xk, Xk+1, . . .) is stationary.

I Bernoulli shift. X0, X1, . . . are i.i.d. and Yk =P

j =1Xk+j2−j.

I Can constructed two-sided (Z-indexed) stationary sequence from one-sided stationary sequence by Kolmogorov extension.

I What if Xi are i.i.d. tosses of a p-coin, where p is itself random?

(16)

Examples: stationary X

j

sequences

I Could take Xj i.i.d.

I Or Xn could be a Markov chain, with each individual Xj distributed according to a stationary distribution π.

I Rotations of the circle. Say X0 is uniform in [0, 1] and generally Xj = X0+ αj modulo 1.

I If X0, X1, . . . is stationary and g : R{0,1,...}→ R is measurable, then Yk = g (Xk, Xk+1, . . .) is stationary.

I Bernoulli shift. X0, X1, . . . are i.i.d. and Yk =P

j =1Xk+j2−j.

I Can constructed two-sided (Z-indexed) stationary sequence from one-sided stationary sequence by Kolmogorov extension.

I What if Xi are i.i.d. tosses of a p-coin, where p is itself random?

(17)

Examples: stationary X

j

sequences

I Could take Xj i.i.d.

I Or Xn could be a Markov chain, with each individual Xj distributed according to a stationary distribution π.

I Rotations of the circle. Say X0 is uniform in [0, 1] and generally Xj = X0+ αj modulo 1.

I If X0, X1, . . . is stationary and g : R{0,1,...}→ R is measurable, then Yk = g (Xk, Xk+1, . . .) is stationary.

I Bernoulli shift. X0, X1, . . . are i.i.d. and Yk =P

j =1Xk+j2−j.

I Can constructed two-sided (Z-indexed) stationary sequence from one-sided stationary sequence by Kolmogorov extension.

I What if Xi are i.i.d. tosses of a p-coin, where p is itself random?

(18)

Examples: stationary X

j

sequences

I Could take Xj i.i.d.

I Or Xn could be a Markov chain, with each individual Xj distributed according to a stationary distribution π.

I Rotations of the circle. Say X0 is uniform in [0, 1] and generally Xj = X0+ αj modulo 1.

I If X0, X1, . . . is stationary and g : R{0,1,...}→ R is measurable, then Yk = g (Xk, Xk+1, . . .) is stationary.

I Bernoulli shift. X0, X1, . . . are i.i.d. and Yk =P

j =1Xk+j2−j.

I Can constructed two-sided (Z-indexed) stationary sequence from one-sided stationary sequence by Kolmogorov extension.

I What if Xi are i.i.d. tosses of a p-coin, where p is itself random?

(19)

Examples: stationary X

j

sequences

I Could take Xj i.i.d.

I Or Xn could be a Markov chain, with each individual Xj distributed according to a stationary distribution π.

I Rotations of the circle. Say X0 is uniform in [0, 1] and generally Xj = X0+ αj modulo 1.

I If X0, X1, . . . is stationary and g : R{0,1,...}→ R is measurable, then Yk = g (Xk, Xk+1, . . .) is stationary.

I Bernoulli shift. X0, X1, . . . are i.i.d. and Yk =P

j =1Xk+j2−j.

I Can constructed two-sided (Z-indexed) stationary sequence from one-sided stationary sequence by Kolmogorov extension.

I What if Xi are i.i.d. tosses of a p-coin, where p is itself random?

(20)

Examples: stationary X

j

sequences

I Could take Xj i.i.d.

I Or Xn could be a Markov chain, with each individual Xj distributed according to a stationary distribution π.

I Rotations of the circle. Say X0 is uniform in [0, 1] and generally Xj = X0+ αj modulo 1.

I If X0, X1, . . . is stationary and g : R{0,1,...}→ R is measurable, then Yk = g (Xk, Xk+1, . . .) is stationary.

I Bernoulli shift. X0, X1, . . . are i.i.d. and Yk =P

j =1Xk+j2−j.

I Can constructed two-sided (Z-indexed) stationary sequence from one-sided stationary sequence by Kolmogorov extension.

I What if Xi are i.i.d. tosses of a p-coin, where p is itself random?

(21)

Definitions

I Say that A is invariant if the symmetric difference between φ(A) and A has measure zero.

I Observe: class I of invariant events is a σ-field.

I Measure preserving transformation is called ergodic if I is trivial, i.e., every set A ∈ I satisfies P(A) ∈ {0, 1}.

I Example: If Ω = R{0,1,...} and A is invariant, then A is necessarily in tail σ-field T , hence has probability zero or one by Kolmogorov’s 0 − 1 law. So sequence is ergodic (the shift on sequence space R{0,1,2,...} is ergodic..

(22)

Definitions

I Say that A is invariant if the symmetric difference between φ(A) and A has measure zero.

I Observe: class I of invariant events is a σ-field.

I Measure preserving transformation is called ergodic if I is trivial, i.e., every set A ∈ I satisfies P(A) ∈ {0, 1}.

I Example: If Ω = R{0,1,...} and A is invariant, then A is necessarily in tail σ-field T , hence has probability zero or one by Kolmogorov’s 0 − 1 law. So sequence is ergodic (the shift on sequence space R{0,1,2,...} is ergodic..

(23)

Definitions

I Say that A is invariant if the symmetric difference between φ(A) and A has measure zero.

I Observe: class I of invariant events is a σ-field.

I Measure preserving transformation is called ergodic if I is trivial, i.e., every set A ∈ I satisfies P(A) ∈ {0, 1}.

I Example: If Ω = R{0,1,...} and A is invariant, then A is necessarily in tail σ-field T , hence has probability zero or one by Kolmogorov’s 0 − 1 law. So sequence is ergodic (the shift on sequence space R{0,1,2,...} is ergodic..

(24)

Definitions

I Say that A is invariant if the symmetric difference between φ(A) and A has measure zero.

I Observe: class I of invariant events is a σ-field.

I Measure preserving transformation is called ergodic if I is trivial, i.e., every set A ∈ I satisfies P(A) ∈ {0, 1}.

I Example: If Ω = R{0,1,...} and A is invariant, then A is necessarily in tail σ-field T , hence has probability zero or one by Kolmogorov’s 0 − 1 law. So sequence is ergodic (the shift on sequence space R{0,1,2,...} is ergodic..

(25)

Outline

Setup

Birkhoff’s ergodic theorem

(26)

Outline

Setup

Birkhoff’s ergodic theorem

(27)

Ergodic theorem

I Let φ be a measure preserving transformation of (Ω, F , P).

Then for any X ∈ L1 we have 1

n

n−1

X

m=0

X (φmω) → E (X |I)

a.s. and in L1.

I Note: if sequence is ergodic, then E (X |I) = E (X ), so the limit is just the mean.

I Proof takes a couple of pages. Shall we work through it?

(28)

Ergodic theorem

I Let φ be a measure preserving transformation of (Ω, F , P).

Then for any X ∈ L1 we have 1

n

n−1

X

m=0

X (φmω) → E (X |I)

a.s. and in L1.

I Note: if sequence is ergodic, then E (X |I) = E (X ), so the limit is just the mean.

I Proof takes a couple of pages. Shall we work through it?

(29)

Ergodic theorem

I Let φ be a measure preserving transformation of (Ω, F , P).

Then for any X ∈ L1 we have 1

n

n−1

X

m=0

X (φmω) → E (X |I)

a.s. and in L1.

I Note: if sequence is ergodic, then E (X |I) = E (X ), so the limit is just the mean.

I Proof takes a couple of pages. Shall we work through it?

References

Related documents

These requirements as stated reflect refinement of the November 2011 draft version of this document. Requirement 2 now combines two aspects of solutions for

The Ontario Superior Court of Justice in two recent decisions regarding parental alienation accepted the role of child’s counsel as adducing all relevant evidence pertaining to

To explore how allele frequencies change in populations of organisms, you will first build a computer spreadsheet that models the changes in a hypothetical gene pool from

– Member of the Canadian Investor Protection Fund, TD Waterhouse Private Investment Counsel Inc., TD Waterhouse Private Banking (offered by The Toronto-Dominion Bank) and

Some organism utilize simple inorganic substances such as carbon dioxide and water, whereas other organisms utilize complex substances which are required to be

The objectives of this study were: (i) to identify important traffic operation and roadway geometric factors influencing urban midblock crash frequencies by collision type and

Dutch Cycling Embassy is partij voor dipplomatieke posten en Holland Branding. DCE kan deelnemende publieke en

Our Virtual Agent can income you quickly identify and never common issues with sending or receiving email I didn't receive an email someone sent it my Outlookcom.. What sent it mean