SIAM 05 24 17

(1)

Dimension Reduction for Systems with Slow Relaxation

SIAM DS17, May 24, 2017

Raman Venkataramani and Juan Restrepo

Shankar Venkataramani

(2)

(3)

(4)

‘Oil’ consists of $I$ distinct species with concentrations $c_i(t)$, $i = 1, 2, \ldots, I$, each decaying at a constant rate $\alpha_i$:

\[ \partial_t c_i(t) = -\alpha_i c_i(t), \qquad \alpha_i > 0, \qquad 1 \le i \le I. \]

Single observable: $M(t)$ is a weighted average of the concentrations $c_i$, with weights $\mu_i$:

\[ M(t) = \sum_i \mu_i c_i(t) = \sum_i \mu_i c_i(0)\, e^{-\alpha_i t}. \]

Impractical/impossible to separately measure the concentrations/amounts c_i of all the individual species.

Question: Can we use the measured quantity $M(t)$ to extract the various decay rates $\alpha_i$ using nonlinear fitting?

(5)

No!

Continuum limit

• One cannot hope to extract the decay rates $\alpha_i$, $i = 1, 2, \ldots, I$, from the measured function $M(t)$.

• We therefore consider the complementary limit, where the number of distinct species $I \gg 1$.
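A minimal numerical sketch of why the fit is ill-posed. The ten rates spread over [0.1, 1] and the fifty sampling times on [0, 10] are illustrative assumptions, not values from the talk. The decay curves $e^{-\alpha_i t}$ are nearly linearly dependent, so the design matrix they form is extremely ill-conditioned, and small errors in $M(t)$ are amplified into large errors in any recovered amplitudes.

```python
import numpy as np

# Hypothetical setup (not from the talk): 10 decay rates, 50 sample times.
alphas = np.linspace(0.1, 1.0, 10)
times = np.linspace(0.0, 10.0, 50)

# Design matrix: column i is the decay curve exp(-alpha_i * t).
design = np.exp(-np.outer(times, alphas))

# A huge condition number means the amplitudes (let alone the rates)
# cannot be recovered stably from noisy samples of M(t).
cond = np.linalg.cond(design)
print(f"condition number of the exponential design matrix: {cond:.3e}")
```

Even with the rates known exactly, solving for the amplitudes would be unstable here; fitting the rates as well only makes the problem harder.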

(6)

The model: Linear evaporation process

\[ \partial_t \rho(w, t) = -w\,\rho(w, t), \qquad M(t) = \int_0^1 \rho(w, t)\, dw. \]

Nondimensional evaporation rate: $0 \le w \le 1$. Continuum limit: $c_i \to \rho(w)$.

$\rho(w, 0)$ is “random” and $E[\rho(w, 0)] = 1$.

Schrödinger picture of the evolution of the system:

\[ \rho(w, t) = \rho(w, 0)\, e^{-wt}. \]

“Dual” Heisenberg picture, for an observable $G(t)$:

\[ G(t) = \int g(w)\,\rho(w, t)\, dw = \int g(w)\, e^{-wt} \rho(w, 0)\, dw = \int g_t(w)\,\rho(w, 0)\, dw, \qquad g_t(w) = e^{-wt} g(w). \]

(7)

Evolution of the total mass

(8)

Discrete time setting

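Discrete time (sampling interval $\tau$) = Takens delay-coordinate embedding

\[ \rho_{n+1}(w) = \Lambda^T \rho_n(w), \qquad g_{(n+1)\tau}(w) = \Lambda g_{n\tau}(w), \]

\[ \Lambda : C([0, 1]) \to C([0, 1]), \qquad \Lambda g(w) = e^{-w\tau} g(w). \]

$\Lambda[1] \ne 1$, so $\Lambda$ is not the Koopman operator for a dynamical system!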

Nonetheless, we can “formally” apply the Mori-Zwanzig projection operator technique.

\[ E[M_n] = \int_0^1 E[\rho_n(w)]\, dw = \int_0^1 e^{-nw\tau}\, dw = \frac{1 - e^{-n\tau}}{n\tau}. \]
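As a quick check of the expected mass $E[M_n] = (1 - e^{-n\tau})/(n\tau)$, the sketch below compares the closed form with a trapezoid-rule quadrature of $\int_0^1 e^{-nw\tau}\,dw$. The function names, the grid size, and the value $\tau = 0.5$ are illustrative assumptions.

```python
import numpy as np

def mean_mass_closed_form(n, tau):
    """E[M_n] = (1 - exp(-n*tau)) / (n*tau), with the n = 0 limit equal to 1."""
    x = np.asarray(n, dtype=float) * tau
    safe = np.where(x == 0.0, 1.0, x)  # avoid 0/0 at n = 0
    return np.where(x == 0.0, 1.0, -np.expm1(-safe) / safe)

def mean_mass_quadrature(n, tau, num_points=20001):
    """Trapezoid-rule approximation of the integral of exp(-n*w*tau) over [0, 1]."""
    w = np.linspace(0.0, 1.0, num_points)
    y = np.exp(-n * tau * w)
    dw = w[1] - w[0]
    return dw * (y.sum() - 0.5 * (y[0] + y[-1]))

tau = 0.5  # illustrative sampling interval
for n in (0, 1, 10, 100):
    print(n, float(mean_mass_closed_form(n, tau)), mean_mass_quadrature(n, tau))
```

For large $n$ the mean decays like $1/(n\tau)$, which is the slow (algebraic) relaxation the rest of the talk is concerned with.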

(9)

Mori-Zwanzig projection

\[ M_n = \sum_{k=1}^{n} h_k M_{n-k} + \xi_n, \]

$h_k$ = memory kernel, $\xi_n$ = orthogonal dynamics (“noise”).

This equation is exact. Intuition: it is a good place to start approximating.

The memory kernel can be solved for analytically:

\[ h_k \sim \frac{1}{k \log^2(k)} \quad \text{as } k \to \infty. \]

Although $\sum_k h_k$ converges to $\hat{H}(1) = 1$, the partial sums approach 1 extremely slowly:

\[ 1 - \sum_{k=1}^{N} h_k \sim \frac{1}{\log(N)}. \]
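The slow convergence of the partial sums can be seen numerically. The sketch below assumes that the noise term has zero mean, so that taking expectations of the memory equation gives $m_n = \sum_{k=1}^{n} h_k m_{n-k}$ with $m_n = E[M_n] = (1 - e^{-n\tau})/(n\tau)$ and $m_0 = 1$. This is one way to pin down a kernel and is not necessarily the construction used in the talk. The value $\tau = 1$ is also illustrative.

```python
import numpy as np

def mean_mass(n, tau):
    """m_n = E[M_n] = (1 - exp(-n*tau)) / (n*tau), with m_0 = 1."""
    x = np.asarray(n, dtype=float) * tau
    safe = np.where(x == 0.0, 1.0, x)
    return np.where(x == 0.0, 1.0, -np.expm1(-safe) / safe)

def memory_kernel(num_terms, tau):
    """Solve m_n = sum_{k=1}^{n} h_k m_{n-k} recursively for h_1..h_N.

    Assumes a zero-mean noise term, so the kernel is fixed by the means.
    Returns an array h with h[k] = h_k (h[0] is unused and left at 0).
    """
    m = mean_mass(np.arange(num_terms + 1), tau)
    h = np.zeros(num_terms + 1)
    for n in range(1, num_terms + 1):
        # m_0 = 1, so h_n = m_n - sum_{k=1}^{n-1} h_k m_{n-k}
        h[n] = m[n] - np.dot(h[1:n], m[n - 1:0:-1])
    return h

tau = 1.0  # illustrative sampling interval
h = memory_kernel(2000, tau)
for N in (10, 100, 1000, 2000):
    # Deficit 1 - sum_{k<=N} h_k: shrinks only logarithmically in N.
    print(N, 1.0 - h[1:N + 1].sum())
```

Truncating the memory sum after $N$ terms therefore leaves out a fraction of the kernel's total weight that shrinks only logarithmically in $N$, which is why finite-memory approximations of this equation converge so slowly.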

(10)

Filtering, estimation and prediction

Given a sequence of noisy measurements $\tilde{M}_k = \int \rho_k\, dw + \eta_k$, where the $\eta_k$ are uncorrelated normal variates.

Question: What is the “best” prediction for $M_n$ in terms of the measurements $\tilde{M}_k$ for $k < n$?

Abstractly, optimal estimate = conditional expectation

\[ \bar{M}_n = E[M_n \mid \tilde{M}_{n-1}, \tilde{M}_{n-2}, \ldots, \tilde{M}_1, \tilde{M}_0]. \]

Goal: Concrete representation for optimal estimator = explicit functions $F_n$ such that

\[ E[M_n \mid \tilde{M}_{n-1}, \tilde{M}_{n-2}, \ldots, \tilde{M}_1, \tilde{M}_0] \approx F_n(\tilde{M}_{n-1}, \tilde{M}_{n-2}, \ldots, \tilde{M}_j, \ldots). \]

(11)

Classification of filters

• Autonomous = shift-invariant = $F_n \equiv F$ independent of $n$.

• $F_n$ only depends on $\tilde{M}_{n-1}, \tilde{M}_{n-2}, \ldots, \tilde{M}_{n-L}$ = finite impulse response with $L$ taps.

• $F_n$ is genie-aided if it has access to future information. Like a Maxwell demon, this fictional construct is useful because it allows us to bound the best-case behavior of constructible filters.

• Filter is empirical or data-driven = coefficients obtained through regression on one or many realizations of the underlying random process $\tilde{M}_k$.

Reduced model: If $F_n$ is a (close to) optimal filter, then

\[ \widehat{M}_n = F_n(\widehat{M}_{n-1}, \widehat{M}_{n-2}, \ldots, \widehat{M}_j, \ldots) + \theta_n, \]

with $\theta_n$ stochastic with appropriate statistics, is a good surrogate for the process $M_n$.

(12)

Empirical filters

Assume no measurement error. State-space model is:

\[ M_n = \sum_{k=1}^{n} h_k M_{n-k} + \xi_n, \]

where $\xi_n$ is a non-stationary random process.

Find the weights $h'_k$ by minimizing the sum of the normalized squared residuals

\[ \sum_{j=1}^{J} \sum_{n=L+1}^{N} \left[ \frac{M_n^{(j)} - \sum_{k=1}^{L} h'_k M_{n-k}^{(j)}}{\sum_{k=1}^{L} M_{n-k}^{(j)}} \right]^2, \]

where the outer sum is over different realizations, and the inner sum is over all subsequences of $L$ consecutive values of $M_k^{(j)}$.
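The regression for the empirical weights can be sketched as an ordinary least-squares problem: dividing each residual by $\sum_{k=1}^{L} M_{n-k}^{(j)}$ turns the normalized objective into a linear system in the weights. The ensemble used to generate the realizations (uniform rates, exponentially distributed amplitudes), the function names, and the sizes $J = 20$, $N = 60$, $L = 4$ are all assumptions made for illustration.

```python
import numpy as np

def simulate_mass(num_species, num_steps, tau, rng):
    """One realization M_0..M_N for point-mass initial data.

    Hypothetical ensemble: rates w_i uniform on [0, 1], amplitudes i.i.d.
    exponential with mean 1/num_species, so that E[M_0] = 1.
    """
    w = rng.uniform(0.0, 1.0, num_species)
    c0 = rng.exponential(1.0 / num_species, num_species)
    n = np.arange(num_steps + 1)
    return np.exp(-np.outer(n, w) * tau) @ c0

def fit_empirical_filter(series_list, num_taps):
    """Least-squares fit of the L weights to the normalized residuals."""
    rows, targets = [], []
    for M in series_list:
        for n in range(num_taps + 1, len(M)):
            past = M[n - num_taps:n][::-1]  # M_{n-1}, ..., M_{n-L}
            scale = past.sum()              # normalization in the objective
            rows.append(past / scale)
            targets.append(M[n] / scale)
    A, b = np.array(rows), np.array(targets)
    weights, *_ = np.linalg.lstsq(A, b, rcond=None)
    rms = np.sqrt(np.mean((A @ weights - b) ** 2))
    return weights, rms

rng = np.random.default_rng(0)
realizations = [simulate_mass(200, 60, 1.0, rng) for _ in range(20)]
h_emp, rms_residual = fit_empirical_filter(realizations, num_taps=4)
print("weights:", h_emp)
print("rms normalized residual:", rms_residual)
```

The fitted weights are shared across all realizations and all times, so this is an autonomous finite impulse response filter in the sense of the classification above.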

(13)

(14)

Distribution of initial conditions

\[ E[\rho_0(w)] = 1, \qquad E[\rho_0(w)\rho_0(w')] = 1 + \bar{\sigma}^2\, \delta(w - w'). \]

We can construct a sequence of point mass (i.e. discretized) initial conditions whose weak limits satisfy these conditions.

\[ E[M_n] = \frac{1 - e^{-n\tau}}{n\tau}, \qquad E[M_n M_j] = E[M_n]E[M_j] + \bar{\sigma}^2\, \frac{1 - e^{-(n+j)\tau}}{(n+j)\tau}. \]

Regression: optimal AR($L$) filter of the form

\[ M_n = q_n M_0 + h_1^{(n)} M_{n-1} + h_2^{(n)} M_{n-2} + \cdots + h_L^{(n)} M_{n-L} + \theta_n. \]

(15)

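Nonautonomous optimal filters

Yule-Walker equations:

\[ \frac{1 - e^{-(2n-k)\tau}}{(2n-k)\tau} = \sum_{j=1}^{L} h_j^{(n)}\, \frac{1 - e^{-(2n-k-j)\tau}}{(2n-k-j)\tau}, \qquad k = 1, 2, \ldots, L. \]

Hilbert matrix!

\[
\begin{pmatrix} \frac{1}{2n-1} \\ \frac{1}{2n-2} \\ \vdots \\ \frac{1}{2n-L} \end{pmatrix}
=
\begin{pmatrix}
\frac{1}{2n-2} & \frac{1}{2n-3} & \cdots & \frac{1}{2n-L-1} \\
\frac{1}{2n-3} & \frac{1}{2n-4} & \cdots & \frac{1}{2n-L-2} \\
\vdots & \vdots & \ddots & \vdots \\
\frac{1}{2n-L-1} & \frac{1}{2n-L-2} & \cdots & \frac{1}{2n-2L}
\end{pmatrix}
\begin{pmatrix} h_1^{(n)} \\ h_2^{(n)} \\ \vdots \\ h_L^{(n)} \end{pmatrix}.
\]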
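Hilbert-type matrices are notoriously ill-conditioned, and a short numerical sketch makes this concrete. It builds the system with entries $1/(2n-k-j)$, i.e. the form of the Yule-Walker equations in which the exponentially small terms are neglected, and reports its condition number. The function name and the values $n = 20$, $L = 6$ are illustrative assumptions.

```python
import numpy as np

def yule_walker_system(n, num_taps):
    """Matrix A[k-1, j-1] = 1/(2n-k-j) and right-hand side b[k-1] = 1/(2n-k).

    Exponentially small terms are neglected. Requires n > num_taps so that
    no denominator vanishes.
    """
    k = np.arange(1, num_taps + 1)
    A = 1.0 / (2 * n - k[:, None] - k[None, :])
    b = 1.0 / (2 * n - k)
    return A, b

A, b = yule_walker_system(n=20, num_taps=6)
cond_number = np.linalg.cond(A)
print("condition number:", cond_number)
```

With a condition number this large, a direct floating-point solve loses most of its accuracy, which is one motivation for working with an explicit formula for the weights instead.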

(16)

Asymptotic filter

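\[ h_j^{(n)} = \prod_{i \ne j}^{L} \frac{i}{i-j}\, \prod_{i=1}^{L} \frac{2n-i-j}{2n-i} = (-1)^{j-1}\binom{L}{j} + (-1)^{j}\, \frac{L^2}{2n} \binom{L-1}{j-1} + O(n^{-2}). \]

For $L = 6$:

\[
\begin{aligned}
h_1^{(n)} &= 6 - \frac{36}{2n-1}, \\
h_2^{(n)} &= -15 + \frac{630}{2n-1} - \frac{225}{n-1}, \\
h_3^{(n)} &= 20 - \frac{3360}{2n-1} + \frac{2100}{n-1} - \frac{1200}{2n-3}, \\
h_4^{(n)} &= -15 + \frac{7560}{2n-1} - \frac{6300}{n-1} + \frac{6300}{2n-3} - \frac{450}{n-2}, \\
h_5^{(n)} &= 6 - \frac{7560}{2n-1} + \frac{7560}{n-1} - \frac{10080}{2n-3} + \frac{1260}{n-2} - \frac{180}{2n-5}, \\
h_6^{(n)} &= -1 + \frac{2772}{2n-1} - \frac{3150}{n-1} + \frac{5040}{2n-3} - \frac{840}{n-2} + \frac{210}{2n-5} - \frac{3}{n-3}.
\end{aligned}
\]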
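The product formula for the weights can be checked in exact rational arithmetic. The sketch below evaluates $h_j^{(n)} = \prod_{i \ne j} \frac{i}{i-j} \prod_{i=1}^{L} \frac{2n-i-j}{2n-i}$ with Python's `fractions.Fraction` and substitutes the result back into the system $\sum_j h_j^{(n)}/(2n-k-j) = 1/(2n-k)$. The function names and the choice $n = 20$, $L = 6$ are illustrative.

```python
from fractions import Fraction

def asymptotic_filter(n, num_taps):
    """Closed-form weights h_j^{(n)}, j = 1..L, as exact rationals."""
    L = num_taps
    weights = []
    for j in range(1, L + 1):
        h = Fraction(1)
        for i in range(1, L + 1):
            if i != j:
                h *= Fraction(i, i - j)
            h *= Fraction(2 * n - i - j, 2 * n - i)
        weights.append(h)
    return weights

def yule_walker_residuals(n, weights):
    """Residuals of sum_j h_j/(2n-k-j) - 1/(2n-k) for k = 1..L."""
    L = len(weights)
    return [
        sum(h * Fraction(1, 2 * n - k - j) for j, h in enumerate(weights, start=1))
        - Fraction(1, 2 * n - k)
        for k in range(1, L + 1)
    ]

n, L = 20, 6
h = asymptotic_filter(n, L)
print([float(x) for x in h])
print(yule_walker_residuals(n, h))  # every entry is exactly zero
```

Because the arithmetic is exact, the ill-conditioning of the matrix plays no role in this check.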

(17)

Universal filter

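Asymptotic filter coefficients converge as $n \to \infty$:

\[ \lim_{n \to \infty} h_j^{(n)} = (-1)^{j-1} \binom{L}{j}. \]

Post facto justification for averaging over $n$:

\[ \sum_{i=0}^{L} \binom{L}{i} \frac{(-1)^i}{n-i} = \frac{(-1)^L L!}{n(n-1)(n-2)\cdots(n-L)} \sim \frac{(-1)^L L!}{n^{L+1}}, \]

\[ M_n \approx L M_{n-1} - \frac{L(L-1)}{2} M_{n-2} + \cdots - (-1)^L M_{n-L}. \]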
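The alternating binomial sum can be verified exactly for small $L$. The sketch below keeps track of the overall sign $(-1)^L$, which does not affect the $n^{-(L+1)}$ decay; the function names and the test values of $n$ and $L$ are illustrative.

```python
from fractions import Fraction
from math import comb, factorial

def alternating_binomial_sum(n, L):
    """sum_{i=0}^{L} C(L, i) (-1)^i / (n - i), exactly; requires n > L."""
    return sum(Fraction((-1) ** i * comb(L, i), n - i) for i in range(L + 1))

def closed_form(n, L):
    """(-1)^L L! / (n (n-1) ... (n-L))."""
    denom = 1
    for i in range(L + 1):
        denom *= n - i
    return Fraction((-1) ** L * factorial(L), denom)

for L in (1, 2, 4, 6):
    print(L, alternating_binomial_sum(50, L), closed_form(50, L))
```

Applied to a mean that decays like $1/n$, the universal filter therefore leaves a residual of order $n^{-(L+1)}$, one power of $n$ smaller for each additional tap.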

(18)

Universal filter and slow decay of correlations

Suppose $f(x)$ is algebraically decaying. Among all sets of coefficients $\alpha_0, \alpha_1, \alpha_2, \ldots, \alpha_L$, normalized by $\alpha_0 = 1$, the linear combination

\[ \sum_{i=0}^{L} \binom{L}{i} (-1)^i f(n-i) \sim \frac{d^L}{dx^L} f\!\left(n - \frac{L}{2}\right) \]

is asymptotically “the smallest” possible.

Not true for exponentially decaying functions!

Slowly decaying correlations imply a stochastic parameterization:

\[ [(1 - R)^L f]_n = \sum_{i=0}^{L} \binom{L}{i} (-1)^i f_{n-i} = \sigma_n \theta_n, \]

where $R$ is the backward shift, $(Rf)_n = f_{n-1}$.
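The contrast between algebraic and exponential decay is easy to see numerically. The sketch below applies the binomial difference $\sum_i \binom{L}{i}(-1)^i f(n-i)$ to $f(x) = 1/x$ and to $f(x) = e^{-x}$ and compares the result with $f(n)$ itself. Both test functions and the values $n = 100$, $L = 3$ are illustrative choices.

```python
import numpy as np
from math import comb

def binomial_difference(f, n, L):
    """[(1 - R)^L f]_n = sum_{i=0}^{L} C(L, i) (-1)^i f(n - i)."""
    return sum(comb(L, i) * (-1) ** i * f(n - i) for i in range(L + 1))

def algebraic(x):
    return 1.0 / x      # illustrative algebraic decay

def exponential(x):
    return np.exp(-x)   # illustrative exponential decay

n, L = 100, 3
ratio_algebraic = binomial_difference(algebraic, n, L) / algebraic(n)
ratio_exponential = binomial_difference(exponential, n, L) / exponential(n)
print("algebraic:  ", ratio_algebraic)    # tiny relative residual
print("exponential:", ratio_exponential)  # equals (1 - e)^3, not small
```

For the algebraic function the difference is smaller than $f(n)$ by a factor of order $n^{-L}$, whereas for the exponential it is a fixed multiple $(1 - e)^L$ of $f(n)$ no matter how large $n$ is.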

(19)

The time and temperature dependence of the evaporation curves is best fit by one of the following two equations:

%E = (0.165(%D) + 0.045(T - 15)) log(t) and

(20)

Nondimensional evaporation curves

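Time scale is set by the most volatile species: $w_{\max} = 1$.

\[ M(t) = 1 - a \log(1 + t/t_0) \quad \text{and} \quad M(t) = 1 - a\left(\sqrt{1 + t/t_0} - 1\right). \]

$t$, $t_0$ (small-scale cutoff) and $a \ll 1$ are all dimensionless.

\[ -\dot{M}(t) \lesssim \alpha_{\max} M(t) = M(t), \quad \text{so that } a/t_0 \lesssim 1. \]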

Ranges of validity: $T_{\max} \sim t_0 e^{1/a}$ for the logarithmic equation and $T_{\max} \sim t_0/a^2$ for the square-root equation.

(21)

Nondimensional evaporation curves and filtering

Empirical

(22)

Log-convexity

\[ \frac{d^2}{dt^2} \log(M(t)) = \frac{\int w^2 \rho(w, t)\, dw \int \rho(w, t)\, dw - \left(\int w \rho(w, t)\, dw\right)^2}{\left(\int \rho(w, t)\, dw\right)^2} \ge 0. \]

This relation has to hold for every realization

Log equation:

\[ T_{\mathrm{crit}} \approx \frac{1}{e}\, T_{\max}, \qquad M(T_{\mathrm{crit}}) \approx a \ll 1. \]

Sqrt equation:

\[ T_{\mathrm{crit}} \approx \frac{1}{4}\, T_{\max}, \qquad M(T_{\mathrm{crit}}) \approx \frac{1}{2}. \]
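The critical times can be recovered numerically. The sketch below takes $T_{\mathrm{crit}}$ to be the time at which $\frac{d^2}{dt^2}\log M$ changes sign for the two empirical curves $M(t) = 1 - a\log(1 + t/t_0)$ and $M(t) = 1 - a(\sqrt{1 + t/t_0} - 1)$, and $T_{\max}$ to be the time at which $M$ reaches zero. It locates the sign change by bisection on $M M'' - (M')^2$. The values $a = 0.1$, $t_0 = 1$ and all function names are illustrative assumptions.

```python
import math

def log_curve(t, a, t0):
    """M, M', M'' for M(t) = 1 - a*log(1 + t/t0)."""
    M = 1.0 - a * math.log1p(t / t0)
    dM = -a / (t0 + t)
    d2M = a / (t0 + t) ** 2
    return M, dM, d2M

def sqrt_curve(t, a, t0):
    """M, M', M'' for M(t) = 1 - a*(sqrt(1 + t/t0) - 1)."""
    s = math.sqrt(1.0 + t / t0)
    M = 1.0 - a * (s - 1.0)
    dM = -a / (2.0 * t0 * s)
    d2M = a / (4.0 * t0 ** 2 * s ** 3)
    return M, dM, d2M

def convexity_margin(curve, t, a, t0):
    """M*M'' - (M')^2, which has the sign of d^2/dt^2 log M while M > 0."""
    M, dM, d2M = curve(t, a, t0)
    return M * d2M - dM ** 2

def bisect_root(func, lo, hi, iterations=200):
    """Plain bisection; assumes func(lo) > 0 > func(hi)."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if func(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

a, t0 = 0.1, 1.0                                # illustrative values, a << 1
t_max_log = t0 * math.expm1(1.0 / a)            # M = 0 for the log curve
t_max_sqrt = t0 * ((1.0 + 1.0 / a) ** 2 - 1.0)  # M = 0 for the sqrt curve
t_crit_log = bisect_root(
    lambda t: convexity_margin(log_curve, t, a, t0), 0.0, t_max_log)
t_crit_sqrt = bisect_root(
    lambda t: convexity_margin(sqrt_curve, t, a, t0), 0.0, t_max_sqrt)
print("log : T_crit/T_max =", t_crit_log / t_max_log)
print("sqrt: T_crit/T_max =", t_crit_sqrt / t_max_sqrt)
```

Beyond $T_{\mathrm{crit}}$ the empirical curves stop being log-convex, so they can no longer be realized by any nonnegative distribution $\rho(w, 0)$ of decay rates.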

(23)

Universal vs. asymptotic filter

The ability of a filter to track/predict these functions accurately is not necessarily a positive feature.

(24)

Conclusions

• Mori-Zwanzig does poorly on systems with slow decay of correlations.

• The universal filter has very small error as $n \to \infty$, but is not very discriminating.

• The empirical linear filter is very discriminating/nearly optimal among all linear filters with fixed coefficients and $L$ (a given number of) taps. There is a floor for its error: sloppy model.

• The extended asymptotic filter is (essentially) time varying so it has

(25)