[main] [srq]mg|pages

SRQ seminar – October 29th, 2018

Notes from a talk in the SRQ series.

SRQ Seminar notes 20181029 Jara

Entropy approach to fluctuations of interacting particle systems

Reader can refer to the paper “Non-Equilibrium fluctuations of interacting particle systems” (written jointly with Octavio Menezes, on the arXiv).

Continuous time, finite state (state space \(\Omega\)), Markov chain \((x_t)_{t \geqslant 0}\).

\(\displaystyle \textit{Generator:} \quad \quad L f (x) = \sum_y r (x, y) (f (y) - f (x)), \qquad x \in \Omega .\)

\(\bar{\nu}\) a (probability) reference measure \(\bar{\nu} (x) > 0\) for all \(x \in \Omega\). \((\mu_t)_{t \geqslant 0}\) trajectory of (probability) reference measures,

\(\displaystyle \psi_t (x) := \frac{\mu_t (x)}{\bar{\nu} (x)}, \quad \text{density of $\mu_t$ wrt. $\bar{\nu}$} .\)
\(\displaystyle f_t (x) := \frac{\mathbb{P} (x_t = x)}{\mu_t (x)}, \quad \text{density of the law of $x_t$ wrt. $\bar{\nu}$} .\)
\(\displaystyle \textit{Carré du champ:} \quad \quad \Gamma f (x) := \sum_y r (x, y) (f (y) - f (x))^2 .\)

\(L_t^{\ast}\) adjoint of \(L\) wrt. \(\mu_t\).

\(\displaystyle H (t) := \int f_t \log f_t \mathrm{d} \mu_t, \qquad \textit{relative entropy} .\)

Theorem 1. (Yau's inequality)

\(\displaystyle H' (t) \leqslant - \int \Gamma \sqrt{f_t} \mathrm{d} \mu_t + \int (L^{\ast}_t \mathbb{1}- \partial_t \log \psi_t) f_t \mathrm{d} \mu_t .\)

Note that \(L^{\ast}_t \mathbb{1}- \partial_t \log \psi_t = 0\) iff the sequence \(\mu_t\) solves the Fokker–Planck equation, it measures how far our reference measures \((\mu_t)_t\) are to be the actual law of our Markov process.

This is a very general inequality. For diffusions this inequality is an identity.

Yau's idea for hydrodynamic limit is to guess a good \((\mu_t)_t\) for which \(L^{\ast}_t \mathbb{1}- \partial_t \log \psi_t\) is “small”.

A new inequality. Take a function \(V : [0, T] \times \Omega \rightarrow \mathbb{R}\) and consider

\(\displaystyle \int_0^T V_t (x_t) \mathrm{d} t\)

which is an additive functional of the Markov chain. We call it also an “observable”. We are interested in the behaviour of the integral for large \(T\).

Theorem 2.

\(\displaystyle \log \mathbb{E}^{\mu_0} \left[ \exp \left( \int_0^T V_t (x_t) \mathrm{d} t \right) \right] \leqslant \int_0^T \sup_f \left[ - \int \Gamma \sqrt{f} \mathrm{d} \mu_t + \int \left( V_t + \frac{1}{2} (L^{\ast}_t \mathbb{1}- \partial_t \log \psi_t) \right) f \mathrm{d} \mu_t \right] \mathrm{d} t,\)

where the supremum is over any density \(f\) for \(\mu_t\).

Interesting fact is that the structure of the r.h.s. is similar to the structure of Yau's inequality.

If you know how to bound integrals of arbitrary functions like \(\int (L^{\ast}_t \mathbb{1}- \partial_t \log \psi_t) f_t \mathrm{d} \mu_t\) in terms of \(\int \Gamma \sqrt{f} \mathrm{d} \mu_t\) then this can be applied also to the exponential moments for observables.

Two examples to which to apply these inequalities:

  1. Exclusion process with boundary conditions.

  2. Mean–field spin systems, in particular (Ben-Hamou–Peres): we have particles on \(\mathbb{Z}\) at rate \(1\) we choose a site \(x\) and a site \(y\) and we change the content \(\sigma (x)\) to \(\sigma (x) + \sigma (y)\) modulo \(2\).

Mean-field spin system

(work in progress with F. Hernández (Bogotá))

\(\displaystyle \Omega_N = \{ 0, 1 \}^N\)
\(\displaystyle \eta \in \Omega_N, \quad x \in \Lambda_n = \{ 1, \ldots, N \} ; \quad \eta^x = \eta + \delta_x \text{(mod $2$)}\)

Function \(F : \Omega \rightarrow [0, \infty]\) smooth, generator:

\(\displaystyle L f (\eta) = \sum_x F \left( \frac{1}{N} \sum_{y \neq x} \eta_y \right) (f (\eta^x) - f (\eta)) .\)

The measure \(\bar{\nu} = \otimes_x \operatorname{Bernoulli} (1 / 2)\) is invariant for this system (by detailed balance). A natural question is to describe the convergence of this model to equilibrium.

Ben-Hamou–Peres consider the case \(F (u) = u\). We focus now on this example:

\(\displaystyle L f (\eta) = \frac{1}{N} \sum_{x \neq y} \eta_y (f (\eta^x) - f (\eta)) .\)

The Bernoulli measure is not ergodic, there is another ergodic component where every site is empty. So we consider the state space where we have at least one particle. There the dynamics is well defined.

Let \((\eta^N (t))_{t \geqslant 0}\) chain generated by \(L\), we take \(\eta^N (0)\) with at least one particle. Define the distance to equilibrium:

\(\displaystyle d_N (t) = \| \operatorname{Law} (\eta^N (t)) - \bar{\nu} \|_{\operatorname{TV}} .\)

Ben-Hamou–Peres prove a cutoff phenomenon.

We will prove that there exists a function \(G\) looking like the green curve such that:

\(\displaystyle \lim_{N \rightarrow \infty} d_N (t^N_{\operatorname{mix}} + b) = G (b) .\)

Ideas:

\(\displaystyle \pi^N_t = \frac{1}{N} \sum_x \eta^N_x (t)\)

a) If \(\pi^N_0 \rightarrow x_0 > 0\) then

\(\displaystyle \pi^N_t \rightarrow \varphi_t\)

where

\(\displaystyle \left\{ \begin{array}{l} \dot{\varphi}_t = \varphi_t (1 - 2 \varphi_t) =: - V' (\varphi_t)\\ \varphi_0 = x_0 \end{array} \right.\)

This is the law of large numbers.

b) If in addition

\(\displaystyle \sqrt{N} (\pi^N_0 - x_0) \rightarrow y_0\)

then we have a CLT:

\(\displaystyle \sqrt{N} (\pi^N_t - \varphi_t) \rightarrow \xi_t\)

where

\(\displaystyle \left\{ \begin{array}{l} \mathrm{d} \xi_t = - V'' (\varphi_t) \xi_t \mathrm{d} t + \sqrt{\varphi_t} \mathrm{d} B_t\\ \xi_0 = y_0 \end{array} \right.\)

These two results tell us that the number of particles is well approximated by the Gaussian with mean \(N \varphi_t\) and variance \(N \xi_t^2\).

\(\displaystyle \sigma_t^2 := \mathbb{E} [\xi_t^2] \xrightarrow[t \rightarrow \infty]{} \frac{1}{4} .\)

In general if \(F (0) \neq 0\) then

\(\displaystyle t^N_{\operatorname{mix}} = \frac{1}{2 F (1 / 2)} \log N.\)

If \(F (0) = 0\) and \(F' (0) \neq 0\) then

\(\displaystyle t^N_{\operatorname{mix}} = \left( \frac{1}{F' (0)} + \frac{1}{2 F (1 / 2)} \right) \log N.\)

Let us take

\(\displaystyle \mu_t := \otimes_x \operatorname{Bern} (\varphi_t) .\)

If we compute

\(\displaystyle L^{\ast}_t \mathbb{1}- \partial_t \log \psi_t = \frac{(1 - 2 \varphi_t)}{N} \sum_{x \neq y} \omega_x \omega_y + \text{l.o.t.}\)
\(\displaystyle \omega_x = \frac{\eta_x - \varphi_t}{\varphi_t (1 - \varphi_t)}\)

l.o.t.\(=\) lower order terms, lots of terms. At this point Yau's inequality already tells us something meaningful. Note that by CLT

\(\displaystyle \frac{1}{N} \sum_{x \neq y} \omega_x \omega_y \approx \frac{(\pi_t^N - \varphi_t)^2}{N}\)

should be approx. Gaussian, actually is the square of a subGaussian (the square exponential moments exists) random variable under \(\mu_t\). And since \(\mu_t\) is explicit we have

\(\displaystyle \int e^{\theta \frac{(\pi_t^N - \varphi_t)^2}{N}} \mathrm{d} \mu_t \leqslant e^{c \theta^2}\)

for small \(\theta\). Now

\(\displaystyle \int (L^{\ast}_t \mathbb{1}- \partial_t \log \psi_t) f_t \mathrm{d} \mu_t \leqslant \frac{1}{\gamma} \left[ H (t) + \log \int e^{\frac{\gamma}{N} (\pi_t^N - \varphi_t)^2} \mathrm{d} \mu_t \right]\)

and \((1 - 2 \varphi_t) \lesssim e^{- t}\).

\(\displaystyle H (t) \leqslant H (t_0) f (t_0)\)

where \(f (t) \rightarrow 0\) as \(t \rightarrow \infty\).

If \(\operatorname{Law} (\eta^N (t_0))\) is close to \(\mu_{t_0}\) (in relative entropy), then \(\operatorname{Law} (\eta^N (t))\) is close to \(\mu_t\) for all \(t \geqslant t_0\) uniformly in \(N\).

The CLT tells us something about the distribution of \(\operatorname{Law} (\eta^N (t_0))\). In particular it tells us that if we have convergence in law then there is a coupling which converge in probability. The same is true for two different sequences of random variables which can be coupled to remain at small distance.

\(\pi^N_{t_0}\) is at distance \(\varepsilon / N^{1 / 2}\) of \(\operatorname{Binom} (N, \varphi_{t_0})\) if \(t_0\) is large.

We use entropy twice: one for large time to be close to product measure forever, and the other to prove that strating from deterministic we are, after small time, near a product measure.

[I could not follow the final part of this discussion]

\(\displaystyle \)