Stochastic Processes

Stochastic Partial Differential Equations

Цели урока

Build the mild solution of the stochastic heat SPDE via the Walsh integral
Explain the reduced regularity of the solution (1/4 in t, 1/2 in x)
Derive the Zakai equation for the nonlinear filtering problem
Connect SPDEs to neural solvers (FNO, PINN)

Предварительные знания

Stochastic control
Ito integral
Heat equation

Tesla FSD fuses data from 8 cameras, radar, and ultrasound every 10 ms. The Kalman filter is the Zakai equation for the linear case. All of sensor fusion is an SPDE solved in real time.

Tesla FSD: Zakai equation as the basis for nonlinear sensor fusion
NVIDIA Modulus: neural solvers for turbulent SPDEs
FNO (Fourier Neural Operator): operator learning for parametric SPDEs
Finance: Heath-Jarrow-Morton interest rate model as an SPDE

Walsh, Zakai, and the birth of SPDE theory

John Walsh defined the stochastic integral with respect to a space-time Brownian sheet in 1984, opening the door to a rigorous SPDE theory. Moshe Zakai derived his filtering equation in 1969, generalizing the Kalman-Bucy filter. Nualart, Da Prato, and others systematically studied solution regularity in the 1990s-2000s. Modern applications: Anima Anandkumar and Zongyi Li developed FNO in 2020 - neural networks for solving SPDEs on large grids in seconds.

SPDEs and the Stochastic Heat Equation

NVIDIA Modulus, 2024. A neural simulator for turbulence accelerates CFD computations 100x. Under the hood: SPDEs for space-time random fields. The stochastic heat equation describes diffusion driven by a random source and has become the standard model for spatial noise.

In dimension d >= 2 space-time white noise is too rough: u has no pointwise L^2 regularity. For d >= 2, noise must be smoothed (colored noise) or generalized solutions must be used.

Why does the mild solution of the stochastic heat SPDE have 1/4 regularity in time rather than 1/2 like Brownian motion?

White noise dot{W}(t,x) is a generalized function in both variables. The heat kernel smooths in x, giving 1/2-Holder there. In t: without spatial integration it would be 1/2, but double roughness gives 1/4.

The Zakai Equation and Nonlinear Filtering

GPS gives position with 2-3 meter error. A camera gives a different error. Lidar gives a third. How to combine all sensors into a single state estimate? This is nonlinear filtering. The Zakai equation is its rigorous mathematical solution.

Kalman filter as a special case

Linear Zakai equation

With f(x) = Ax (linear dynamics) and h(x) = Cx (linear observation), initial distribution X_0 ~ N(m_0, P_0), the filter pi_t = N(m_t, P_t) remains Gaussian for all t. The Zakai equation reduces to the Kalman-Bucy equations: dm_t = A m_t dt + P_t C^T (dY_t - C m_t dt), dP_t/dt = AP_t + P_t A^T + Q - P_t C^T R^{-1} C P_t. The Riccati equation again.

The particle filter (sequential Monte Carlo) is a numerical solution to the Zakai equation through a weighted particle set. Used in Tesla FSD for sensor fusion: GPS + IMU + cameras + lidar.

How does the Zakai equation differ from the Fokker-Planck equation?

Fokker-Planck: dt p(t,x) = L* p(t,x) - deterministic PDE for unconditional density. Zakai: d sigma_t(phi) = sigma_t(L phi) dt + sigma_t(h phi) dY_t - SPDE with dY_t because it incorporates observations.

Solution Regularity and Neural PDE Solvers

Neural PDE solvers (Physics-Informed Neural Networks, FNO) handle SPDEs through random realizations. The Fourier Neural Operator by Anima Anandkumar solves parametric SPDEs in milliseconds instead of hours with FEM. The key question: what regularity needs to be approximated?

FNO (Fourier Neural Operator) is trained on pairs (initial condition, noise realization) -> SPDE solution. Generalization to new noise realizations is automatic. This is operator learning: FNO approximates the map from function spaces to function spaces, not a single fixed solution.

For space-time SPDEs, spectral methods are effective: expanding in eigenfunctions of the operator L gives explicit formulas for the variance of each mode. Efficient for linear SPDEs.

FNO learns to approximate an SPDE solution. Does it approximate a function or an operator?

FNO approximates the operator G: (u_0, W) -> u(T,*). Generalization is achieved by training on many (noise realization, solution) pairs. For new realizations FNO gives solutions in milliseconds.

Connections to other topics

SPDEs connect functional analysis, stochastic calculus, and numerical methods

Nonlinear filtering — Related topic
Neural PDE solvers — Related topic
Rough paths theory — Related topic
Financial models — Related topic

Итоги

Stochastic heat SPDE: mild solution via Walsh integral, regularity 1/4 in t and 1/2 in x
Variance grows like sqrt(t) - slower than an SDE because thermal diffusion dampens noise
Zakai equation: SPDE for the unnormalized conditional distribution given observations
Linear Zakai = Kalman-Bucy; nonlinear = particle filter

Вопросы для размышления

Why does space-time white noise in dimension d >= 2 prevent pointwise solution values for the SPDE?
How does the Zakai equation extend when the observations are themselves nonlinear in noise?
What fundamentally distinguishes FNO from PINN when solving parametric SPDEs?

Связанные уроки

sp-25-stochastic-control — The Zakai equation is an SPDE arising from nonlinear filtering
sp-20 — Walsh integral generalizes the Ito integral to space-time
sp-27 — Rough paths theory extends SPDEs to irregular noise
sp-24-levy-processes