Stochastic filter

When we observe some natural phenomenon, there are essentially two “random” or “noisy” influences: First there may be uncontrollable, seemingly random, processes that influence the phenomenon itself, then there will be errors in the measuring process. **Stochastic filters** are algorithms that compute estimations of the true state of the phenomenon, given “noisy” observation.

Therefore, **stochastic filters** are actually **estimators** that are supposed to remove the noise from the measurements: The removal of noise is the reason for the name “filter”.

The **Kalman-Bucy filter**, which is the solution of a linear filtering problem, has found many applications including aerospace engineering (Ranger, Mariner, Apollo), and can therefore be used as an example that quite sophisticated mathematics may have relevant real world applications almost immediately after its invention.

This page primarily deals with filtering of continuous time analytic systems. For discrete time filtering see temporal filtering.

We will formulate the filtering problem in continuous time using stochastic differential equations. Let the original process $X_t$ satisfy a stochastic differential equation

$d X_t = A(X_t ,t) \; d t + B(X_t, t) \; d W_{1, t}$

This is the process we would like to know about. But all we have is knowledge about a measurement process $Z_t$ of the original process $X_t$, which again satisfies a differential equation

$d Z_t = C(X_t, t) \; d t + D(X_ t, t ) \; d W_{2, t}$

Both equations may be multidimensional, not necessarily with the same dimension, and $W_{1, t}$ and $W_{2, t}$ are independent Wiener processes of the appropriate dimension.

We are looking for an estimator $\hat{X}_T$ of the state of the process $X_T$ at time $T$, given the observation of $Z_t$ on the interval $[0, T]$. To make the statement “given the observation” mathematically precise, we demand that the estimator $\hat{X}_T$ is measurable with respect to the $\sigma$-algebra generated by the process $Z_t$.

The **best estimator** $\hat{X}_T$ is defined to be an estimator that minimizes

$\int_{\Omega} \| X_t - \hat{X}_t \|^2 \; d P$

where $\Omega, P$ denotes the probability space and the probability measure of $(W_{1, t}, W_{1, t})$ with the appropriate initial condition (which we left unspecified).

We consider the linear filtering problem in continuous time, using stochastic differential equations. The original process is denoted by $X_t$, the process of measurements is denoted by $Z_t$. Both satisfy a linear stochastic differential equation in one dimension:

$d X_t = A(t) X_t d t + B(t) d W_{1, t}$

and

$d Z_t = C(t) X_t d t + D(t) d W_{2, t}$

where $W_{1, t}$ and $W_{2, t}$ are independent Wiener processes, and the functions $A, B, C, D$ are given.

The solution of the problem is given by the **Kalman-Bucy filter**, which specifies a stochastic differential equation which $\hat{X}_T$ has to satisfy.

The solution to the linear filtering problem $\hat{X}_t$ satisfies the following stochastic differential equation:

$d \hat{X}_t = (A(t) - \frac{C^2(t) S(t)}{D^2(t)}) \hat{X}_t d t + \frac{C(t) S(t)}{D^2(t)} \; d Z_t$

with the initial condition $\hat{X}_0 = E(X_0)$. The function $S$ is defined as

$S(t) := E(( X_t - \hat{X}_t)^2)$

and is known to satisfy the following deterministic Ricci differential equation:

$\frac{d S}{d t} = 2 A(t) S(t) - \frac{C^2(t)}{D^2(t)} S^2(t) + B^2(t)$

with the obvious initial condition $S(0) = E ( (X_0 - E(X_0))^2 )$.

We will calculate the Kalman-Bucy filter for a constant process:

$d X_t = 0$

with $E(X_0) = 0$ and $E(X_0^2) =: a^2$.

The observations are noisy, that is:

$d Z_t = X_t \; d t + m \; d W_t$

with initial condition $Z_0 = 0$.

Then the Kalman-Bucy filter is

$\hat{X_z} = \frac{a^2}{m^2 + a^2 t} \; Z_t$

See chapter 6, “The Filtering Problem”, of:

- Bernt Øksendal,
*Stochastic Differential Equations: An Introduction with Applications*, 6th ed. (ZMATH)

category: statistical methods