The Kalman Filter is just the Bayes Filter applied to a linear-Gaussian setting.

Settings

First we have a state transition:

$$x_t = A_t x_{t-1} + B_t u_t + \epsilon_t$$

The noise $\epsilon_t$'s mean is $0$ and its covariance is $R_t$.

Then we have the measurements:

$$z_t = C_t x_t + \delta_t$$

Again, the measurement noise $\delta_t$ has mean $0$ and covariance $Q_t$.
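
A toy instantiation I find helpful (my own example, not from any source): a 1-D random walk observed directly, i.e. $A_t = 1$, $B_t = 0$, $C_t = 1$, so that

$$x_t = x_{t-1} + \epsilon_t, \quad \epsilon_t \sim \mathcal{N}(0, r), \qquad z_t = x_t + \delta_t, \quad \delta_t \sim \mathcal{N}(0, q).$$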

The algorithm
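
For reference, this is the standard Kalman filter algorithm, in the notation of Probabilistic Robotics:

$$
\begin{aligned}
1:&\ \textbf{Kalman\_filter}(\mu_{t-1}, \Sigma_{t-1}, u_t, z_t)\\
2:&\quad \bar\mu_t = A_t \mu_{t-1} + B_t u_t\\
3:&\quad \bar\Sigma_t = A_t \Sigma_{t-1} A_t^T + R_t\\
4:&\quad K_t = \bar\Sigma_t C_t^T \left(C_t \bar\Sigma_t C_t^T + Q_t\right)^{-1}\\
5:&\quad \mu_t = \bar\mu_t + K_t \left(z_t - C_t \bar\mu_t\right)\\
6:&\quad \Sigma_t = (I - K_t C_t)\,\bar\Sigma_t\\
7:&\quad \textbf{return}\ \mu_t, \Sigma_t
\end{aligned}
$$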

Here lines 2 and 3 are calculating $\overline{bel}(x_t)$, i.e. $(\bar\mu_t, \bar\Sigma_t)$, while lines 5 and 6 are calculating $bel(x_t)$, i.e. $(\mu_t, \Sigma_t)$.
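
For concreteness, here is a minimal NumPy sketch of a single predict / update step, written straight from the listing above; the function name and signature are my own, not from the book:

```python
import numpy as np

def kalman_step(mu, Sigma, u, z, A, B, C, R, Q):
    """One Kalman filter step, following lines 2-6 of the listing above."""
    # Prediction (lines 2-3): push the previous belief through the motion model.
    mu_bar = A @ mu + B @ u
    Sigma_bar = A @ Sigma @ A.T + R
    # Kalman gain (line 4).
    S = C @ Sigma_bar @ C.T + Q            # innovation covariance
    K = Sigma_bar @ C.T @ np.linalg.inv(S)
    # Measurement update (lines 5-6).
    innovation = z - C @ mu_bar
    mu_new = mu_bar + K @ innovation
    Sigma_new = (np.eye(len(mu)) - K @ C) @ Sigma_bar
    return mu_new, Sigma_new
```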

My own intuition

  • $\bar\mu_t = A_t \mu_{t-1} + B_t u_t$: that's easy, it comes straight from the state transition formula.
  • $\bar\Sigma_t = A_t \Sigma_{t-1} A_t^T + R_t$: the original covariance goes through the linear formula; since the covariance is quadratic in the state, $A_t$ must be multiplied in twice. Then add $R_t$ for the state transition noise (a one-line check follows this list).
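
The one-line check, using the independence of $x_{t-1}$ and $\epsilon_t$:

$$\operatorname{Cov}\!\left(A_t x_{t-1} + B_t u_t + \epsilon_t\right) = A_t \Sigma_{t-1} A_t^T + R_t.$$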

On Kalman Gain

Kalman Gain: Let's write this as

$$K_t = \frac{\bar\Sigma_t C_t^T}{C_t \bar\Sigma_t C_t^T + Q_t}.$$

That's strange (and nobody seems to point that out), because obviously, to make sense of it as a fraction, we need another $C_t$ there in the numerator.

Let's just say, $K_t$ is, intuitively,

$$K_t = C_t^{-1}\,\frac{C_t \bar\Sigma_t C_t^T}{C_t \bar\Sigma_t C_t^T + Q_t}.$$

Of course there's no guarantee that $C_t$ is invertible. But let's just keep it that way.
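
To see that the fraction reading is consistent: whenever $C_t$ is invertible the intuitive form collapses back to the real gain, and in the scalar case with $c_t = 1$ it is literally a ratio of variances (the scalar specialization is my own addition):

$$C_t^{-1}\,\frac{C_t \bar\Sigma_t C_t^T}{C_t \bar\Sigma_t C_t^T + Q_t} = \bar\Sigma_t C_t^T \left(C_t \bar\Sigma_t C_t^T + Q_t\right)^{-1} = K_t, \qquad k_t = \frac{\bar\sigma_t^2}{\bar\sigma_t^2 + q_t}.$$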

Now onto the next formula, the mean update. We compute the innovation: the difference between the “real measurement” and the “expected measurement”, $z_t - C_t \bar\mu_t$. Recall that $z_t = C_t x_t + \delta_t$. If we just multiply $C_t^{-1}$ to both sides, we get $C_t^{-1} z_t = x_t + C_t^{-1} \delta_t$: the measurement, pulled back into state space, is itself a noisy estimate of the state. The $C_t^{-1}$ hiding inside $K_t$ does exactly that pull-back, and the fraction decides how much of the innovation we accept. So here we are,

$$\mu_t = \bar\mu_t + K_t\,(z_t - C_t \bar\mu_t).$$

Voilà!
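
A quick numeric check in the scalar case (the numbers are made up): take $\bar\mu_t = 0$, $\bar\sigma_t^2 = 4$, $c_t = 1$, $q_t = 1$, and a measurement $z_t = 2$. Then

$$k_t = \frac{4}{4 + 1} = 0.8, \qquad \mu_t = 0 + 0.8\,(2 - 0) = 1.6, \qquad \sigma_t^2 = (1 - 0.8)\cdot 4 = 0.8,$$

so the posterior mean sits much closer to the measurement, and the variance shrinks.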

For the last one,

$$\Sigma_t = (I - K_t C_t)\,\bar\Sigma_t.$$

Why? If $K_t = 0$, that means the measurement noise is too large, and we rely only on the state update: $\Sigma_t = \bar\Sigma_t$. If $K_t = C_t^{-1}$, the measurement is so good we just need that, and we are super certain: $\Sigma_t = 0$.
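
Both extremes can be read directly off the gain formula:

$$Q_t \to \infty \ \Rightarrow\ K_t \to 0, \qquad\qquad Q_t \to 0 \text{ (and } C_t \text{ invertible)} \ \Rightarrow\ K_t \to \bar\Sigma_t C_t^T \left(C_t \bar\Sigma_t C_t^T\right)^{-1} = C_t^{-1}.$$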


The derivation

The following notes are from Gauss-Markov Models, supplementary material from 16-831 (F14). It's clearer than the version in the book Probabilistic Robotics.

Say we have a Gaussian random vector $x \sim \mathcal{N}(\mu, \Sigma)$. It can be described either in moment parameters $(\mu, \Sigma)$ or in natural (information) parameters $(\eta, \Lambda)$, with $\Lambda = \Sigma^{-1}$ and $\eta = \Sigma^{-1} \mu$.

Linear transformation, $y = Ax + b$, is easier with the moment parameterization: $y \sim \mathcal{N}(A\mu + b,\ A\Sigma A^T)$.
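
This is exactly what the prediction step uses: with $A = A_t$, $b = B_t u_t$, and the independent noise $\epsilon_t \sim \mathcal{N}(0, R_t)$ added on top,

$$\overline{bel}(x_t) = \mathcal{N}\!\left(A_t \mu_{t-1} + B_t u_t,\ \ A_t \Sigma_{t-1} A_t^T + R_t\right),$$

which is lines 2 and 3 of the algorithm.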

Conditioning, getting $p(x \mid y)$ from the joint distribution $p(x, y)$, is easier with the natural parameterization. Note that conditioning basically starts from the joint distribution and then treats $y$ as “known”.
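
Concretely (this is the standard Gaussian conditioning identity, with my own partition notation): if the joint natural parameters are partitioned as

$$\eta = \begin{pmatrix} \eta_x \\ \eta_y \end{pmatrix}, \qquad \Lambda = \begin{pmatrix} \Lambda_{xx} & \Lambda_{xy} \\ \Lambda_{yx} & \Lambda_{yy} \end{pmatrix},$$

then conditioning on an observed $y$ is just a read-off:

$$\Lambda_{x \mid y} = \Lambda_{xx}, \qquad \eta_{x \mid y} = \eta_x - \Lambda_{xy}\, y.$$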

If we want to multiply two Gaussian likelihood functions, say with natural parameters $(\eta_1, \Lambda_1)$ and $(\eta_2, \Lambda_2)$, then the posterior can be simply computed by adding them:

$$\eta = \eta_1 + \eta_2, \qquad \Lambda = \Lambda_1 + \Lambda_2.$$
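
In the Kalman setting, the likelihood $p(z_t \mid x_t)$ contributes natural parameters $C_t^T Q_t^{-1} z_t$ and $C_t^T Q_t^{-1} C_t$, so the measurement update in information form is just (this is the standard information-filter update, spelled out here for concreteness)

$$\eta_t = \bar\eta_t + C_t^T Q_t^{-1} z_t, \qquad \Lambda_t = \bar\Lambda_t + C_t^T Q_t^{-1} C_t.$$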

Now with these in mind, we can set up a Gauss-Markov model. I'm too tired now to repeat the stuff in the PDF, but the general idea is that we do it in two steps: one prediction / rollup, one conditioning.

The former computes $\overline{bel}(x_t) = p(x_t \mid z_{1:t-1}, u_{1:t})$ and relies on the linear-transformation rule above. The latter is $bel(x_t) = p(x_t \mid z_{1:t}, u_{1:t})$ and relies on the previous formula (conditioning on $z_t$, i.e. multiplying in the likelihood). The first step is easier in moment parameters, while the latter is easier in natural ones.
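
In Bayes filter notation (as in Probabilistic Robotics), the two steps are

$$\overline{bel}(x_t) = \int p(x_t \mid x_{t-1}, u_t)\, bel(x_{t-1})\, dx_{t-1}, \qquad bel(x_t) \propto p(z_t \mid x_t)\, \overline{bel}(x_t).$$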

If we use the Sherman-Morrison-Woodbury formula to convert the natural parameterization back to the moment one, we get our familiar Kalman filter, which is an algorithm, not the underlying probabilistic model.
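
For the covariance, the conversion is the identity (standard, spelled out here as a check)

$$\Sigma_t = \left(\bar\Sigma_t^{-1} + C_t^T Q_t^{-1} C_t\right)^{-1} = \bar\Sigma_t - \bar\Sigma_t C_t^T \left(C_t \bar\Sigma_t C_t^T + Q_t\right)^{-1} C_t \bar\Sigma_t = (I - K_t C_t)\,\bar\Sigma_t,$$

which is exactly line 6 of the algorithm.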

The "information" is Fisher information

The precision matrix $\Lambda = \Sigma^{-1}$ in the natural parameterization is the Fisher information about the state carried by the current belief. The measurement update adds the Fisher information from the new observation. This is why the update is additive in natural parameters: Fisher information from independent observations adds. In the linear-Gaussian setting the Kalman filter attains the Cramér-Rao bound: $\mu_t$ is the minimum-variance unbiased estimator, and $\Sigma_t^{-1}$ is the total accumulated Fisher information.
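
A quick check of the claim for the measurement model (my own verification): $\log p(z_t \mid x_t) = -\tfrac{1}{2}(z_t - C_t x_t)^T Q_t^{-1} (z_t - C_t x_t) + \text{const}$, so the Fisher information it carries about $x_t$ is

$$\mathcal{I}(x_t) = -\mathbb{E}\!\left[\nabla_{x_t}^2 \log p(z_t \mid x_t)\right] = C_t^T Q_t^{-1} C_t,$$

which is exactly the term the information-form measurement update adds to the precision matrix.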