Kolmogorov equations (original) (raw)

From Wikipedia, the free encyclopedia

Equations characterizing continuous-time Markov processes

In probability theory, Kolmogorov equations, including Kolmogorov forward equations and Kolmogorov backward equations, characterize continuous-time Markov processes. In particular, they describe how the probability of a continuous-time Markov process in a certain state changes over time.

Diffusion processes vs. jump processes

[edit]

Writing in 1931, Andrei Kolmogorov started from the theory of discrete time Markov processes, which are described by the Chapman–Kolmogorov equation, and sought to derive a theory of continuous time Markov processes by extending this equation. He found that there are two kinds of continuous time Markov processes, depending on the assumed behavior over small intervals of time:

If you assume that "in a small time interval there is an overwhelming probability that the state will remain unchanged; however, if it changes, the change may be radical",[1] then you are led to what are called jump processes.

The other case leads to processes such as those "represented by diffusion and by Brownian motion; there it is certain that some change will occur in any time interval, however small; only, here it is certain that the changes during small time intervals will be also small".[1]

For each of these two kinds of processes, Kolmogorov derived a forward and a backward system of equations (four in all).

The equations are named after Andrei Kolmogorov since they were highlighted in his 1931 foundational work.[2]

William Feller, in 1949, used the names "forward equation" and "backward equation" for his more general version of the Kolmogorov's pair, in both jump and diffusion processes.[1] Much later, in 1956, he referred to the equations for the jump process as "Kolmogorov forward equations" and "Kolmogorov backward equations".[3]

Other authors, such as Motoo Kimura,[4] referred to the diffusion (Fokker–Planck) equation as Kolmogorov forward equation, a name that has persisted.

Continuous-time Markov chains

[edit]

The original derivation of the equations by Kolmogorov starts with the Chapman–Kolmogorov equation (Kolmogorov called it fundamental equation) for time-continuous and differentiable Markov processes on a finite, discrete state space.[2] In this formulation, it is assumed that the probabilities P ( x , s ; y , t ) {\displaystyle P(x,s;y,t)} {\displaystyle P(x,s;y,t)} are continuous and differentiable functions of t > s {\displaystyle t>s} {\displaystyle t>s}, where x , y ∈ Ω {\displaystyle x,y\in \Omega } {\displaystyle x,y\in \Omega } (the state space) and t > s , t , s ∈ R ≥ 0 {\displaystyle t>s,t,s\in \mathbb {R} _{\geq 0}} {\displaystyle t>s,t,s\in \mathbb {R} _{\geq 0}} are the final and initial times, respectively. Also, adequate limit properties for the derivatives are assumed. Feller derives the equations under slightly different conditions, starting with the concept of purely discontinuous Markov process and then formulating them for more general state spaces.[5] Feller proves the existence of solutions of probabilistic character to the Kolmogorov forward equations and Kolmogorov backward equations under natural conditions.[5]

For the case of a countable state space we put i , j {\displaystyle i,j} {\displaystyle i,j} in place of x , y {\displaystyle x,y} {\displaystyle x,y}. The Kolmogorov forward equations read

∂ P i j ∂ t ( s ; t ) = ∑ k P i k ( s ; t ) A k j ( t ) {\displaystyle {\frac {\partial P_{ij}}{\partial t}}(s;t)=\sum _{k}P_{ik}(s;t)A_{kj}(t)} {\displaystyle {\frac {\partial P_{ij}}{\partial t}}(s;t)=\sum _{k}P_{ik}(s;t)A_{kj}(t)},

where A ( t ) {\displaystyle A(t)} {\displaystyle A(t)} is the transition rate matrix (also known as the generator matrix),

while the Kolmogorov backward equations are

∂ P i j ∂ s ( s ; t ) = − ∑ k P k j ( s ; t ) A i k ( s ) {\displaystyle {\frac {\partial P_{ij}}{\partial s}}(s;t)=-\sum _{k}P_{kj}(s;t)A_{ik}(s)} {\displaystyle {\frac {\partial P_{ij}}{\partial s}}(s;t)=-\sum _{k}P_{kj}(s;t)A_{ik}(s)}

The functions P i j ( s ; t ) {\displaystyle P_{ij}(s;t)} {\displaystyle P_{ij}(s;t)} are continuous and differentiable in both time arguments. They represent the probability that the system that was in state i {\displaystyle i} {\displaystyle i} at time s {\displaystyle s} {\displaystyle s} jumps to state j {\displaystyle j} {\displaystyle j} at some later time t > s {\displaystyle t>s} {\displaystyle t>s}. The continuous quantities A i j ( t ) {\displaystyle A_{ij}(t)} {\displaystyle A_{ij}(t)} satisfy

A i j ( t ) = [ ∂ P i j ∂ u ( t ; u ) ] u = t , A j k ( t ) ≥ 0 , j ≠ k , ∑ k A j k ( t ) = 0. {\displaystyle A_{ij}(t)=\left[{\frac {\partial P_{ij}}{\partial u}}(t;u)\right]_{u=t},\quad A_{jk}(t)\geq 0,\ j\neq k,\quad \sum _{k}A_{jk}(t)=0.} {\displaystyle A_{ij}(t)=\left[{\frac {\partial P_{ij}}{\partial u}}(t;u)\right]_{u=t},\quad A_{jk}(t)\geq 0,\ j\neq k,\quad \sum _{k}A_{jk}(t)=0.}

Relation with the generating function

[edit]

Still in the discrete state case, letting s = 0 {\displaystyle s=0} {\displaystyle s=0} and assuming that the system initially is found in state i {\displaystyle i} {\displaystyle i}, the Kolmogorov forward equations describe an initial-value problem for finding the probabilities of the process, given the quantities A j k ( t ) {\displaystyle A_{jk}(t)} {\displaystyle A_{jk}(t)}. We write p k ( t ) = P i k ( 0 ; t ) {\displaystyle p_{k}(t)=P_{ik}(0;t)} {\displaystyle p_{k}(t)=P_{ik}(0;t)} where ∑ k p k ( t ) = 1 {\displaystyle \sum _{k}p_{k}(t)=1} {\displaystyle \sum _{k}p_{k}(t)=1}, then

d p k d t ( t ) = ∑ j A j k ( t ) p j ( t ) ; p k ( 0 ) = δ i k , k = 0 , 1 , … . {\displaystyle {\frac {dp_{k}}{dt}}(t)=\sum _{j}A_{jk}(t)p_{j}(t);\quad p_{k}(0)=\delta _{ik},\qquad k=0,1,\dots .} {\displaystyle {\frac {dp_{k}}{dt}}(t)=\sum _{j}A_{jk}(t)p_{j}(t);\quad p_{k}(0)=\delta _{ik},\qquad k=0,1,\dots .}

For the case of a pure death process with constant rates the only nonzero coefficients are A j , j − 1 = μ j , j ≥ 1 {\displaystyle A_{j,j-1}=\mu _{j},\ j\geq 1} {\displaystyle A_{j,j-1}=\mu _{j},\ j\geq 1}. Letting

Ψ ( x , t ) = ∑ k x k p k ( t ) , {\displaystyle \Psi (x,t)=\sum _{k}x^{k}p_{k}(t),\quad } {\displaystyle \Psi (x,t)=\sum _{k}x^{k}p_{k}(t),\quad }

the system of equations can in this case be recast as a partial differential equation for Ψ ( x , t ) {\displaystyle {\Psi }(x,t)} {\displaystyle {\Psi }(x,t)} with initial condition Ψ ( x , 0 ) = x i {\displaystyle \Psi (x,0)=x^{i}} {\displaystyle \Psi (x,0)=x^{i}}. After some manipulations, the system of equations reads,[6]

∂ Ψ ∂ t ( x , t ) = μ ( 1 − x ) ∂ Ψ ∂ x ( x , t ) ; Ψ ( x , 0 ) = x i , Ψ ( 1 , t ) = 1. {\displaystyle {\frac {\partial \Psi }{\partial t}}(x,t)=\mu (1-x){\frac {\partial {\Psi }}{\partial x}}(x,t);\qquad \Psi (x,0)=x^{i},\quad \Psi (1,t)=1.} {\displaystyle {\frac {\partial \Psi }{\partial t}}(x,t)=\mu (1-x){\frac {\partial {\Psi }}{\partial x}}(x,t);\qquad \Psi (x,0)=x^{i},\quad \Psi (1,t)=1.}

An example from biology

[edit]

One example from biology is given below:[7]

p n ′ ( t ) = ( n − 1 ) β p n − 1 ( t ) − n β p n ( t ) {\displaystyle p_{n}'(t)=(n-1)\beta p_{n-1}(t)-n\beta p_{n}(t)} {\displaystyle p_{n}'(t)=(n-1)\beta p_{n-1}(t)-n\beta p_{n}(t)}

This equation is applied to model population growth with birth. Where n {\displaystyle n} {\displaystyle n} is the population index, with reference the initial population, β {\displaystyle \beta } {\displaystyle \beta } is the birth rate, and finally p n ( t ) = Pr ( N ( t ) = n ) {\displaystyle p_{n}(t)=\Pr(N(t)=n)} {\displaystyle p_{n}(t)=\Pr(N(t)=n)}, i.e. the probability of achieving a certain population size.

The analytical solution is:[7]

p n ( t ) = ( n − 1 ) β e − n β t ∫ 0 t p n − 1 ( s ) e n β s d s {\displaystyle p_{n}(t)=(n-1)\beta e^{-n\beta t}\int _{0}^{t}\!p_{n-1}(s)\,e^{n\beta s}\mathrm {d} s} {\displaystyle p_{n}(t)=(n-1)\beta e^{-n\beta t}\int _{0}^{t}\!p_{n-1}(s)\,e^{n\beta s}\mathrm {d} s}

This is a formula for the probability p n ( t ) {\displaystyle p_{n}(t)} {\displaystyle p_{n}(t)} in terms of the preceding ones, i.e. p n − 1 ( t ) {\displaystyle p_{n-1}(t)} {\displaystyle p_{n-1}(t)}.

  1. ^ a b c Feller, W. (1949). "On the Theory of Stochastic Processes, with Particular Reference to Applications". Proceedings of the (First) Berkeley Symposium on Mathematical Statistics and Probability. Vol. 1. University of California Press. pp. 403–432.
  2. ^ a b Kolmogorov, Andrei (1931). "Über die analytischen Methoden in der Wahrscheinlichkeitsrechnung" [On Analytical Methods in the Theory of Probability]. Mathematische Annalen (in German). 104: 415–458. doi:10.1007/BF01457949. S2CID 119439925.
  3. ^ Feller, William (1957). "On Boundaries and Lateral Conditions for the Kolmogorov Differential Equations". Annals of Mathematics. 65 (3): 527–570. doi:10.2307/1970064. JSTOR 1970064.
  4. ^ Kimura, Motoo (1957). "Some Problems of Stochastic Processes in Genetics". Annals of Mathematical Statistics. 28 (4): 882–901. doi:10.1214/aoms/1177706791. JSTOR 2237051.
  5. ^ a b Feller, Willy (1940) "On the Integro-Differential Equations of Purely Discontinuous Markoff Processes", Transactions of the American Mathematical Society, 48 (3), 488-515 JSTOR 1990095
  6. ^ Bailey, Norman T.J. (1990) The Elements of Stochastic Processes with Applications to the Natural Sciences, Wiley. ISBN 0-471-52368-2 (page 90)
  7. ^ a b Logan, J. David; Wolesensky, William R. (2009). Mathematical Methods in Biology. Pure and Applied Mathematics. John Wiley& Sons. pp. 325–327. ISBN 978-0-470-52587-6.