Euclidean Quantum Mechanics and Universal Nonlinear Filtering

Euclidean Quantum Mechanics and Universal Nonlinear Filtering

Bhashyam Balaji
Radar Systems Section,
Defence Research and Development Canada, Ottawa,
3701 Carling Avenue,
Ottawa ON K1A 0Z4 Canada
Email: Bhashyam.Balaji@drdc-rddc.gc.ca
Abstract:

An important problem in applied science is the continuous nonlinear filtering problem, i.e., the estimation of a Langevin state that is observed indirectly. In this paper, it is shown that Euclidean quantum mechanics is closely related to the continuous nonlinear filtering problem. The key is the configuration space Feynman path integral representation of the fundamental solution of a Fokker-Planck type of equation termed the Yau Equation of continuous-continuous filtering. A corollary is the equivalence between nonlinear filtering problem and a time-varying Schrödinger equation.

Fokker-Planck Equation, Kolmogorov Equation, stochastic differential equations, Duncan-Mortensen-Zakai (DMZ) Equation, nonlinear filtering, Feynman path integral

1 Introduction

The problem of the evolution of a Langevin state, or a signal of interest, described by a continuous-time stochastic dynamical model arises frequently in practice. Specifically, in (classical) filtering probelms one deals with macroscopic objects whose state variables are phenomenologically well-described by classical deterministic laws modified by external disturbances that can be modelled as random noise. So the state of the system is described by a noisy version of a deterministic nonlinear dynamical system termed the state model, i.e., the dynamics is governed by a system of first-order differential equations in the state variable () with an additional contribution due to noise that is random (). The noise in the state model is referred to as the signal noise. If the noise is Gaussian (or more generally multiplicative Gaussian) the state process is a Markov process. Since the process is stochastic, the state process is completely characterized by a probability density function. The Fokker-Planck-Kolmogorov foward equation (FPKfe) describes the evolution of this probability density function (or equivalently, the transition probability density function) and is the complete solution of the state evolution problem[1].

However, in many applications the signal, or state variables, cannot be directly observed. Instead, what is measured is a related stochastic process () called the measurement process. The measurement process can often be modelled as yet another nonlinear continuous-time stochastic dynamical system called the measurement model. Loosely speaking, the observations, or measurements, are discrete-time samples drawn from a different system of noisy first order differential equations. The noise in the measurement dynamical system is referred to as measurement noise. The continuous-continuous “filtering” problem is the estimation of the continuous-time signal or state process given the (noisy) observations of a related continuous-time stochastic process (the measurement or observation process). For a recent discussion, see [2].

The conditional probability density is the complete solution of the filtering problem. That is, it embodies all the probabilistic information about the state process contained in the observations and in the initial condition. Here the Bayesian point of view is being adopted. From the conditional probability density, an optimal estimate may be computed for any loss function. For instance, the minimum variance estimate is the conditional mean. The solution of the nonlinear filtering problem is said to be universal if the initial distribution can be arbitrary111A classic discussion of nonlinear filtering theory can be found in the text by Jazwinski[3]. For a more up-to-date discussion, see [4]...

The traditional approach to the solution of the continuous-continuous universal nonlinear filtering problem requires the solution of the Duncan-Mortensen-Zakai (DMZ) equation, a stochastic differential equation (SDE) describing the unnormalized conditional probability density. The DMZ equation can be gauge transformed to the time-varying partial differential equation termed the robust DMZ equation. However, since the robust DMZ equation is a partial differential equation (PDE) with the coefficients depending on the measurements the PDE cannot be solved off-line, or in real time. In [5], it was proved that solving the robust DMZ equation is equivalent to solving a PDE, termed the Yau Equation (YYe) of continuous-continuous filtering, whose coefficients are independent of the measurements. Hence, the YYe can be solved off-line and also in a memoryless way.

Recently, it has been noted that the Feynman path integral222For a classic review of path integrals, see [6]. can be used to solve the general nonlinear filtering problem. In fact, it was shown that the path integral formulation of the continuous-continuous filtering problem directly leads to the YYe[4]. The path integral formula for the fundamental solution of the YYe was also derived in [4]. The advantages of the Feynman path integral formulation are many, including a completely independent and self-contained formulation and solution of the general nonlinear filtering problem (than is traditional in filtering theory literature which is based on measure theoretic techniques termed the Feynman-Kǎc formalism), and simple, efficient and accurate algorithms implementable in real-time[2].

The Feynman path integral has proven to be a very powerful tool in modern theoretical physics, often leading to results that are not evident using other methods. In this paper, one such instance is provided in the application to the filtering problem. In particular, it is demonstrated that there is a very close relationship between nonlinear filtering theory and Euclidean quantum mechanics. Specifically, the fundamental solution of the YYe may be viewed as the expectation of a certain operator in a Euclidean quantum mechanical system. This follows from the path integral formula for the fundamental solution of the YYe derived in [4]. This result generalizes the equivalence between nonlinear filtering and the Euclidean Schrödinger equation derived in [7] via more complicated means. Their result was limited to a filtering system where the signal model drift is the same as in the finite-dimensional Yau filter333Their explicit solution imposed an additional condition on the measurement model drift, but that is not relevant for our discussion here.. This result does not require that assumption—the signal model is assumed to have additive noise (or somewhat more generally, orthogonal diffusion vielbein so that the diffusion matrix is proportional to the identity matrix), and the measurement model noise is also additive. Also, no explicit time dependence is assumed in the model.

The outline of our paper is as follows. In Section 2, notation and some of the important results for the real-time solution of the continuous-continuous nonlinear filtering problem are reviewed. In Section 3, are summaries of the results obtained using the Feynman path integral formulation, specifically the path integral expressions for the fundamental solution for the FPKfe and the YYe. In Section 4, the equivalence between Euclidean quantum mechanics and nonlinear filtering is presented. In the following section it is shown that it leads to the equivalence between nonlinear filtering and the time-varying Schrödinger equation obtained in [7]. In Section 6, some important conceptual issues are discussed which further clarify the relationship between the YYe and Euclidean quantum mechanics. The conclusions and directions of future work are presented in Section 7. In Appendix A, the path integral formula for the fundamental solution of the YYe that forms the basis of our work is verified.

2 Basic Results of Nonlinear Filtering

A very brief summary of results of filtering theory is provided in this section. For the purposes of this paper, the important result is that the solution of the filtering problem requires the solution of a Fokker-Planck type of equation called the Yau equation. A more complete discussion with references can be found in [4] and [2]. It is worth contrasting the simplicity of the Feynman path integral solution discussed in the next section with the traditional discussion summarized here.

2.1 From the DMZ SDE to the Robust DMZ PDE

The signal and observation model considered is the following:

(1)

Here and are valued stochastic processes, and and are valued stochastic processes, respectively, and . These are defined in the Itô sense. The processes and are assumed to be independent Brownian processes with variances and respectively. Also, is referred to as the signal (measurement) model drift, as the diffusion vielbein, and as the diffusion matrix. In this paper, the additive noise case is considered where is the identity matrix, although the same result holds if one assumes that the diffusion vielbein is orthogonal. Finally, no explicit time dependence is assumed in the model.

The unnormalized conditional probability density, , of the state given the observations satisfies the DMZ stochastic differential equation:

(2)

Here

(3)

where is the zero-degree differential operator , is the probability density at the initial time . Under the following gauge transformation

(4)

the DMZ SDE is transformed into the following time-varying PDE called the robust DMZ equation [8]:

(5)

Here is the Laplacian. The solution of a PDE when the initial condition is a delta function is called its fundamental solution.

2.2 The Yau Equation of Continuous-Continuous Filtering

Recently, S-T. Yau and Stephen Yau made a major advance in the real-time solution of the general nonlinear filtering problem [5]. They began by observing that if satisfies the equation

(6)

in the time interval , then the function defined as

(7)

satisfies the parabolic partial differential equation termed the Yau Equation (YYe)

(8)

in the same time interval. The converse of the statement is also true. In [9], they further showed that it is sufficient to use the previous observation, i.e., satisfies Equation 6 if and only if defined as

(9)

satisfies Equation 8 in the time interval . Equation 7 ( Equation 9) and Equation 8 are referred to as the post-measurement (pre-measurement) forms of the YYe.

Equation 6 can also be obtained by setting to in Equation 5. It was proved that the solution of Equation 6 approximates very well the solution of the robust DMZ equation (Equation 5), i.e., it converges to in both the pointwise sense and the sense. Thus, solving Equation 5 is equivalent to solving Equation 8. In fact, in [10] it was proved that converges to .

There are two important points to note. The coefficients of the robust DMZ equation (Equation 5) contain the measurements. Consequently, the partial differential equation to be solved for continuous-continuous filtering is unknown prior to measurements. In other words, the robust DMZ equation has to be solved on-line; its solution cannot be pre-computed. By contrast, in the YYe (Equation 8), measurements are absent from the partial differential equation. The measurements only enter the initial condition at each measurement step. This implies that the YYe can be be solved off-line; there is no need for on-line solution of PDEs444This assumes that the measurement steps are equidistant or known.. This makes real-time solution feasible even if the state dimension is large. The algorithm based on the YYe is termed the Yau algorithm

In addition, it is noted that all other known algorithms for continuous-continuous filtering assume boundedness of the measurement model drift (see discussion in [10] and [11]). This is a highly restrictive assumption and implies, for instance, that they cannot even handle the linear model which the Kalman filter handles very well. The Yau algorithm is shown to converge to the true solution under much weaker conditions on the drift (see [5],[9], [11]). In that sense, the Yau algorithm is the only practical nonlinear filtering algorithm for the general filtering problem. Also note that from a path integral perspective, the Yau algorithm is the most natural one.

The simplicity of the YYe has already led to some important advances. In particular, it has led to a demonstration of an equivalence between solving a class of nonlinear filtering problems and the time-varying Schrödinger equation[7]. As a result, the Yau PDE can be reduced to a system of ODEs explicitly solvable using the power series method. This result will be shown to be a corollary of the main result of this paper.

3 Path Integral Formula for the Fundamental solution of the YYe

In recent papers, it was demonstrated that the path integral method leads to an independent formulation and solution of the universal nonlinear filtering problem [12, 4]. Specifically, the path integral formula for the transition probability density in continuous-discrete and continuous-continuous filtering was investigated. One of the results of the aforementioned work is the path integral formula for the fundamental solution of the FPKfe and the YYe. Some of the relevant results from the papers are reviewed below.

When the diffusion matrix is , the transition probability satisfies the FPKfe (see, for instance, [1])

(10)

The fundamental solution of this FPKfe can be shown to be given by (see [13] and [12])

(11)

In continuous-continuous filtering, it is necessary to incorporate the measurement stochastic process as well as the signal process. That is, the measurement process ensemble must also be considered. The inclusion of measurement noise means that each system in the ensemble leads to a different time-dependent vector . Although only one realization of the measurement stochastic process is observed, it is still meaningful to talk about an ensemble average of the measurement process (in addition to an ensemble average over the state process). Thus, the quantity of interest in continuous-continuous filtering is

(12)

where denotes averaging with respect to measurement noise . It has been shown in [4] that the transition probability density conditional on the measurements is given by555Here the “post-measurement” form is used; the pre-measurement form does not affect the conclusion[4].

(13)

where

(14)

and

(15)

It can be shown that is the fundamental solution of the YYe[4]. For completeness, a verification of this result is presented in the Appendix.

The transition probability density is the complete solution to the continuous filtering problem. Thus if the initial distribution is , then the evolved conditional probability distribution is given by

(16)

4 YYe and Euclidean Quantum Mechanics

In this section it is shown that there is a general equivalence between the YYe and Euclidean quantum mechanics.

4.1 A General Equivalence

Consider the path integral formula for the fundamental solution of the Yau Equation

(17)

where the action is

(18)

Integrating by parts, the third term above yields

(19)

Here the Feynman convention is used implicitly, i.e., symmetrized arguments of , so that the ordinary rules of calculus apply. Thus, the action simplifies to

(20)

We can therefore write the fundamental solution as

(21)

where the “Lagrangian” , is

(22)

and

(23)

Consider a Euclidean quantum mechanical system with the Hamiltonian

(24)

Then, performing the path integral quantization of this system will lead to the following expression of the transition probability amplitude:

(25)

Now, let us consider the following matrix element:

(26)

In the general case, the value of the line integral depends on the path. Therefore, it cannot be “factored out”. After all, it is a part of the path integral, which is a weighted sum over all paths. This is not a “gauge transformation” in the traditional sense (since it is not outside of the path integral). This expectation value can be evaluated perturbatively, or more generally, non-perturbatively.

However, it can be viewed as an expectation of the signal model drift line integral operator, i.e.,

(27)

in the quantum mechanical system with Lagrangian given by Equation 22.

Note that there are several differences between this operator and the Wilson lines that arise in quantum field theories. First of all, this is a quantum mechanical system, not a quantum field theory. Secondly, in field theory the coordinates are a parameter, whereas here (as in quantum mechanics) the coordinates are quantized and not an operator.

This can be summarized as follows: The fundamental solution of the Yau Equation, Equation 8, is the same as the matrix element of the operator in Equation 27 for the Euclidean quantum mechanical system with the Hamiltonian as in Equation 24.

4.2 A Special Case

The quantum mechanical equivalence can be made even more direct for a certain class of the signal model drift. This reault is known and shown here using path integral methods. Specifically, suppose the line integral is path independent. Then, it can be factored out of the path integral since its value does not depend on the path. It is straightforward to see when this is possible.

In the one-dimensional case, it is always possible to write the (smooth) drift as a gradient:

(28)

since is simply given by the integral of .

In the general dimensional case, path independence of the line integral requires that the drift be a gradient:

(29)

A special case is the linear symmetric case. Specifically, if is a symmetric matrix the drift as

(30)

where

(31)

Clearly, the the state model drift for the Yau filtering system with a symmetric matrix is also a gradient of some scalar.

In all such cases, the expectation of the relevant operator is simply

(32)

In fact,

(33)

A gauge transformation relates the fundamental solution of the Yau Equation and the corresponding Euclidean quantum mechanical system.

The discussion in this subsection can be summarized as follows: When the signal model drift is a gradient of a scalar field, the fundamental solution of the Yau Equation is, up to a gauge transformation, the same as the transition probability amplitude of a Euclidean quantum mechanical system with Lagrangian given by Equation 22.

5 Nonlinear Yau Filtering System and Time-Dependent Schrödinger equation

5.1 The Work of S-T. Yau and Stephen Yau

As discussed in Section 2, in order to obtain the unnormalized conditional probability , it suffices to solve the YYe

(34)

In [7], the authors considered the filtering system signal model drift

(35)

where is an anti-symmetric matrix. The symmetric part can be incorporated in . By considering the quantity defined by

(36)

they showed that it is sufficient to solve the following Schrödinger equation

(37)

where

(38)

and

(39)

Here,

(40)

and is an orthogonal matrix.

5.2 Path Integral Derivation

We now derive using path integral methods the result discussed in the previous sectoion relating the nonlinear Yau filtering system to a Schrödinger equation using path integral methods. In matrix notation, the Lagrangian for the Yau filter state model is

(41)

Since

(42)

it follows that

(43)

where is as defined in Equation 38. The term can be integrated out of the path integral and simply yields a phase factor

(44)

Now, in terms of

(45)
(46)

so that

(47)
(48)

Therefore, in terms of , the Lagrangian is

(49)

Combining this with Equation 44, the fundamental solution becomes

(50)

where is given by Equation 49, and this equation defines . Note that the Jacobian of the transformation from the measure to is unity since is an orthogonal matrix.

It follows from the standard path integral formulation of quantum mechanics that is just the path integral representation of the fundamental solution of the following Schrödinger equation:

(51)

This is precisely the equation obtained in [7]. For the initial condition, since

(52)

it follows that

(53)

which is the same as Equation 50. Thus the result in [7] has been independently established.

6 Additional Remarks

A few comments on our results are in order.

  1. The general equivalence proposed in Section 4.1 is that between the fundamental solution of the FPKfe and a certain matrix element of a quantum mechanical system. Note that all previous equivalences were between fundamental solutions of FPKfe and a Schrödinger equation. While the previous equivalences were obtainable using operator methods, it is not clear how the proposed general equivalence can be derived using operator methods.

  2. The Feynman path integral methods used here are very different from the measure-theoretic methods used to study the nonlinear filtering problem (see, for instance, [14]). The Feynman path integral approach developed in [4, 2] and this paper leads to an independent way of tackling the universal nonlinear filtering problem. In particular, unlike the standard filtering theory approaches, the DMZ equation (or its variants) is not taken as an input. On the contrary, the YYe is obtained directly as a consequence of the path integral approach. It is also noted that the Euclidean quantum physics referred to in the equivalence to nonlinear filtering developed in this paper is a quantum mechanical one, not a quantum field theoretical one. Also note that here is analogous to the Planck’s constant, , in quantum physics.

  3. Although a formal equivalence has been developed between Euclidean quantum mechanics and universal nonlinear filtering, it is important to point out that there is a profound difference between classical and quantum probabilities. In the “real time” (as opposed to Euclidean time) quantum mechanics, the transition probability amplitude is not a probability; it may not even be real. The probability amplitude is to be multiplied with its complex conjugate to obtain a probability density. In contrast, the transition probability density in filtering theory is a classical probability density. We have shown that, in a mathematical sense, matrix elements in a Euclidean quantum mechanical system equal the transition probability density of a classical stochastic process. This equivalence is purely mathematical, not conceptual.

  4. The reason for some of the mathematical equivalence between nonlinear filtering and quantum physics is the semi-group property. That is, in stochastic processes the Chapman-Kolmogorov semi-group property is a fundamental property of Markov processes. The Chapman-Kolmogorov semi-group property allows the transition probability density in stochastic process to be written as follows. Let us partition the time interval into equi-spaced time intervals so that where . Then, from the Chapman-Kolmogorov semi-group property it follows that

    (54)

    where and and the delta function condition in the definition of is written as integration limits. On the other hand, in quantum physics, the semi-group property is a basic property of the time-evolution operator. It is the semi-group property of the time-evolution operator that leads to the path integral formula for the transition probability amplitude.

  5. Note that the unitarity of the time evolution operator plays a crucial role in quantum mechanics. This is because it is essential for conservation of probability. It is noted that the unitarity of the time evolution operator implies that the Hamiltonian operator is Hermitian666In the standard formulation of quantum mechanics, Hermiticity of the Hamiltonian is required in order to ensure that the eigenvalues of the Hamiltonian (and hence the possible energies) are real and that the time evolution is unitary (i.e., conserves probability). This can also be ensured even if the Hamiltonian is not Hermitian provided that the Hamiltonian is space-time (or ) reflection symmetric. For a pedagogical discussion of non-Hermitian quantum mechanics, see [15]..

    The evolution of the probability distribution for a continuous Markov processes is described by the FPKfe. The FPKfe operator is not a Hermitian operator. This is not inconsistent with the conservation of probability. This is because the FPKfe is a continuity equation, and so the probability is conserved (as long as the boundary terms vanish). This is yet another instance of the profound difference between classical and quantum probabilities.

  6. In the continuous-continuous filtering case, the YYe plays the role that the FPKfe plays in continuous-discrete filtering. However, the YYe is not a continuity equation. This is not a contradiction since the YYe is related to the unnormalized conditional probability density, whereas the FPKfe evolves (and preserves the normalization of) the normalized probability density.

  7. In this paper, it has been assumed that the noise is additive and the model is not explicitly time-dependent. In the more general case, a simple equivalence, as derived in this paper, is not possible. For instance, in order to obtain the YYe, it was necessary to assume that the measurement model was not explicitly time dependent (see [4]); this is not valid in the general case. Also, when the noise is multiplicative, quantization is not as straightforward. This can be traced to the well-known operator ordering ambiguity in quantum physics (since the position and momentum operators do not commute). Furthermore, standard calculus manipulations are no longer possible in the path integral. However, the path integral result is the same even for the multiplicative noise case if the diffusion matrix is proportional to the identity matrix.

Finally, note that the results of this paper also apply to the FPKfe itself, which is simply the case , in a straightforward manner.

7 Conclusions and Future Work

The main conclusion of this paper is that certain continous-continuous nonlinear filtering problems are related to Euclidean quantum mechanics. Specifically, the transition probability density for nonlinear filtering problems with additive noise and with square diffusion vielbein and explicitly time-independent drift is the fundamental solution of the YYe, and is the same as the expectation of a certain operator in a quantum mechanical system. A corollary of this general result is a derivation of the relationship between the YYe in the Yau filtering system and a time-varying Schrödinger equation which was first derived earlier. This equivalence leads to some useful results from the methods used in quantum physics.

There are many possible directions for future work:

  1. The path integral formula can be exactly solvable, or lead to simpler methods of obtaining solutions, in some simple cases. In fact, for the model studied in [7], an explicit path integral solution is described in a paper currently in preparation [16].

  2. The equivalence to Euclidean quantum mechanics immediately leads to many filtering problems that can be solved exactly, as will be explored in future papers. The remarkable fact is that many exactly solvable Euclidean quantum mechanical problems correspond to filtering problems that are not finite-dimensional (i.e., Lie algebra of and is not finite-dimensional, see [4]) . Thus, simplicity in filtering theory does not imply finite dimensionality.

  3. The path integral formulation naturally leads to a perturbative solution of the nonlinear filtering problem. Such a perturbative solution which is analogous to extended Kalman filtering is called extended Yau filtering (EYF). Thus, it is possible to perturb about the Yau filter (a generalization of the linear Kalman filter), rather than the linear Kalman filter. Clearly, since the EKF is a special case of the EYF, such a formulation will be superior to the EKF.

  4. However, it is noted that the fundamental solution is defined nonperturbatively. This is important since sometimes the perturbative approaches, like EKF, fail.

  5. Perhaps the most important advantage of the path integral lies in numerical methods of computation [17]. In fact, path integrals are the only known way to carry out nonperturbative computation in quantum chromodynamics (QCD). Note that QCD, the gauge theory of strong interactions, is a quantum field theory, not merely quantum mechanical and the QCD Lagrangian is highly nonlinear. The excellent numerical results in QCD suggest that currently known path integral methods should give very good performance for the simpler case of (large dimensional) universal nonlinear filtering. It is sufficient to note that the Dirac-Feynman approximation, the crudest approximation of the path integral, already yields excellent results (see [2]), and is adequate for smaller dimensional problems.

It is clear that many lines of investigation are suggested by the path integral methods and it is planned that results of those investigations will be presented in future papers.

Appendix A Verification of the Path Integral Formula

It is now proven that the path integral formula Equation 13 satisfies the YYe. This closely follows the method used by Feynman to verify the path integral formula for the Schrödinger equation in quantum mechanics.

According to the Chapman-Kolmogorov semi-group property

(55)

where is given by Equation 14. When is an infinitesimal

(56)

where . The dominant contribution occurs when the following condition is satisfied:

(57)

We may write in this region

(58)

where the equalities are valid to . Substituting this into Equation 56 (keeping terms up to )

(59)

It is noted that the Jacobian of the the transformation from to to is included in the second step.

The constant is fixed by

(60)

Hence, it follows that