A numerical algorithm for fully nonlinear HJB equations:
an approach by control randomization
Abstract
We propose a probabilistic numerical algorithm to solve Backward Stochastic Differential Equations (BSDEs) with nonnegative jumps, a class of BSDEs introduced in [9] for representing fully nonlinear HJB equations. In particular, this allows us to numerically solve stochastic control problems with controlled volatility, possibly degenerate. Our backward scheme, based on leastsquares regressions, takes advantage of highdimensional properties of MonteCarlo methods, and also provides a parametric estimate in feedback form for the optimal control. A partial analysis of the error of the scheme is provided, as well as numerical tests on the problem of superreplication of option with uncertain volatilities and/or correlations, including a detailed comparison with the numerical results from the alternative scheme proposed in [7].
Key words: Backward stochastic differential equations, control randomization, HJB equation, uncertain volatility, empirical regressions, MonteCarlo.
MSC Classification: 60H10, 65Cxx, 93E20.
1 Introduction
Consider the following general HamiltonJacobiBellman (HJB) equation:
(1.1)  
where is a bounded subset of . It is well known that the HJB equation (1.1) is the dynamic programming equation for the following stochastic control problem:
(1.2)  
Moreover, it is proved in [9] that this HJB equation admits a probabilistic representation by means of a BSDE with nonpositive jumps. We recall below this construction.
Introduce a Poisson random measure on with finite intensity measure associated to the marked point process , independent of , and consider the pure jump process , valued in , defined as follows:
and interpreted as a randomization of the control process .
Next, consider the uncontrolled forward regime switching diffusion process
Observe that the pair process is Markov. Now, consider the following BSDE with jumps w.r.t. the BrownianPoisson filtration .
(1.3) 
where is the compensated measure of .
Finally, we constrain the jump component of the BSDE (1.3) to be nonpositive, i.e.
We denote by an upper bound for the compact set of , i.e. for all , and we make the standing assumptions:

The functions and are Lipschitz: there exists s.t.
for all , .

The functions and are Lipschitz continuous: there exists s.t.
for all , .
Under these conditions, we consider the minimal solution of the following constrained BSDE:
subject to the constraint
(1.5) 
By the Markov property of , there exists a deterministic function such that the minimal solution to (1)(1.5) satisfies , .
Now, the aim of this paper is to provide a numerical scheme for computing an approximation of the solution of the constrained BSDE (1)(1.5). In light of Theorem 1.1, this will provide an approximation of the solution of the general HJB equation (1.1), which encompasses stochastic control problems such as the one described in equation (1.2), ie. problems where both the drift and the volatility of the underlying diffusion can be controlled, including degenerate diffusion coefficient.
The outline of the subsequent sections is the following.
First, Section 2 describes our scheme. We start from a timediscretization of the problem, proposed in [8], which gives rise to a backward scheme involving the simulation of the forward regime switching process , hence taking advantage of highdimensional properties of MonteCarlo methods. The final step towards an implementable scheme is to approximate the conditional expectations that arise from this scheme. Here we use empirical leastsquares regression, as this method provides a parametric estimate in feedback form of the optimal control. A partial analysis of the impact of this approximation is provided, and the remaining obstacles towards a full analysis are highlighted.
Then, Section 3 is devoted to numerical tests of the scheme on various examples. The major application that takes advantage of the possibilities of our scheme is the problem of pricing and hedging contingent claims under uncertain volatility and (for multidimensional claims) correlation. Therefore most of this section is devoted to this specific application. To our knowledge, the only other Monte Carlo scheme for HJB equations that can handle continuous controls as well as controlled volatility is described in [7], where they make use of another generalization of BSDEs, namely secondorder BSDEs. Therefore we compare the performance of our scheme to the results provided in their paper.
Finally, Section 4 concludes the paper.
2 Regression scheme
Define a deterministic time grid := for the interval , with mesh where . Denote by . The discretization of the constrained BSDE (1)(1.5) can be written as follows:
(2.1) 
where .
First, remark that, from the Markov property of , there exist deterministic functions and such that . Hence and can be seen as intermediate quantities towards the discretetime approximation of the BSDE (1)(1.5) , which do not depend on .
Formally, the jump constraint (1.5) states that a.s., meaning that the minimal solution satisfies .
Moreover, one can extract from the scheme if needed. Indeed, denoting , i.e. , then .
Finally remark that the numerical scheme (2.1) is explicit, as we choose to define as a function of and not of .
The convergence of the solution of the discretized scheme (2.1) towards the solution of the constrained BSDE (1)(1.5) is thoroughly examined in [8]. In this paper, we start from the discrete version (2.1) and derive an implementable scheme from it.
Indeed, the discrete scheme (2.1) is in general not readily implementable because it involves conditional expectations that cannot be computed explicitly. It is thus necessary in practice to approximate these conditional expectations. Here we follow the empirical regression approach ([10, 3, 6, 18, 1]). In our context, apart from being easy to implement, the strong advantage of this choice is that, unlike other standard methods, it provides as a byproduct a parametric feedback estimate for the optimal control.
The idea is to replace the conditional expectations from (2.1) by empirical regressions. This section is devoted to the analysis of the error generated by this replacement.
2.1 Localizations
The first step is to localize the discrete BSDE (2.1), i.e. to truncate it so that it admits a.s. deterministic bounds. Introduce and and define the following truncations of and :
(2.2)  
(2.3) 
Define and define the localized version of the discrete BSDE (2.1), using the truncations (2.2) and (2.3).
(2.4) 
First, we check that this localized BSDE does admit a.s. bounds.
Lemma 2.1.
[almost sure bounds] For every and every , the following uniform bounds hold a.s.:
where , and
Proof.
First, as is continuous, there exists such that for all , . Hence
(2.5) 
Next,
Thus, using the CauchySchwarz inequality and dividing by :
Now, using Young’s inequality with :
Remark that
where . Hence
Thus, for every , one can group together the terms involving and using Jensen’s inequality:
where . Finally:
(2.6) 
Using equations (2.5) and (2.6), one obtains by induction that:
(2.7) 
where . Finally remark that
Thus
(2.8) 
And
(2.9)  
Finally, combine equations (2.7), (2.8) and (2.9) with and to obtain the following a.s. bound for :
In particular, for and :
where . The same inequality holds for . For , use the CauchySchwarz inequality to obtain:
and the same inequality holds for .∎
Lemma 2.2.
For , define where is a Gaussian random variable with mean and variance . Then:
Proof.
Developing the square yields
Then the two expectations can be explicited as follows
Finally, the use of Mill’s ratio inequality concludes the proof. ∎
Proposition 2.1.
The following bounds hold:
where , and , , is the solution of the following linear constrained BSDE:
Proof.
Define , , , and . First
Then
Hence
Then
Using Jensen’s inequality and Young’s inequality with parameter , :
Now, for any , one can group together the terms in and using Jensen’s inequality:
where, as in Lemma 2.1, . Hence, using that for any random variables and ,
the following holds:
By induction
where, as in Lemma 2.1, . Finally, take to obtain the desired bound. ∎
2.2 Projections
In its current form, the scheme (2.4) is not readily implementable, because its conditional expectations cannot be computed in general. Therefore, there is a need to approximate these conditional expectations. For handiness and efficiency, we choose, in the spirit of [10] and [6], to approximate them by empirical leastsquares regression.
First, we will study the impact of the replacement of the conditional expectations by theoretical leastsquares regressions. We will see that the resulting scheme is not easy to analyze. Therefore, we will study a stronger version of it, and discuss their practical differences. As it is already a daunting task for standard BSDEs (cf. [10]), and in view of the difficulties already raised at theoretical regression level, we leave the study of the final replacement of these regressions by their empirical counterparts for further research.
Hence, for each , consider and that are nonempty closed convex subsets of , as well as the corresponding projection operators and . Using the above projection operators in lieu of the conditional expectations in (2.4), we obtain the following approximation scheme:
(2.10) 
where and are truncation operators that ensure that the a.s. upper bounds for from Lemma 2.1 will also hold for .
To be more specific, choose the subsets and as follows:
where , , and , , are predefined sets of deterministic functions from into . Hence, for any random variable in , is defined as follows:
(2.11)  