Mean Field Production Output Control with Sticky Prices: Nash and Social Solutions
This paper presents an application of mean field control to dynamic production optimization. Both noncooperative and cooperative solutions are considered. We first introduce a market of a large number of agents (firms) with sticky prices and adjustment costs. By solving auxiliary limiting optimal control problems subject to consistent mean field approximations, two sets of decentralized strategies are obtained and further shown to asymptotically attain Nash equilibria and social optima, respectively. The performance estimate of the social optimum strategies exploits a passivity property of the underlying model. A numerical example is given to compare market prices, firms’ outputs and costs under the two solution frameworks.
Bingchang Wang, Minyi Huang
School of Control Science and Engineering, Shandong University, Jinan 250061, P. R. China
School of Mathematics and Statistics, Carleton University, Ottawa, ON K1S 5B6, Canada
Key words: Mean field game; social optimum; Nash equilibrium; production output adjustment; sticky price.
Mean field game theory is effective for designing decentralized strategies in a system of many players that are individually negligible but collectively affect a particular player (see e.g., , , , ). By identifying a consistency relationship between the individual’s best response and the mass (population macroscopic) behavior, one may obtain a fixed-point equation to specify the mean field. This procedure leads to a set of decentralized strategies as an $\varepsilon$-Nash equilibrium for the actual model with a large but finite population. By now, mean field games have been intensively studied in the LQG (linear-quadratic-Gaussian) framework , , , , ; there is also a large body of work on nonlinear models , , , . For further literature, readers are referred to , ,  for mean field models with a major player,  for oblivious equilibria proposed for large-scale Markov decision processes of industry dynamics, and  for mean field games with Markov jump parameters. For a survey on mean field game theory, see , , and . Besides noncooperative games, social optima in mean field control have been investigated in , . Mean field games and control have found wide applications, including smart grids , , , finance, economics , , , , operations research , , , , and social sciences , , , etc.
This paper presents an application of mean field control to production output adjustment in a large market with many firms and sticky prices. Under the stickiness assumption, the price of the underlying product does not adjust instantaneously according to its demand function, but evolves slowly and smoothly. Dynamic game models for duopolistic competition with sticky prices were initially proposed by Simaan and Takayama , and then extended to investigate asymptotically stable steady-state equilibrium prices in . In , , the authors considered open-loop and closed-loop Nash equilibria for dynamic oligopoly with $N$ firms and compared the behavior of prices at and away from the steady-state levels, respectively. Adjustment costs in production models have been addressed in the economic literature (see e.g. ) and they have been taken into account in the study of dynamic oligopoly , , . The work  introduces a duopoly where each firm has an output level subject to control according to first-order integrator dynamics. However, when the number of firms is large (e.g. in a perfectly competitive market) and the adjustment cost is considered, the computational complexity of output adjustment is high. The mean field control framework addresses this complexity issue effectively.
Within our model, a large number of producers supply a certain product with sticky prices, and the output adjustment incurs a cost. The cost function of a firm is based on product cost, price, and adjustment cost. In , we combined the price and the firm’s output as a 2-dimensional system. As a result, the cost function there has indefinite state weights, which differs from many existing LQG models of mean field games in the literature , . In this paper, the price in the mean field limit model is taken as an exogenous signal without the need for state space augmentation. This helps to derive a simple condition that ensures the solvability of the resulting equation system.
The Nash equilibrium and the social optimum are two fundamental solution notions for competitive markets with many firms, where the former applies to the noncooperative model, and the latter to the cooperative model. In this paper, we design Nash and social optimum strategies for the production output control model based on the mean field control methodology, and further compare the two solutions numerically. The Nash solution of our model starts by solving a limiting optimal control problem and then applies the consistency requirement for the mean field approximation. We then obtain a set of decentralized strategies and show that the set of strategies is an $\varepsilon$-Nash equilibrium. For the social optimum solution, we first provide an auxiliary optimal control problem by a person-by-person optimality approach, and then design a set of decentralized strategies by solving the limiting auxiliary problem subject to consistent mean field approximations. The set of strategies is shown to be asymptotically socially optimal by exploiting a passivity property of the underlying model.
An illustrative numerical example is given to compare market prices, firms’ outputs and optimal costs under the game and social optimum frameworks. It is numerically shown that the social optimum has a lower average output level than that in the noncooperative case. This is similar to the behavior in a duopoly model  where cooperation of the two players results in a lower total output than in the Cournot equilibrium.
The paper is organized as follows. Section II introduces the game and social optimum problems with $N$ players. In Section III, we first design a set of decentralized strategies by the mean field control methodology and then show its asymptotic Nash equilibrium property. In Section IV, we construct a set of decentralized strategies, which is shown to be asymptotically socially optimal. In Section V, a comparison of the two solutions is demonstrated by a numerical example. Section VI concludes the paper.
Notation: $\|\cdot\|$ denotes the Euclidean vector norm or matrix spectral norm. For a matrix , denotes the determinant of . denotes the class of -dimensional continuous functions on ; is the class of bounded and continuous functions. For a family of -valued random variables , is the $\sigma$-algebra generated by the collection of random variables; ; . For two sequences and , denotes , and denotes . For convenience of presentation, we use to denote generic positive constants, which may vary from place to place.
2 Problem Description
2.1 Dynamic oligopoly with sticky prices
Dynamic game models for oligopolistic competition with sticky prices were initially proposed by Simaan and Takayama , and then further investigated in , , . According to the model in , , the sticky price evolves by
where is the output of firm , , and has the role of control. The payoff function of firm is described by
The constants and are positive, and is the cost of unit output.
2.2 Output adjustment in a mean field framework
where denotes the speed of adjustment to the level on the demand function, and is the average of firms’ outputs. The output of each firm is described by the stochastic differential equation (SDE)
where are independent standard Brownian motions, which are also independent of initial outputs of all firms . The constants , and are positive.
Adjustment costs in production models have been addressed in the economic literature (see e.g. ) and they have been taken into account in the study of dynamic oligopoly , , , . The work  introduces a duopoly where each firm has output level subject to control according to a first order integrator dynamics. In the resulting differential game, the instantaneous payoff of each firm is determined from its net profit minus quadratic penalty terms of and
As in , is the price on the demand function for the given level of firms’ outputs. In the static case, the inverse demand function has a linear version ; here for simplicity we set as 1. The scaling factor for is standard in modelling and analysis of large markets, and some closely related price modelling in a large dynamic market can be found in , . is used to indicate friction in adjusting the output, and is random shocks in output.
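To make the verbal description of the market concrete, the following is a minimal simulation sketch. It assumes one plausible form of the dynamics that matches the description above: a price that relaxes toward the level on a linear demand function, and outputs driven by friction, control and Brownian shocks. The symbols `alpha`, `beta`, `mu`, `b`, `sigma`, the initial-output distribution and the function name `simulate_market` are all assumptions made for illustration, not the paper's exact notation or equations.

```python
import numpy as np

def simulate_market(n_firms=200, horizon=10.0, dt=0.01, alpha=1.0, beta=10.0,
                    mu=0.5, b=1.0, sigma=0.2, p0=2.0, control=None, seed=0):
    """Euler-Maruyama simulation of an ASSUMED sticky-price market.

    Assumed dynamics (illustrative, not quoted from the paper):
        dp   = alpha * (beta - p - mean(q)) dt          # sticky price
        dq_i = (-mu * q_i + b * u_i) dt + sigma dW_i    # output of firm i
    """
    rng = np.random.default_rng(seed)
    steps = int(round(horizon / dt))
    if control is None:
        # Default: no output adjustment effort at all.
        control = lambda t, price, q: np.zeros_like(q)
    p = np.empty(steps + 1)
    q_mean = np.empty(steps + 1)
    q = rng.uniform(0.5, 1.5, size=n_firms)  # hypothetical initial outputs
    p[0], q_mean[0] = p0, q.mean()
    for k in range(steps):
        u = control(k * dt, p[k], q)
        dw = rng.normal(0.0, np.sqrt(dt), size=n_firms)
        # The price moves toward the demand-function level beta - mean(q).
        p[k + 1] = p[k] + alpha * (beta - p[k] - q.mean()) * dt
        # Each output has friction (-mu*q), control (b*u) and random shocks.
        q = q + (-mu * q + b * u) * dt + sigma * dw
        q_mean[k + 1] = q.mean()
    return p, q_mean
```

For instance, `simulate_market(control=lambda t, price, q: 0.5 * np.ones_like(q))` applies the same constant adjustment effort to every firm. With no control and no noise, outputs decay under friction and the price relaxes toward `beta`, which gives a simple sanity check on the assumed dynamics.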
The cost function of each firm is given by
and . Here, denotes the production cost, and denotes the adjustment cost. The minimization of is equivalent to maximizing the payoff
We only consider the case to ensure that the subsequent optimization problems are of practical interest. Otherwise, given a positive , the production cost already exceeds the price, and the optimization problem is not meaningful.
The social cost is defined as
Based on costs (3) and (4), one may formulate a standard LQG game and an optimal control problem, respectively. A limitation of this approach is that the control strategy will be centralized. Our goal is to look for decentralized strategies for the corresponding optimization criterion.
The basic objective of this paper is to seek Nash solutions and social solutions to mean field production output control with sticky prices. Specifically, we study the following two problems:
Problem I: Find $\varepsilon$-Nash equilibrium strategies for agents to minimize the individual cost over the set of decentralized strategies
where , , .
Problem II: Find asymptotic social optimum strategies for agents to minimize over the set of decentralized strategies , .
For a large market, a natural way of modeling the sequence of parameters is to view them as being sampled from a space such that this sequence exhibits certain statistical properties as $N\to\infty$. Define the associated empirical distribution function , where if and otherwise.
We introduce the following assumptions.
A1) The initial price is a constant. The initial outputs of all firms are independent. for all ; there exists independent of such that .
A2) There exists a distribution function such that the empirical distribution converges weakly to , where . Furthermore, each and .
A3) For all , is contained in a fixed compact set , and .
3 Nash Solutions to Output Adjustment
3.1 Optimal control for the limiting problem
Assume that is given for approximation of . Replacing in (1) by , we introduce
Accordingly, by replacing in (4) with we define the cost function:
The corresponding admissible control set is .
We introduce the HJB equation:
where . Let . Then the optimal control law is
is the unique solution to the algebraic Riccati equation (11) such that .
Proof. By solving (11), we have or . If , . Otherwise, when , .
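The root-selection argument in this proof can be illustrated with a generic scalar discounted LQ problem. The sketch below does not reproduce equation (11). It assumes hypothetical dynamics dx = (a x + b u) dt with running cost q x^2 + r u^2 and discount rate rho, for which the algebraic Riccati equation is rho k = q + 2 a k - (b^2/r) k^2. It computes both roots and keeps the one satisfying the discounted stability condition a - (b^2/r) k - rho/2 < 0. All coefficient names are assumptions.

```python
import math

def discounted_riccati_roots(a, b, q, r, rho):
    """Both roots of rho*k = q + 2*a*k - (b**2/r)*k**2 (requires q >= 0, r > 0)."""
    g = b * b / r            # coefficient of k**2
    h = 2.0 * a - rho        # coefficient of k
    disc = math.sqrt(h * h + 4.0 * g * q)
    return (h - disc) / (2.0 * g), (h + disc) / (2.0 * g)

def stabilizing_root(a, b, q, r, rho):
    """Return the root k with a - (b**2/r)*k - rho/2 < 0, or None if no root qualifies."""
    for k in discounted_riccati_roots(a, b, q, r, rho):
        if a - (b * b / r) * k - rho / 2.0 < 0.0:
            return k
    return None

# Hypothetical coefficients: friction a = -0.5, unit weights, small discount.
k = stabilizing_root(a=-0.5, b=1.0, q=1.0, r=1.0, rho=0.1)
feedback_gain = -1.0 * k / 1.0   # u = -(b/r) * k * x for this generic problem
```

As in the proof, exactly one of the two roots passes the stability test whenever the discriminant is positive, so checking the inequality on each root is enough to single out the solution.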
The inequality in Lemma 1 specifies a stability condition for the closed-loop system which must be satisfied by the solution of .
Proof. Note that by (6), implies . We can prove parts 1) and 3) by showing that and are uniquely determined from the fact and , respectively (see e.g., , ). To show part 2) we first obtain an a priori integral estimate of (see (14)) and then use the completion of squares technique (see e.g., , ). By Lemma A.1, implies , which further gives that is well defined to be finite since . By Proposition A.1, leads to which further implies
3.2 Control synthesis and analysis
In the above, is a continuum parameter. is regarded as the expectation of the state given the parameter in the individual dynamics. The last equation is due to the consistency requirement for the mean field approximation. and is to be determined. For further analysis, we make the following assumption.
then A4) holds.
where It can be verified that is a map from the Banach space to itself. For any ,
It follows that is a contraction and hence has a unique fixed point .
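A contraction argument of this kind also suggests a numerical scheme: iterate the map until successive iterates agree. The sketch below is a generic Picard iteration on a time grid. The map `gamma` is a hypothetical stand-in (a forcing term plus a discounted convolution with gain `lam` < 1), chosen only because it is a sup-norm contraction; it is not the map defined in the paper.

```python
import numpy as np

def picard_fixed_point(gamma, x0, tol=1e-10, max_iter=1000):
    """Iterate x <- gamma(x) until the sup-norm change falls below tol."""
    x = x0
    for n in range(1, max_iter + 1):
        x_next = gamma(x)
        if np.max(np.abs(x_next - x)) < tol:
            return x_next, n
        x = x_next
    raise RuntimeError("fixed-point iteration did not converge")

# Hypothetical contraction: (gamma x)(t) = g(t) + lam * int_0^t e^{-(t-s)} x(s) ds,
# discretized with a left Riemann sum on a uniform grid.
t = np.linspace(0.0, 10.0, 501)
dt = t[1] - t[0]
lam = 0.5  # assumed gain; the row sums of `kernel` stay below lam
kernel = np.tril(lam * np.exp(-(t[:, None] - t[None, :])) * dt, k=-1)
g = np.exp(-t)
gamma = lambda x: g + kernel @ x

x_star, n_iter = picard_fixed_point(gamma, np.zeros_like(t))
```

Because the Lipschitz constant is below one, the error shrinks geometrically, so the number of iterations grows only logarithmically in the requested tolerance.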
3.2.1 The case of uniform agents
By direct computations, we have
and the equation has the solution
In what follows, we use Routh’s stability criterion  to determine the number of roots of with negative real parts. The first column of the Routh array for is It can be verified that the first column of the Routh array always has a sign change. By Routh’s stability criterion, (20) has a root with a positive real part, and two roots with negative real parts.
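The Routh test for a cubic can be checked numerically. The sketch below builds the first column of the Routh array for a generic cubic a3 s^3 + a2 s^2 + a1 s + a0 and counts sign changes, which equals the number of roots with positive real parts. The coefficients used are a hypothetical example, (s - 1)(s + 2)(s + 3), not those of equation (20). The degenerate cases with a zero in the first column are not handled.

```python
import numpy as np

def routh_first_column_cubic(a3, a2, a1, a0):
    """First column of the Routh array for a3*s^3 + a2*s^2 + a1*s + a0."""
    if a2 == 0:
        raise ValueError("zero pivot: the special-case Routh procedure is needed")
    column = [a3, a2, (a2 * a1 - a3 * a0) / a2, a0]
    if any(v == 0 for v in column):
        raise ValueError("zero entry in first column: special case not handled")
    return column

def count_sign_changes(column):
    """Number of sign changes between consecutive (nonzero) entries."""
    return sum(1 for u, v in zip(column, column[1:]) if u * v < 0)

coeffs = (1.0, 4.0, 1.0, -6.0)  # hypothetical cubic (s - 1)(s + 2)(s + 3)
column = routh_first_column_cubic(*coeffs)
rhp_by_routh = count_sign_changes(column)
# Cross-check against the numerically computed roots.
rhp_by_roots = int(np.sum(np.roots(coeffs).real > 0))
```

For this example the first column has a single sign change, matching the pattern described above of one root with a positive real part and two with negative real parts.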
Let be two roots of (20) with negative real parts, and be the corresponding (generalized) complex eigenvectors. Let . The solution to equation (19) given by is in if and only if there exist constants such that . Indeed, suppose
where is a root of (20) with a positive real part and is the corresponding complex eigenvector. The solution
is in if and only if , where are polynomials of .
Denote , and , where . Then we have
Note that is given. There exists a unique solution to (23) if and only if and are linearly independent.
From the analysis above, we have the following result.
(19) admits a unique solution such that and are in if and only if and are linearly independent. In this case, A4) holds.
Take parameters as . Let , . In this case, has only two eigenvalues with negative real parts and . The corresponding eigenvectors are and , respectively. By (23), we have and . Then (19) admits a unique solution in . However, . The parameters in this example satisfy the condition of Proposition 2, but not of Proposition 1.
3.3 $\varepsilon$-Nash equilibrium
Consider the system of $N$ firms. Let the control strategy of firm be given by
Proof. By (26), it follows that
From this together with (25), we have
By and elementary linear SDE estimates, we have
By solving this linear SDE and using the fact that is Hurwitz, we can show
which leads to (28).
By the above theorem, we can obtain the next corollary.
We are now in a position to show an asymptotic Nash equilibrium property. Denote
Proof. See Appendix A.
4 Social Solutions to Output Adjustment
We first construct an auxiliary optimal control problem by examining the social cost variation due to the control perturbation of a single agent. Then, by mean field approximations we design a set of decentralized strategies which is shown to have asymptotic social optimality.