HAL Id: hal-02499527
https://hal.archives-ouvertes.fr/hal-02499527
Preprint submitted on 5 Mar 2020
HAL is a multi-disciplinary open access
archive for the deposit and dissemination of
sci-entific research documents, whether they are
pub-lished or not. The documents may come from
teaching and research institutions in France or
abroad, or from public or private research centers.
L’archive ouverte pluridisciplinaire HAL, est
destinée au dépôt et à la diffusion de documents
scientifiques de niveau recherche, publiés ou non,
émanant des établissements d’enseignement et de
recherche français ou étrangers, des laboratoires
publics ou privés.
COMPARING THE BEST REPLY STRATEGY AND
MEAN FIELD GAMES: THE STATIONARY CASE
Matt Barker, Pierre Degond, Marie-Thérèse Wolfram
To cite this version:
STATIONARY CASE
MATT BARKER
Department of Mathematics, Imperial College London, London, SW7 2AZ, UK and Grantham Institute, Imperial College London, London, SW7 2AZ, UK
Email: [email protected]
PIERRE DEGOND
Department of Mathematics, Imperial College London, London, SW7 2AZ, UK Email: [email protected]
MARIE-THERESE WOLFRAM
University of Warwick, Mathematics Institute, Gibbet Hill Road, CV47AL Coventry, UK and Radon Institute for Computational and Applied Mathematics, Altenbergerstr. 69, 4040 Linz, Austria
Email: [email protected]
Acknowledgements
MB acknowledges support by the Natural Environment Research Council (NERC) under training grant no. NE/L002515/1. PD acknowledges support by the Engineering and Physical Sciences Research Council (EPSRC) under grants no. EP/M006883/1 and EP/N014529/1, by the Royal Society and the Wolfson Foundation through a Royal Society Wolfson Research Merit Award no. WM130048 and by the National Science Foundation (NSF) under grant no. RNMS11-07444 (KI-Net). PD is on leave from CNRS, Institut de Math´ematiques de Toulouse, France. MTW acknowledges partial support from the Austrian Academy of Sciences under the New Frontier’s grant NST-001 and the EPSRC under the First Grant EP/P01240X/1.
AMS subject classification 35Q84, 35Q91, 35J15, 35J57, 91A13, 91A23, 49N70, 34C60, 37M05
Key words
Mean field games, best reply strategy, stationary Fokker-Planck equation
1
Data statement: no new data were collected in the course of this research. 1. Abstract
Mean field games (MFGs) and the best reply strategy (BRS) are two methods of describing competitive optimisation of systems of interacting agents. The latter can be interpreted as an approximation of the respective MFG system, see [1, 3, 18]. In this paper we present a systematic analysis and comparison of the two approaches in the stationary case. We provide novel existence and uniqueness results for the stationary boundary value problems related to the MFG and BRS formulations, and we present an analytical and numerical comparison of the two paradigms in a variety of modelling situations.
2. Introduction
Mean field games (MFGs) describe the dynamics of large interacting agent systems, in which individuals determine their optimal strategy by minimising a given cost functional. The extensive current literature is based on the original work of Lasry and Lions [34, 35, 36] and Huang, Caines and Malham´e [31, 32, 33]. MFGs have been used successfully in many different disciplines. A good overview is presented by Caines, Huang and Malham´e in [6], a detailed probabilistic approach by Carmona and Delarue in [9, 10].
MFGs can be formulated as parabolic optimal control problems (under certain conditions on the cost). This connection can be used to construct approximations to MFGs. Degond, Liu and Ringhofer proposed a so-called best reply strategy (BRS) in [19]. It can be derived from the corresponding explicit in time discretisation of the respective optimal control problem, as in [1, 18]. More recently it has been derived in [3] through considering a discounted optimal control problem and taking the discount factor to ∞. Specifically, in [1, 18] the limit ∆t → 0 in the case of the following cost functional
J∆t(α; m) = E " Z t+∆t t α2 s 2 + 1 ∆th(Xs, m(Xs)) ds # , is analysed. In [3] the limiting behavior of MFG systems for cost functionals
Jρ(α; m) = E " Z T 0 α2 s 2 + h(Xs, m(Xs)) e−ρs ds # ,
as the temporal discount factor ρ tends to infinity is considered. In both cases the resulting dynamics depend instantaneously on the cost function h. Hence agents do not anticipate future dynamics in the respective limits, as they do in MFG approaches.
As far as the authors are aware, no systematic analysis and comparison of the two approaches has been done yet. However, the BRS is computationally less expensive and therefore more attractive in applications. Therefore it is important to understand under which circumstances the use of each model is appropriate and whether there are situations where the two models are comparable. This paper is a first step analysing the similarities and differences between the two models in a systematic way.
The existence and uniqueness of solutions to stationary MFGs has been studied extensively in previous literature ( c.f. [7, 23, 25, 36]). However, apart from a small number of papers e.g. [16, 24], almost all results focus on problems posed on the torus in order avoid dealing with boundary conditions. In [2] the Dirichlet problem was motivated as a stopping time problem, and it was analysed in [24]. In this paper we consider Neumann boundary conditions, which relate to a no-flux boundary. The only other paper we are aware of that deals with such a situation is [16]. In this paper the authors prove existence of solutions to the MFG problem with non-local dependence on the distribution using a Schauder fixed point argument. They then perturb the solutions to prove existence in the case of a local dependence on the distribution. Other typical methods of proof use continuation methods [21, 23, 26], Schauder’s fixed point theorem [5, 17] or variational approaches through energy minimisation problems [14, 20]. In our proof we exploit the linear-quadratic nature of the control. This was done in the time-dependent case in [30] where the problem was reduced to a forward-backward system of heat equations, but we don’t think our method has been considered in the stationary case. Our result sits nicely alongside the only other result for Neumann boundary conditions [16]. On the one hand the Hamiltonian used in [16] is more general than ours, however the regularity assumptions and the form of nonlinearity h required in [16] is relaxed in our case.
Due to assumption (A2), which states that the running cost h is an increasing function of density,
studied extensively by Gomes and collaborators in a number of papers e.g [20, 25, 28]. Although the setting in these papers focusses on domains with periodic boundary conditions, it is worth mentioning the types of techniques used and how they compare to the method in this paper. In [25] a Hopf–Cole transformation is used to prove existence and uniqueness of minimisers of an energy functional related to a specific case of an MFG with periodic boundary conditions and a cost h that is logarithmic in the density. The concepts used in our existence and uniqueness proof are similar to those used in [25], though we are able to generalise the density dependence and consider Neumann boundary value problems. In [26] the results of [25] are extended using a continuation method. There were further improved in [23] where a combination of a continuation method and Minty’s method is used. In both cases the methods allow the authors to perturb a problem for which existence and uniqueness is known to prove existence and uniqueness of the problem of interest. The methods used there and in many subsequent works (e.g. [24]) rely on monotonicity properties of the operators. In our work presented here monotonicity similarly plays a central role in proving existence and uniqueness — through both the use of the maximum principle to prove existence and uniqueness for strictly increasing functions h and through the ability to uniformly perturb an increasing function into a strictly increasing function through the addition of a logarithmic congestion term.
Our non-linear stationary BRS model (7) is an example of a stationary non-linear Fokker-Planck equation. Existence and uniqueness of solutions to non-linear Fokker-Planck equations have been studied extensively (see for example [11, 13] and references therein). Many results (e.g. in [11, 15, 40]) focus on non-local non-linear terms i.e. they consider Fokker-Planck equations of the form
(1) −σ
2
2 ∇
2m − ∇ · (m∇W ∗ m) = 0 ,
with suitable boundary conditions. Here ∇W ∗ m = R
Ω∇W (x − y)m(y) dy is the usual convolution
operator. For our model, we consider a local function of density h = h(x, m(x)), rather than a convolution term. In this case there are a number of results of existence and uniqueness of solutions to the stationary model, as well as convergence of the dynamic model to the stationary version, see e.g. [4, 12, 13, 37]. These papers all consider the term h to be either independent of x i.e. h = h(m), or of the form h = h1(x) + h2(m). So our result extends this case to more general local functions of the density. In
previous literature the proof for the local case, as in [37], relies on a related energy functional, for which minimisers can be proven to exist and be unique. Then these minimisers are also solutions to the Fokker-Planck equation. Our result takes a different approach, one that is more closely related to the case with a convolution term, as in [11]. For the non-local case (1) it has been frequently shown (c.f. [11, 39]) that solutions of the PDE are equivalent to fixed points of a non-linear map
T (m) = 1 Ze − 2 σ2W ∗m, where Z = Z Ω e−σ22W ∗mdx .
We approach existence and uniqueness of solutions to our PDE (7) in a similar vein, considering solutions to the implicit equation (10). While the proof in [11] relies on Schauder’s fixed point theorem, we are able to take advantage of h being a local function of density so instead we use the implicit function theorem and intermediate value theorem to prove our result.
This paper is organised as follows. We start by briefly introducing the time-dependent MFG and BRS models in Section 2.1, following a more detailed derivation presented in [1, 18]. We then describe how the dynamic problems relate to the stationary case. In Section 3 we present a proof of existence and uniqueness for the stationary BRS and MFG. Both proofs use similar arguments for proving existence and uniqueness, relying on the observation that both models involve a stationary Fokker-Planck equation with integral constraints. In Section 4 we describe an explicit solution to the MFG and BRS model. The explicit solution allows us to analyse in which problem specific parameter ranges the solutions are compareable and in which not. In Section 5 we illustrate the different behavior of solutions to both models with numerical simulations. Finally, in Section 6 we conclude with summarising the implications of the results found, specifically what they tell us about the similarities and differences between the two models. We also briefly comment on future directions for research.
Consider a distribution of agents which is absolutely continuous with respect to the Lebesgue measure. We denote the density of the distribution by a function m : Td→ [0, ∞). We take a representative agent, with state Xt∈ Td moving in this distribution according to the following SDE:
dXt= αtdt + σdBt
L(X0) = m0,
where L(X0) denotes the law of the random variable X0, m0 is a given initial distribution of all agents,
σ ∈ (0, ∞) denotes the size of idiosyncratic noise in the model and Btis a d-dimensional Wiener process.
The function αt: [0, T ] → Tdis a control chosen by the representative agent as a result of an optimisation
problem, from a set of admissable controls α ∈ A. The representative agent takes the distribution of other agents, m, to be given and attempts to optimise the following functional
J (α; m) = E " Z T 0 α2 s 2 + h(Xs, m(Xs)) ds # .
The functional J consists of a quadratic cost for the control αt and a density dependent cost function
h : Td×(0, ∞) → R over a finite time horizon T ∈ (0, ∞). Then the optimal cost trajectory u(x, t) is
u(x, t) = inf α∈AE " Z T t α2 s 2 + h(Xs, m(Xs)) ds Xt= x # .
The optimal control is given in terms of u by α∗t = −∇u(Xt, t). The optimal cost trajectory evolves
backwards in time according to
∂tu = |∇u|2 2 − h(x, m) − σ2 2 ∇ 2u u(x, T ) = 0 .
We complete the model by assuming all agents act in the same way as the representative agent, and so the backward PDE is coupled to a forward Fokker-Planck PDE describing the evolution of agents. So the full MFG model is given by
∂tu = |∇u|2 2 − h(x, m) − σ2 2 ∇ 2u (2a) ∂tm = ∇ · [m∇u] + σ2 2 ∇ 2m (2b) m(x, 0) = m0 (2c) u(x, T ) = 0 . (2d)
Following the approach in [1, 18], the corresponding BRS model arises through considering a rescaled cost functional over a short, rolling time horizon
J∆t(α; m) = E " Z t+∆t t α2 s 2 + 1 ∆th(Xs, m(Xs)) ds Xt= x # .
Going through a similar procedure to the MFG problem, approximating the result up to O(∆t) and taking the limit ∆t → 0, the optimal control is given by αt= − [∇h(x, m(x))]|x=Xt. Again we complete
the model by assuming all agents act in the same way as the representative agent. Then the distribution of agents evolves according to the Fokker-Planck equation
∂tm = ∇ · [(∇h(x, m(x))) m] + σ2 2 ∇ 2m (3a) m(x, 0) = m0. (3b)
In (3) the dynamics of agents is influenced by the current agent density only. Hence the anticipation behavior, which is characteristic for MFG, is ‘lost’. Only the current state drives the dynamics. We shall refer to equation (3) as the BRS strategy in the following.
2.2. From the dynamic problems to the stationary case. For the MFG the interpretation of the stationary problem is slightly subtle because in the dynamic case we are considering a problem set on a fixed time horizon so we cannot simply consider the stationary problem by setting ∂tu, ∂tm = 0
horizon T we use the notation ¯uT, ¯mT for solutions satisfying (2). Then we define the rescaled functions uT and mT by
uT(x, t) = ¯uT(x, tT ) , mT(x, t) = ¯mT(x, tT ) .
Then Theorem 1.2 in [8] states that under some mild assumptions on the data we have that as T → ∞: uT − Z Td uT dy → u , in L2(Td×(0, 1)) 1 Tu T → (1 − t)λ , in L2 (Td×(0, 1)) mT → m , in Lp(Td×(0, 1)) ,
where p depends on the space dimension d. Then the triple (m, u, λ) ∈ C2(Td) × C2(Td) × R satisfies the following stationary problem
−σ 2 2 ∇ 2m − ∇ · (m∇u) = 0 (4a) −σ 2 2 ∇ 2u + |∇u| 2 2 − h(x, m) + λ = 0 (4b) Z Td m dx = 1 (4c) Z Td u dx = 0 . (4d)
The corresponding stationary BRS model is obtained by setting ∂tm = 0 in (3). Hence we have
∇ · [(∇h(x, m(x))) m] +σ 2 2 ∇ 2m = 0 (5a) Z Td m dx = 1 . (5b)
Equation (5) can be understood as either the long-time behaviour of the dynamic BRS or, under suit-able convexity conditions (c.f. [37]), a competitive equilibrium distribution of the following minimisation problem
min E [h(Xt, m(Xt))]
(6)
By competitive equilibrium we mean a stationary distribution m for which E [h(X, m(X))] is minimised when L(X) = m.
3. Existence and Uniqueness of Stationary Solutions
In this section we will show that the MFG (4) and the BRS (5) admit unique solutions on a bounded domain Ω with no flux boundary conditions. We make the following assumptions on the function h(x, m) : Ω × (0, ∞) → R and the domain Ω:
(A1) Ω ⊂ Rd is an open bounded set with a C2,α boundary, for some α ∈ (0, 1) and d ≥ 1.
(A2) h(x, ·) is an increasing function for every x ∈ Ω.
(A3) There exists a continuous function g : (0, ∞) → [0, ∞) such that supx∈Ω|h(x, m)| ≤ g(m) for
every m ∈ (0, ∞).
Since Ω is bounded, we can now define |Ω| = R
Ωdx, where this integral is with respect to the standard
Lebesgue measure. Furthermore, we denote the unit outer normal vector by ν. For the BRS we further assume:
(BRS1) h ∈ C2(Ω × (0, ∞)) ∩ C1 Ω × (0, ∞).¯
(BRS2) There exists a continuous function f : (0, ∞) → [0, ∞) such that supx∈Ω|∇xh(x, m)| ≤ f (m) for
every m ∈ (0, ∞).
While for the MFG we will assume: (MFG1) h ∈ C (Ω × (0, ∞))
(MFG2) limm→0supx∈Ωh(x, m) < infx∈Ωh
x, 1
|Ω|
(MFG3) supx∈Ωh
x,|Ω|1 < limm→∞infx∈Ωh(x, m).
Discussion of assumptions: Since we are interested in classical solutions (generally in C2(Ω)∩C1 Ω)¯
the above assumptions on the cost and domain ensure sufficient regularity and boundedness of h. It is worth mentioning why we need h to be increasing. This assumption of an increasing function can be re-lated to “crowd aversion”. When h is increasing in m then areas of high density are more highly penalised than low density areas in the optimisation problem related to the MFG and BRS (see Section 2.1). This prevents “accumulation points” occurring where higher density is preferable and a Dirac delta might be introduced into the solution. As well as being a problem for regularity, the position of the Dirac deltas would be sensitive on the data and so uniqueness could not be guaranteed.
Assumptions (MFG2) and (MFG3) are also less intuitive than the rest of the assumptions. The MFG
problem has two integral constraints related to it. We prove that these constraints can be satisfied using the intermediate value theorem. In doing so we show that two functions m1, m2exist such that m1(x) ≤
1
|Ω| ≤ m2(x) for every x ∈ Ω. The existence of these functions is guaranteed if assumptions (MFG2)
and (MFG3) hold. As it may not be initially clear what kind of function h satisfies our requirements,
some sufficient conditions if h(x, m) = h1(x) + h2(m) are:
• h1∈ C2(Ω) ∩ C1 Ω¯ • h2∈ C2((0, ∞)) • h2 is increasing • limm→0h2(m) < h2 1 |Ω| + infx∈Ωh1(x) • limm→∞h2(m) > h2 1 |Ω| + supx∈Ωh1(x)
3.1. Best Reply Strategy. We start by defining the notion of classical solutions we are aiming for. Definition 3.1. Let assumptions (A1)–(A3) and (BRS1)–(BRS2) be satisfied. Then the stationary BRS
boundary value problem is to find a function m : Ω → (0, ∞) satisfying m ∈ C2(Ω) ∩ C1 Ω¯ (7a) −σ 2 2 ∇ 2m − ∇ · (m∇[h(x, m)]) = 0 , x ∈ Ω (7b) −σ 2 2 ∇m · ν − m∇[h(x, m)] · ν = 0 , x ∈ ∂Ω (7c) Z Ω m dx = 1 . (7d)
Throughout this subsection we will assume (A1)–(A3) and (BRS1)–(BRS2) hold.
Lemma 3.2. For any Z ∈ (0, ∞) there exists a unique mZ : Ω → (0, ∞) such that
(8) mZ(x) = 1 Ze −2 σ2h(x,mZ(x)). Furthermore, mZ ∈ C2(Ω) ∩ C1 Ω.¯
Proof. Fix Z ∈ (0, ∞) and x ∈ Ω. Consider GZ,x: (0, ∞) → R given by
GZ,x(m) =
1 Ze
−2
σ2h(x,m)− m .
This is a strictly decreasing function of m. Furthermore, since h is increasing and continuous with respect to m and supx∈Ωh(x, m) ≤ g(m), we must have limm→0supx∈Ωh(x, m) ≤ supx∈Ωh(x, 1) ≤ g(1) < ∞.
So we get the following limit inequality as m → 0, which holds uniformly in x: lim
m→0GZ,x(m) > 0 .
So there exists some > 0, independent of x, such that GZ,x() > 0. Furthermore, after defining a
constant C = infx∈Ωh(x, ), it is clear that
Therefore by the intermediate value theorem and strict monotonicity of GZ,x there exists a unique
m = mZ(x) > 0 such that GZ,x(mZ(x)) = 0. Hence the first result follows. In order to show the regularity
requirement that mZ ∈ C2(Ω) ∩ C1 Ω, we need to show¯
(1) mZ ∈ C2(Ω).
(2) For any x ∈ ∂Ω, limy→x, y∈ΩmZ(y) exists.
(3) For any x ∈ ∂Ω, limy→x, y∈Ω∇mZ(y) exists.
The assertion that mZ ∈ C2(Ω) follows from the implicit function theorem. For the implicit function
theorem to hold we require that GZ,x(m) is a C2 function with respect to x and m at (x, mZ(x)) and
that G0Z,x(mZ(x)) 6= 0. The first requirement is true from our assumption that h ∈ C2(Ω × (0, ∞)), the
second requirement is true since G0Z,x(m) = −
2
σ2Z∂mh(x, m)e − 2
σ2h(x,m)− 1 ≤ −1 < 0 .
To prove that limy→x, y∈Ωm(y) exists for every x ∈ ∂Ω it is enough to show mZ is uniformly Lipschitz
in Ω. Since mZ ∈ C2(Ω) it is therefore enough to show k∇mZk∞< ∞. Note that C, defined above are
independent of x, so we must have ≤ kmZk∞≤ Z1e −2
σ2C+ < ∞. Then by differentiating the implicit
formula for mZ we get
(9) ∇mZ=
−2mZ∇xh(x, mZ)
σ2+ 2m
Z∂mh(x, mZ)
.
But ∂mh ≥ 0 since h is increasing, also mZ is uniformly bounded as seen above. Similarly, using f from
assumption (BRS2) we find k∇xh(·, mZ(·))k∞< ∞, hence k∇mZk∞< ∞.
To prove the final assertion that limy→x, y∈Ω∇mZ(y) exists for any x ∈ ∂Ω, we note the formula
for ∇mZ is given by (9). Since mZ ∈ C0 Ω, km¯ Zk∞ < ∞, ∇xh, ∂mh ∈ C0 Ω × (0, ∞) and σ¯ 2+
2mZ∂mh(x, mZ) ≥ σ2, then the right hand side of (9) has limit as y → x for every x ∈ ∂Ω. Hence ∇mZ
does as well.
Definition 3.3. Let mZ be given by (8). Then we define the following function Φ : (0, ∞) → (0, ∞)
Φ(Z) = Z
Ω
mZ dx .
Lemma 3.4. There exists ¯Z, ¯
Z such that Φ ¯Z ≥ 1 and Φ ( ¯ Z) ≤ 1. Proof. Take C1 = infx∈Ωh
x,|Ω|1 . So C1∈−g Ω1 , g Ω1. Then, with ¯ Z = |Ω|e−σ22 C1 ∈ (0, ∞), we have G ¯ Z,x 1 |Ω| ≤ 0 for every x ∈ Ω . Hence, m ¯
Z(x) ≤|Ω|1 because GZ,xis a strictly decreasing function. So
Φ ( ¯
Z) ≤ km
¯
Zk∞|Ω| ≤ 1 .
We can similarly find ¯Z by taking C2 = supx∈Ωh
x,|Ω|1 . So C2 ∈ −g Ω1 , g Ω1. Then, with ¯ Z = |Ω|e−σ22 C2 ∈ (0, ∞), we have GZ,x¯ 1 |Ω| ≥ 0 for every x ∈ Ω . Hence, mZ¯(x) ≥|Ω|1 because GZ,xis a strictly decreasing function. So
Φ ¯Z ≥ kmZ¯k∞|Ω| ≥ 1 .
Lemma 3.5. There exists a unique Z∗∈ (0, ∞) such that Φ (Z∗) = 1.
Proof. If Φ is continuous and strictly decreasing the intermediate value theorem and the Lemma 3.4 give the result. We start by proving that Φ is strictly decreasing. First note that if mZ(x) is strictly decreasing
in Z for every x then Φ must be strictly decreasing because mZ is continuous with respect to x. Take
Z1< Z2. Then mZ1(x) satisfies
1 Z1
e−σ22 h(x,mZ1(x))− mZ
So 1 Z2 e−σ22h(x,mZ1(x))− mZ1(x) < 1 Z1 e−σ22h(x,mZ1(x))− mZ1(x) = 0 .
Hence GZ2,x(mZ1(x)) < 0. Then mZ2(x) < mZ1(x) for all x ∈ Ω since GZ,x is a strictly decreasing
function. To show that Φ is continuous at Z ∈ (0, ∞), take < Z0< Z. Then
|Φ(Z) − Φ(Z0)| = Φ(Z0) − Φ(Z) = Z Ω mZ0(x) dx − Φ(Z) = Z Z0 Z Ω 1 Ze −h(x,mZ0(x))dx − Φ(Z) ≤ Z Z0 Z Ω 1 Ze −h(x,mZ(x))dx − Φ(Z) ≤ Φ(Z) (Z − Z 0) ≤ Φ() (Z − Z 0) .
By exchanging Z and Z0 we can similarly show the analogous result for < Z < Z0, therefore Φ is locally
Lipschitz and hence continuous.
Theorem 3.6. There exists a unique solution m : Ω → (0, ∞) to the stationary BRS (7).
Proof. Take m(x) = mZ∗(x), with Z∗ defined as in Lemma 3.5. Then from Lemmas 3.2 and 3.5 we have
shown there exists a unique m : Ω → (0, ∞) satisfying m ∈ C2(Ω) ∩ C1 Ω¯
(10a)
There exists Z ∈ (0, ∞) such that m = 1 Ze − 2 σ2h(x,m), x ∈ Ω (10b) Z Ω m dx = 1 . (10c)
Now for any m : Ω → (0, ∞) we can define φ(m) by φ(m) = e−σ22h(x,m). Suppose m is a solution to (7),
then φ(m)m = meσ22h(x,m) and so m φ(m) ∈ H
1(Ω) because m ∈ C1 Ω, h ∈ C¯ 1 Ω × (0, ∞)¯
and h is increasing in m. Therefore a solution to (7) is equivalent to a solution of
m ∈ C2(Ω) ∩ C1 Ω¯ (11a) m φ(m)∈ H 1(Ω) (11b) ∇ · φ(m)∇ m φ(m) = 0 , x ∈ Ω (11c) φ(m)∇ m φ(m) · ν = 0 , x ∈ ∂Ω (11d) Z Ω m dx = 1 . (11e) Now if we multiply (11c) by m
φ(m), and integrate over Ω, then using Green’s formula and the boundary
condition (11d) we get 0 = Z Ω m φ(m)∇ · φ(m)∇ m φ(m) dx = − Z Ω φ(m) ∇ m φ(m) 2 dx . But φ(m) > 0 and ∇ m φ(m) 2
≥ 0 for every x ∈ Ω. Hence this is only true if ∇ m φ(m)
= 0 for every x ∈ Ω, i.e. if there exists Z ∈ (0, ∞) such that m = Z1φ(m). Conversely, if m ∈ C2(Ω) ∩ C1 Ω and there¯ exists Z ∈ (0, ∞) such that m = Z1φ(m), then m satisfies (11a)–(11d). So a solution of (11) is equivalent to a solution of (10). Therefore the systems (7) and (10) are equivalent. Hence we have shown existence and uniqueness of solutions to (10) by proving existence and uniqueness of solutions to (7).
Definition 3.7. The stationary MFG boundary value problem is to find m : Ω → (0, ∞), u : Ω → R and λ ∈ R satisfying the following PDE system
m ∈ C2(Ω) ∩ C1 Ω¯ (12a) u ∈ C2(Ω) ∩ C1 Ω¯ (12b) −σ 2 2 ∇ 2m − ∇ · (m∇u) = 0 , x ∈ Ω (12c) −σ 2 2 ∇ 2u +|∇u|2 2 − h(x, m) + λ = 0 , x ∈ Ω (12d) −σ 2 2 ∇m · ν = 0 , x ∈ ∂Ω (12e) −∇u · ν = 0 , x ∈ ∂Ω (12f) Z Ω m dx = 1, (12g) Z Ω u dx = 0 . (12h)
Remark 3.8. Here, following the method of Section 3.1, we note that for any u ∈ C2(Ω) ∩ C1 Ω, a¯ solution m of (12a), (12c), (12e), (12g) is equivalent to a solution of
m ∈ C2(Ω) ∩ C1 Ω¯ (13a) m = 1 Ze − 2 σ2u, x ∈ Ω (13b) Z = Z Ω e−σ22 udx . (13c)
Then by the arguments in Section 3.1, a unique m satisfying (13) exists and is the unique solution to (12a), (12c), (12e), (12g). So from now we only consider that solution.
Proposition 3.9. There exists a unique solution (m, u, λ) ∈C2(Ω) ∩ C1 Ω × C¯ 2(Ω) ∩ C1 Ω¯ × R
to the stationary MFG boundary value problem if and only if there exists a unique solution (u, λ, Z) ∈ C2(Ω) ∩ C1 Ω¯ × R ×(0, ∞) to u ∈ C2(Ω) ∩ C1 Ω¯ (14a) −σ 2 2 ∇ 2u + |∇u|2 2 − h x, 1 Ze − 2 σ2u + λ = 0 , x ∈ Ω (14b) −∇u · ν = 0 , x ∈ ∂Ω (14c) Z Ω 1 Ze − 2 σ2u dx = 1, (14d) Z Ω u dx = 0 . (14e)
Proof. First assume a unique solution (m, u, λ) to (12) exists, then thanks to remark 3.8, we have m =
1 Ze
−2
σ2u, for Z satisfying (13c). Then the triple (u, λ, Z) is clearly a solution to (14). Furthermore,
suppose another solution (u0, λ0, Z0) to (14) exists. Then (m0, u0, λ0), with m0 = Z1e−σ22u 0
, is a solution to (12). But since we assumed such solutions are unique, we have (m0, u0, λ0) = (m, u, λ) and hence (u0, λ0, Z0) = (u, λ, Z), so the solution to (14) is unique.
Next we assume that a unique solution (u, λ, Z) to (14) exists. Then, defining m = Z1e−σ22u, (m, u, λ)
is a solution to (12). Now suppose (m0, u0, λ0) is another solution then (again using remark 3.8) m0 =
1 Z0e
−2 σ2u
0
, where Z0 satisfies (13c). So (u0, λ0, Z0) satisfies (14). By uniqueness (u0, λ0, Z0) = (u, λ, Z) and
so (m, u, λ) is also unique.
Theorem 3.10. There exists a unique solution (m, u, λ) of the MFG system (12).
Proof (outline). First note, as a result of Proposition 3.9, we only need to prove existence and uniqueness of a solution to (14) and existence and uniqueness for the MFG system (12) will follow. The proof is split into the following steps:
(1) Show that for any pair of constants (λ, Z) there exists a unique solution, denoted by uλ,Z,
(2) Show that for any constant Z there exists a unique λ = λ(Z) such that uλ(Z),Z satisfies (14d)
(see Proposition 3.16)
(3) Show that there exists a unique Z = Z∗ such that uλ(Z∗),Z∗ satisfies (14e).
Then uλ(Z∗),Z∗, λ (Z∗) , Z∗ is a solution to (14). Uniqueness follows from uniqueness obtained at
each step of the proof outlined. We prove step 1 using a variant of the method of upper and lower solutions in the spirit of [38], so that it applies to our case of Neumann boundary conditions. We prove steps 2 and 3 by iteratively using the intermediate value theorem - in a similar manner to the proof of Lemma 3.5. Note that we will first do this for h which is strictly increasing in m. Then by considering h(x, m) = h(x, m) + log (|Ω|m), and taking the limit as → 0 we will prove it in the more general
setting when h is increasing.
Lemma 3.11. There exists Λ1, Λ2∈ [−∞, ∞] with Λ1< Λ2 such that for every λ ∈ (Λ1, Λ2) and Z > 0,
there exist two constants ¯ uλ,Z ≤ 0 ≤ ¯uλ,Z satisfying (15) − h x, 1 Ze − 2 σ2¯uλ,Z + λ ≤ 0 ≤ −h x, 1 Ze − 2 σ2uλ,Z¯ + λ .
Proof. Take Λ1= limm→0supx∈Ωh(x, m) and Λ2= limm→∞infx∈Ωh(x, m). First Λ1< Λ2by combining
assumptions (MFG2) and (MFG3). Then, since h is continuous and increasing in m, for any λ ∈ (Λ1, Λ2)
there exists M1 λ, M
2
λ∈ (0, ∞) such that h(x, m) ≤ λ for all (x, m) ∈ Ω× 0, M 1
λ, and similarly h(x, m) ≥ λ
for all (x, m) ∈ Ω ×Mλ2, ∞. We define the upper and lower constants for λ ∈ (Λ1, Λ2) as
¯ uλ,Z= max −σ 2 2 log ZM 1 λ, 0 ¯ uλ,Z= min −σ 2 2 log ZM 2 λ, 0 . Then clearly −h x, 1 Ze − 2 σ2¯uλ,Z + λ = −h x, min Mλ1, 1 + λ ≥ −h x, M1 λ + λ ≥ 0 ,
while the reverse inequality is true for ¯
uλ,Z. Hence ¯uλ,Z,
¯
uλ,Z are the required upper and lower constants.
Proposition 3.12. Define C2,τ Ω as the set of functions u ∈ C¯ 2 Ω whose second partial derivatives¯ are all H¨older continuous with exponent τ on ¯Ω. Assume h is strictly increasing with respect to m. Then, for every λ ∈ (Λ1, Λ2) and every Z ∈ (0, ∞) there exists a unique function, uλ,Z ∈ C2,τ Ω ⊂¯
C2(Ω) ∩ C1 Ω for some τ ∈ (0, 1), which satisfies (14a)–(14c). Furthermore,¯
¯
uλ,Z≤ uλ,Z≤ ¯uλ,Z.
Proof. Existence is an application of Corollary 2.9 in [38], which states that a solution uλ,Z ∈ C2,τ Ω¯
to (14a)–(14c) exists provided the following properties hold: (1) There exist constants
¯
uλ,Z≤ 0 ≤ ¯uλ,Z satisfying (15) for every x ∈ ¯Ω.
(2) There exists a continuous function f : [0, ∞) → [0, ∞) such that the following inequality holds for every (x, u, p) ∈ ¯Ω × R × Rd |p|2 2 − h x, 1 Ze − 2 σ2u + λ ≤ f (|u|) 1 + |p|2 .
Property 1 is true from Lemma 3.11. Property 2 can be shown to be true by taking f (u) = max 1 2, |λ| + max g 1 Ze − 2 σ2u , g 1 Ze 2 σ2u , where g is defined in assumption (A3).
We prove uniqueness using the strong maximum principle and Hopf’s Lemma as stated in [22] (Sec-tion 6.4.2. pp. 330–333). Suppose there are two solu(Sec-tions, u1, u2∈ C2(Ω)∩C1 Ω to (14a), (14b). Define¯
a = ∇(u1+ u2). Then a ∈ L∞ Ω. Now suppose u¯ 16= u2and define v = u1− u2. Then v must attain its
maximum at some point ¯x ∈ ¯Ω. First suppose ¯x ∈ Ω. Then there exists an open bounded and connected region V such that V ⊂ Ω, ¯x ∈ V and v > 0 for all x ∈ V . Hence, since h(x, ·) is increasing, we have
−σ
2
2 ∇
2v +1
So by the strong maximum principle v is constant in V . Therefore we must have h x, 1 Ze − 2 σ2u1(x) = h x, 1 Ze − 2 σ2u2(x) for every x ∈ V .
So u1= u2 in V because h is strictly increasing, which leads to a contradiction. Therefore the only other
option is that ¯x ∈ ∂Ω and v(x) < v(¯x) for every x ∈ Ω. Hence by Hopf’s Lemma (which we can use because ∂Ω is C2) ∂v∂ν¯x> 0, but by the boundary condition (14c), ∂v∂ν = ∂u1∂ν −∂u2∂ν = 0. This again leads to a contradiction. Therefore u1= u2, and therefore the solution is unique.
Remark 3.13. It should be noted that the same method to prove uniqueness can be used to prove that uλ1,Z≥ uλ2,Z for all λ1≤ λ2
Lemma 3.14. Assume h is strictly increasing with respect to m. Then for every x ∈ Ω, uλ,Z(x) is
decreasing with respect to λ and Z.
Proof. In Light of remark 3.13 we need only to prove uλ,Z1 ≥ uλ,Z2 for all Z1 ≤ Z2. However, by
substitution we find that u = uλ,Z1−σ 2 2 log
Z2
Z1 satisfies (14a)–(14c) with Z = Z2. So by uniqueness of
solutions to this PDE proved in Proposition 3.12 we see that uλ,Z2 = u ≤ uλ,Z1.
Proposition 3.15. Define Φ : (Λ1, Λ2) × (0, ∞) → L∞(Ω) by
Φ(λ, Z) = uλ,Z,
where uλ,Z is the unique solution to (14a)–(14c) as found in the Proposition 3.12. Assume h is strictly
increasing with respect to m. Then Φ is continuous (with respect to L∞ norm).
Proof. We will prove Φ is sequentially continuous. Let (λn, Zn) be a sequence in (Λ1, Λ2) × (0, ∞) that
converges to (λ, Z) ∈ (Λ1, Λ2) × (0, ∞). We consider two sequences: (λ (i)
n , Zn(i)) for i = 1, 2, which we use
to sandwich our original sequence. We set these sequences with the following conditions (1) λ(1)n = infj≥nλj
(2) λ(2)n = supj≥nλj
(3) Zn(1)= infj≥nZj
(4) Zn(2)= supj≥nZj
In the first part of this proof we show that for each i = 1, 2, there exists a subsequence (λ(i)nk, Z (i) nk) such
that uλ(i)
nk,Z(i)nk → uλ,Z. We will only show this for i = 1 as the case i = 2 is identical. Clearly the
sequence (λ(1)n , Z (1)
n ) also converges to (λ, Z). So there exists a subsequence nk such that uλ(1)
nk,Z(1)nk → u∗
in C2(Ω) ∩ C1( ¯Ω) because u
λ(1)nk,Znk(1) ∈ C
2,τ( ¯Ω) (by Proposition 3.12), which is compactly embedded in
C2( ¯Ω) ⊂ C2(Ω) ∩ C1( ¯Ω). Therefore we also get the following pointwise convergence 0 = lim k→∞ −σ 2 2 ∇ 2u λnk,Znk + 1 2 ∇uλnk,Znk 2 − h x, 1 Znk e−σ22uλnk,Znk + λnk = −σ 2 2 ∇ 2u ∗+ |∇u∗| 2 2 − h x, 1 Ze − 2 σ2u∗ + λ 0 = lim k→∞∇uλnk,Znk · ν|x∈∂Ω= ∇u∗· ν|x∈∂Ω.
So u∗= uλ,Z, by uniqueness proved in Proposition 3.12. Now by design we have λ (2)
nk ≥ λn ≥ λ (1) nk for all
n ≥ nk and similarly for Zn, hence uλ(1) nk,Z (1) nk ≥ uλn,Zn≥ uλ (2) nk,Z (2) nk by Lemma 3.14. So uλn,Zn→ uλ,Z in L∞(Ω).
Proposition 3.16. For each Z ∈ (0, ∞) define I1(·; Z) : (Λ1, Λ2) → R by
Assume h is strictly increasing with respect to m. Then for every Z ∈ (0, ∞) there exists a unique λ = λ(Z) such that I1(λ(Z); Z) = 0, furthermore
(16) inf x∈Ωh x, 1 Z ≤ λ(Z) ≤ sup x∈Ω h x, 1 Z .
Proof. We use the intermediate value theorem to prove this proposition. There are three parts we have to prove
(1) For every Z ∈ (0, ∞) there exists λ1≤ λ2∈ (Λ1, Λ2) such that I1(λ1; Z) ≤ 0 and I1(λ2; Z) ≥ 0.
(2) I1(λ; Z) is continuous with respect to λ in [λ1, λ2].
(3) I1(λ; Z) is strictly decreasing with respect to λ.
Part (1) and part (2) allow us to use the intermediate value theorem to show that for every Z ∈ (0, ∞) there exists some λ such that I1(λ; Z) = 0. Part (3) shows that this λ is unique, so the function Z 7→ λ(Z)
is well defined.
Part (1): Take λ1= supx∈Ωh x,Z1 > Λ1. Then recall that ¯uλ1,Z= max
−σ2 2 log(ZM 1 λ1), 0 , where M1 λ1 satisfies h x, M 1
λ1 ≤ λ1. But we can take Mλ11 = 1
Z by our choice of λ1. So uλ1,Z≤ ¯uλ1,Z= 0, and
therefore I1(λ1; Z) ≤ 0. The choice for λ2 is λ2= infx∈Ωh x,Z1 and the proof is similar to the above.
Part (2): Take λ1, λ2 as above. By Propositions 3.12 and 3.15 and Lemma 3.14, uλ,Z is continuous
with respect to λ in L∞(Ω) and ¯
uλ2,Z ≤ uλ,Z ≤ ¯uλ1,Z for any λ ∈ [λ1, λ2]. So by the dominated
convergence theorem I1is continuous in λ.
Part (3): Take λ1 < λ2, from Lemma 3.14 we know uλ1,Z ≥ uλ2,Z. Clearly, since solutions to the
PDE (14a)–(14c) are unique, there exists a ∈ Ω such that uλ1,Z(a) 6= uλ2,Z(a). Hence, uλ1,Z(a) >
uλ2,Z(a) and so by continuity I1(λ1, Z) > I1(λ2, Z).
Remark 3.17. This proposition ensures that for any Z ∈ (0, ∞) we can find λ = λ(Z) and u = uλ(Z),Z
satisfying (14a)–(14c), (14e), so we are left to find Z∗ such that (14d) holds.
Lemma 3.18. Assume h is strictly increasing with respect to m. Then the function λ(Z) is strictly decreasing.
Proof. From Lemma 3.14, uλ,Z is strictly decreasing with respect to Z. Now suppose Z1< Z2then
0 = I1(λ(Z2), Z2) = I1(λ(Z1), Z1) > I1(λ(Z1), Z2) .
Therefore, since I1 is strictly decreasing in λ, λ(Z2) < λ(Z1) so λ(Z) is strictly decreasing with respect
to Z.
Lemma 3.19. Assume h is strictly increasing with respect to m. Then the function λ(Z) is continuous.
Proof. We will prove λ(Z) is sequentially continuous. Let Zn be a sequence in (0, ∞) that converges to
Z ∈ (0, ∞). We consider two sequences: Zn(i)for i = 1, 2, which we use to sandwich our original sequence.
We choose these sequences as follows (1) Zn(1)= infj≥nZj
(2) Zn(2)= supj≥nZj
Now, Zn(1), Z (2)
n → Z and are increasing and decreasing sequences respectively. Furthermore, there
exists ¯ Z, ¯Z such that Zn(1), Z (2) n ∈ [ ¯
Z, ¯Z] for every n ∈ N. So, since λ(Z) is decreasing, λ(Zn(1)), λ(Z (2) n ) ∈
[λ( ¯Z), λ( ¯
Z)] and are decreasing and increasing respectively. Therefore, λ(Zn(1)) → λ(1) and λ(Z (2) n ) → λ(2). Using continuity of I 1 we get for i = 1, 2: 0 = lim n→∞I1 λ(Zn(i)), Zn(i)= I1 λ(i), Z. Hence by definition λ(1) = λ(2) = λ(Z). Since Z(i)
n bound Zn, then λ(Z (i)
n ) bound λ(Zn). Hence
Proposition 3.20. Define I2: (0, ∞) → (0, ∞) by (17) I2(Z) = Z Ω 1 Ze − 2 σ2uλ(Z),Z dx .
Assume h is strictly increasing with respect to m. Then there exists a unique Z∗ ∈ (0, ∞) such that I2(Z∗) = 1.
Proof. Similar to the proof of Proposition 3.16, we prove this proposition using the intermediate value theorem. Again there are three parts we have to prove
(1) There exists Z1≤ Z2∈ (0, ∞) such that I2(Z1) ≥ 1 and I2(Z2) ≤ 1.
(2) I2(Z) is continuous with respect to Z for all Z ∈ [Z1, Z2].
(3) I2(Z) is strictly decreasing with respect to Z.
Steps (1) and (2) prove existence via the intermediate value theorem, step (3) proves uniqueness. Step (1): From assumption (MFG2), we can find Z2≥ |Ω| such that
(18) sup x∈Ω h x, 1 Z2 ≤ inf x∈Ωh x, 1 |Ω| . Then, since λ(Z2) ≤ supx∈Ωh
x,Z21 (from (16)), it follows that
uλ(Z2),Z2≥ usup x∈Ωh x, 1 Z2 ,Z2 ≥¯usupx∈Ωhx, 1 Z2 ,Z2 = min −σ 2 2 log Z2M, 0 , where M satisfies h(x, M ) ≥ supx∈Ωhx,Z1
2
for all x (from the proof of Lemma 3.11). But from (18), this is clearly satisfied by M = |Ω|1 , and in this case min−σ2
2 log Z2M, 0 = −σ22logZ2|Ω|. Thus I2(Z2) ≤ Z Ω 1 Z2 e−σ22 −σ2 2 logZ2|Ω| dx = Z Ω 1 |Ω| dx = 1 .
A similar procedure works to find Z1, in which case Z1 satisfies Z1 ≤ |Ω| and infx∈Ωh
x,Z11 ≥ supx∈Ωh
x,|Ω|1 .
Step (2): Take Z1≤ Z2as in Step (1). Then for every Z ∈ [Z1, Z2] there exists C1, C2∈ R such that
( by (16)) C2= inf x∈Ωh x, 1 Z2 ≤ λ(Z2) ≤ λ(Z) ≤ λ(Z1) ≤ sup x∈Ω h x, 1 Z1 = C1. So ¯
uC1,Z2 ≤ uλ(Z),Z ≤ ¯uC2,Z1 for every Z ∈ [Z1, Z2]. So we can use the dominated convergence theorem
along with continuity of uλ,Zwith respect (λ, Z) and continuity of λ(Z) with respect to Z to show I2(Z)
is continuous. Step (3): Take
¯
Z < ¯Z then there exists a ∈ Ω such that uλ( ¯ Z), ¯Z(a) < uλ( ¯Z), ¯Z(a) . Therefore, at a ∈ Ω: 1 ¯ Ze − 2 σ2uλ(¯Z),¯Z = 1 ¯ Ze −2 σ2uλ(¯Z), ¯Z > 1 ¯ Ze − 2 σ2uλ( ¯Z), ¯Z. So I2( ¯
Z) > I2( ¯Z) because of the continuity of uλ,Z. This proves I2 is strictly decreasing.
End of proof of Theorem 3.10. First let’s assume h is a strictly increasing function in m. Then we can choose the unique Z∗ ∈ (0, ∞) such that I2(Z∗) = 1. Then clearly the triple uλ(Z∗),Z∗, λ(Z∗), Z∗ is a
solution to the system (14). Furthermore, suppose (u0, λ0, Z0) is also a solution of (14). But this implies that u0 satisfies (14a)–(14c), so u0 = uλ0,Z0 from uniqueness proven in Proposition 3.12. Then uλ0,Z0 also
solves (14e), so by uniqueness proven in Proposition 3.16 we can show λ0= λ(Z0). Finally we now have u0= u
λ(Z0),Z0 meets the integral constraint (14d). So from uniqueness proven in Proposition 3.20 we have
Z0= Z∗. Therefore (u0, λ0, Z0) = uλ(Z∗),Z∗, λ(Z∗), Z∗. Hence the unique solution to the MFG problem
is given by mZ∗, uλ(Z∗),Z∗, λ(Z∗), where mZ∗ is defined by mZ∗= 1 Z∗e
− 2 σ2uz∗.
Now let’s assume h is an increasing function in m and define h(x, m) by
Then for every ∈ (0, 1], h is a strictly increasing function of m. Furthermore h still satisfies
assump-tions (MFG1)–(MFG3). Therefore there exists a unique solution (u, λ, Z) to the MFG system (14).
From Proposition 3.20, Z∈ [Z1, Z2] for some Z1, Z2∈ (0, ∞) such that
0 < Z1≤|Ω| ≤ Z2 < ∞ sup x∈Ω h x, 1 Z2 ≤ inf x∈Ωh x, 1 |Ω| inf x∈Ωh x, 1 Z1 ≥ sup x∈Ω h x, 1 |Ω| .
But by the definition of h we have h
x,Z12 ≤ hx,Z12 and h x,|Ω|1 = hx,|Ω|1 , and a similar inequality holds for Z1
. So we can find Z1 ∈ (0, |Ω|] and Z2 ∈ [|Ω|, ∞) independent of such that
Z∈ [Z1, Z2] for every ∈ (0, 1]. Now, from Lemma 3.18
λ= λ(Z) ∈λ(Z2), λ(Z1) .
So take a sequence nsuch that limn→∞n= 0. Then, since u∈ C2,τ Ω, which is compactly embedded¯
in C2(Ω) ∩ C1 Ω, there exists a subsequence also denoted by n such that u¯
n → u0 with convergence
in C2(Ω) ∩ C1 Ω, Z¯
n → Z0∈ [Z
1, Z2], and λ
n→ λ0∈λ(Z
2), λ(Z1). So we find, by taking limits
−σ 2 2 ∇ 2u 0+ |∇u0|2 2 − h x, 1 Z0 e−σ22u0 + λ0= 0 .
Similarly we can show (u0, λ0, Z0) satisfy (14). So we have proven existence of solutions for increasing h.
For uniqueness, let’s assume (u1, λ1, Z1) and (u2, λ2, Z2) are both solutions of (14) and λ1 ≤ λ2.
Define u = u2−σ 2 2 log Z 1 Z2 , then u satisfies −σ 2 2 ∇ 2u + |∇u|2 2 − h x, 1 Z1 e−σ22 u + λ1= 0 .
Now define v = u − u2 and suppose there exists x ∈ ¯Ω such that v(x) > 0. Then v has a maximum
at x∗ and v(x∗) > 0. By Hopf’s lemma x∗ ∈ Ω and by the maximum principle v is constant in the set Ω+ = {x ∈ Ω : v(x) ≥ 0} (see the proof of Proposition 3.12 for the details of such an argument). By
assumption x∗∈ Ω+ and v(x∗) > 0, so Ω+ = Ω by continuity of v. Hence v is constant in Ω and v > 0.
However, from the integral constraint (14d) we obtain
(19) 0 = Z Ω 1 Z1 e−σ22u1 1 − e−σ22v dx = |Ω|1 − e−σ22v ,
since 1 − e−σ22 v is constant. So v = 0, contradicting the assumption v(x∗) > 0. Therefore v(x) ≤ 0 for
all x ∈ Ω. This implies that 1 − e−σ22v ≤ 0, and subsequently that
0 = Z Ω 1 Z1 e−σ22u1 1 − e−σ22v dx ≤ 0 ,
with equality if and only if v = 0. Therefore u2= u, which implies (using the integral constraint (14e))
that Z1 = Z2, and subsequently that u1 = u2. Finally, by subtracting the PDE (14b) satisfied by u1
from the one satisfied by u2we find λ2= λ1. Therefore solutions are unique.
4. Quadratic Potential
4.1. The MFG. The stationary MFG model studied in [27] and [29], with the integral constraints used in this paper, is given by:
σ2 2 ∂ 2 xxm + ∂x(m∂xu) = 0 , x ∈ R , (20a) −|∂xu| 2 2 + log m + βx 2+σ 2 2 ∂ 2 xxu + λ = 0 , x ∈ R , (20b) Z R m dx = 1 . (20c)
where λ ∈ (−∞, ∞) is a constant to be found as part of the solution, and σ, β ≥ 0 are given parameters. Proposition 4.1. A solution to the stationary MFG system (20) exists and has an explicit form
m(x) =a π 1/2 e−ax2 (21a) u(x) = bx2 (21b) λ = logπ a − σ2b , (21c)
where the constants a, b, c ≥ 0 are given by
a = β, b = 0 , if σ = 0, or a = −1 + 1 + 2σ 4β1/2 σ4 , b = −1 + 1 + 2σ4β1/2 2σ2 , if σ > 0.
The proof is straight-forward using substitution.
4.2. The BRS. Next we consider the respective stationary BRS model. It is given by ∂x m∂x(log m + βx2) + σ2 2 ∂ 2 xxm = 0 , x ∈ R (22a) Z R m dx = 1 . (22b)
Proposition 4.2. The solution to the stationary BRS equation (22) is given by m(x) = 2β (2 + σ2)π 1/2 e− 2β (2+σ2 )x 2 .
Again the claim follows from substitution. 4.3. Comparison.
Proposition 4.3. For σ > 0, the stationary distributions of the MFG system (20) and the BRS (22) are given by normal distributions with mean 0 and variances a1 and a2 respectively, where
a1= σ4 −2 + 2(1 + 2σ4β)1/2 a2= 2 + σ2 4β . Then for fixed β ≥ 0
lim σ2→0 a2 a1 = 1 (23a) lim σ2→∞ a2 a1 = 1 (2β)1/2. (23b)
While for fixed σ > 0
(a) β = 0.1, σ 2 2 = 10 (b) β = 0.1, σ 2 2 = 0.2 (c) β = 1, σ22 = 10 (d) β = 1, σ2 2 = 0.2 (e) β = 10, σ22 = 10 (f) β = 10, σ2 2 = 0.2
Figure 1. Simulations of BRS and MFG with quadratic potential and logarithmic congestion
Proof. The first part of this proof is trivial from the previous propositions. Now a2
a1
=(2 + σ
2) (1 + 2σ4β)1/2− 1
2σ4β .
Using a Taylor expansion of (1 + x)1/2 around x = 0 gives behaviour for small σ2 i.e.
a2
a1
= (2 + σ
2)(σ4β + o(σ4))
Hence limσ2→0a2a
1 = limσ2→0 2+σ2
2 = 1. The other limit can be simply calculated
lim σ2→∞ a2 a1 = lim σ2→∞ (2 + σ2)(1 + 2σ4β)1/2 2σ4β = limσ2→∞ (2 + σ2)(2β)1/2σ2 2σ4β = 1 (2β)1/2.
The limits as β → 0, ∞ for fixed σ, follows from straight forward calculations. This result is an important first glimpse at how the behaviour of the BRS and MFG may vary, as well as the importance certain parameters play in the difference. The limit in (23a) shows that the existence of noise is vital to see any difference between the two models. However, as soon as there is noise, its effect on the relative difference plays a less important role than the strength of the quadratic potential, this can be seen in (23b) and (24b). Specifically the limit (24b) shows that the relative difference between the variances of the two distributions grows like β12, which means the BRS distribution reacts much
more rapidly with changes to the potential strength than the MFG. This suggests that the MFG is more affected by congestion or is a more congestion-averse model than the BRS one.
At a conceptual level this agrees with the formulation of the MFG and BRS systems. The agents in the BRS are acting myopically, only reacting to the situation as it currently exists, which isn’t the case in the MFG. As a result the BRS agents don’t ‘see’ the future congestion that will result from their behaviour and hence they move towards the minimum of βx2 more rapidly than the MFG agents who do see the future cost of the congestion that results from their behaviour. Therefore, thinking of the stationary solutions as the long time, time-averaged behaviour of the models then the stationary BRS will result in a distribution that appears to take into account the congestion less than the MFG and hence one with a smaller variance. This expectation is confirmed by the result (24b) and it in fact quantifies the extent to which the BRS ignores the congestion compared with the MFG.
For this model we have run a variety of simulations — both to confirm our numerical methods (see Section 5 for methods) and to visualise how the parameters affect the distributions. Figure 1 shows the results of these simulations on a bounded domain for a variety of parameter choices. Although the formulation on a bounded domain is slightly different than the one in this section, the same behaviour can be seen. For small σ the difference between the models doesn’t change much as β increases, while for large values of σ the BRS model is much more dramatically affected by changes to β. In both cases the BRS and MFG are more closely aligned when β is small.
5. Simulations
We conclude with presenting various computational experiments, which illustrate the difference between solutions to the BRS and MFG for different choices running costs and potentials.
5.1. Solving the stationary BRS and MFG. Solutions to the stationary BRS (7) can be computed by finding the zeros of the function GZ,x(m) at every discrete grid point x on a grid given Z. To compute
the roots of GZ,x at every grid point x we use a Newton-Raphson method. Then, having found mZ(x)
for a particular value of Z, we can differentiate the implicit formula mZ = 1ze − 2
σ2h(x,mZ) with respect
to Z and use a Newton-Raphson method to find Z such that Φ(Z) = R
ΩmZ dx = 1. In practice this
means iterating between the two Newton-Raphson methods: first finding mZn, then computing Zn+1,
then recomputing mZn+1 and repeating until convergence.
The solution to the stationary MFG (12) are found using an iterative procedure. Given an admissible initial iterate ml, l = 0 we solve the HJB equation (12d) to obtain ul and λl. Note that we include the constraint (12h) via a Lagrange multiplier. In the final step of the iteration we update the distribution of agents by solving the FPE (12c) using ul+1to obtain ml+1. This procedure is repeated until convergence. Note that we sometimes perform a damped update
ul= ωul−1+ (1 − ω)vl and ml= ml−1+ (1 − ω)ql
where vl, ql are the undamped solutions of each iteration process. This damping helps to ensure con-vergence. Solutions to the HJB and the FPE are obtained by using an H1 conforming finite element discretisation.
MFG cost will always be centred around 0 while the BRS cost could be above or below it. As a result, it was not clear that comparing the “costs” of the two models is especially useful and hence we have solely focussed on comparing the distribution of agents.
5.2. Single well potential. In the first examples we investigate the behavior of solutions to both models for cost functionals of the form h(x, m) = F (m) + βx2, using different functions F and parameters β. We
also analyse how the noise level σ affects the two stationary states. Note that the case F (m) = log(m) was already discussed in Section 4. We are particularly interested how penalising congestion by considering functions F (m) of the form F (m) = mαfor some α > 0 or F (m) = m 1
max−m for some mmax > 1 Ω affect
solutions. The last choice introduces a ‘barrier’ above which the density can not exceed. Using such a congestion term is more realistic from a modelling perspective than either the logarithmic or power–law term as it forces densities to stay below a certain physical reasonable limit. We will observe a similar dependence on the parameters β, σ compared with the logarithmic congestion term — and in fact the same can be said for all of our simulations. So for all values of σ the MFG model responded less to changes in the strength of the potential β compared with the BRS model, however the difference is most pronounced as σ increases and again it may be expected that as σ → 0 that the two models align very closely.
The most notable difference between the use of a logarithmic congestion term and a power–law congestion term is a difference in the shape of the distribution, particularly the flatness of the peak of the distribution as shown in figure 2. Importantly this characteristic is shared by both the MFG and the BRS, suggesting the congestion terms F (m) affect both models in similar ways. When looking at congestion terms of the form F (m) = m 1
max−m, we can find regimes where the behaviour is similar to the
logarithm, or more like a vastly exaggerated version of the power–law congestion. When mmax is large,
as in figure 3a, the resulting distribution for both models looks like a normal distribution, similar to the case with logarithmic congestion. In fact when mmaxis very large, a formal asymptotic analysis using a
Taylor expansion around mmax can be made which shows
1 mmax− m
≈ 1
mmax
. Therefore the BRS satisfies the equation
m ≈ 1 Ze − 2 σ2 βmmaxx2 −1 mmax ≈ 1 Ze −2β σ2x 2 , with a normalisation constant Z = R
Ωe −2β
σ2x 2
, which corresponds, on the whole space R, to a normal distribution with zero mean and variance σ4β2. The variance of the BRS solution found here differs from the logarithmic congestion case by 4β2. A similar analysis shows that when mmax → ∞ the MFG
(a) mmax= 10 (b) mmax= 1
Figure 3. Simulations of BRS and MFG with h(x, m) = mmax−m1 + x 2
approximately resembles a normal distribution with zero mean and variance σ2β2. In summary, solutions to the MFG and the BRS are both normal distributions with zero mean and with variances whose relative difference is 12.
However, when mmax is reduced, as in figure 3b, the peak flattens out in a similar but exaggerated
way compared to the power–law congestion. It is interesting to note that the BRS seems to respond more to the change in mmaxthan the MFG, this is in contrast to the role that σ plays in the two models where
the MFG responds more to changes in σ compared with the BRS.
5.3. Double well potential. The previous subsection has given insight into how the form of the con-gestion term affects both models. In this section we explore how the potential term affects each model. For this section we will consider costs of the form h(x, m) = F1(x) + log(m), where F1(x) will be a double
well potential (see figure 4). Since the key insight of this section is to understand how varying F1affects
the similarity of solutions to the BRS and MFG models, we have decided not to include results with different congestion terms other than the logarithm. From simulations it can be seen that the effect of changing the congestion term from log(m) to another term is very similar whether we are considering a single well or a double well.
Our simulations focus on five different double wells, which can be seen in figure 4. We vary the potentials as follows
(1) same depth, same width, (2) different depth, same width, (3) same depth, different width, (4) approximately similar perimeter, (5) approximately similar volume.
The simulations with the first two potentials, see figures 5 and 6, where the widths of the two wells are always the same, show that the two models display similar qualitative behaviour. As with the single well potential, as σ increases the discrepancy between the two models also increases, with the MFG model being more affected by the level of noise than the BRS. As expected, when the two wells are of equal depth (as in figure 5) then both the MFG and BRS attribute equal weight between the wells, while when one well is deeper than the other (as in figure 6) both models give more mass to the location of the deeper well. This is true for all values of σ, although the effect reduces as σ increases, as can be seen by comparing figures 5a and 6a to figures 5b and 6b respectively.
Up to this point we have seen that the qualitative behaviour of the two models tends to agree. However when we look at double well potentials where the width of the well is varying, we start to observe differences. In figure 7, where the wells have the same depth but different width, we see that the BRS still distributes density equally to the two wells. However the MFG model results in a higher density focussed in the wider well than in the narrower well. The reason the width has no effect on the BRS can be seen from studying the implicit equation. In each well we are solving m = Z1e−σ22 h(x,m). In
(a) Double well potential for Figure 5 (b) Double well potential for Figure 6
(c) Double well potential for Figure 7 (d) Double well potential for Figure 8
(e) Double well potential for Figure 9
Figure 4. Double well potentials for simulations
relative height of the distribution m will be the same in each well. The reason the MFG is affected by the width of the of the well is that in finding the MFG solution we are in fact solving an elliptic equation to find the function u, hence at each point x this u will be affected by factors that can’t be described by just looking at the value of h at that point. In other words, the BRS depends only on local properties of the cost h whereas the MFG depends also on non-local properties. To understand why the MFG assigns greater density to the wider well we need to look at the underlying optimisation problems related to the MFG and the BRS.
(a) σ22 = 0.2 (b) σ2
2 = 1
Figure 5. Simulation of BRS and MFG with logarithmic congestion and potential given in figure 4a (a) σ 2 2 = 0.2 (b) σ2 2 = 1
Figure 6. Simulation of BRS and MFG with logarithmic congestion and potential given in figure 4b
We have seen that increasing well width affects only the MFG while increasing well depth affects both the MFG and BRS. Now we can balance these effects to create situations in which the two models give completely different results. Figure 8 involves a double well where the width and depth of the wells differ but the area of the well is the same, while in figure 9 the perimeter of the wells was kept the same. In the case of a small noise term, then both the MFG and BRS favour the deeper wells. However with a larger noise term, figure 8b shows that there are cases where the wider shallower well is favoured by the MFG while the narrower, deeper well is favoured by the BRS.
6. Conclusion and outlook
(a) σ22 = 0.2 (b) σ2
2 = 1
Figure 7. Simulation of BRS and MFG with logarithmic congestion and potential given in figure 4c
(a) σ22 = 0.2 (b)
σ2 2 = 1
Figure 8. Simulation of BRS and MFG with logarithmic congestion and potential given in figure 4d
We supported our analytic results by numerical simulations and investigated the similarities and differ-ences of the MFG and BRS models systematically in various computational experiments. We are planning to extend the analysis and simulations to the dynamic case in the future, and consider cost functions other than linear-quadratic ones.
References
[1] M. Barker. From mean field games to the best reply strategy in a stochastic framework. Journal of Dynamics & Games, 6(4):291–314, 2019.
[2] J. D. Benamou, G. Carlier, and F. Santambrogio. Variational mean field games. Modeling and Simulation in Science, Engineering and Technology, 1:141–171, 2017.
[3] C. Bertucci, J.-M. Lasry, and P.-L. Lions. Some remarks on mean field games. Communications in Partial Differential Equations, 44:205–227, 2019.
[4] P. Biane and R. Speicher. Free diffusions, free entropy and free Fisher information. Annales de l’institut Henri Poincare (B) Probability and Statistics, 37:581–606, 2001.
[5] L. Boccardo, L. Orsina, and A. Porretta. Strongly coupled elliptic equations related to mean-field games systems. Journal of Differential Equations, 261:1796–1834, 2016.
[6] P. E. Caines, M. Huang, and R. P. Malham´e. Mean Field Games. In T. Basar and G. Zaccour, editors, Handbook of Dynamic Game Theory, pages 1–28. Springer Berlin Heidelberg, 2017.
[7] P. Cardaliaguet and P. Jameson Graber. Mean field games systems of first order. ESAIM - Control, Optimisation and Calculus of Variations, 21:690–722, 2015.
(a) σ22 = 0.2 (b) σ2
2 = 1
Figure 9. Simulation of BRS and MFG with logarithmic congestion and potential given in figure 4e
[9] R. Carmona and F. Delarue. Probabilistic Theory of Mean Field Games with Applications I, volume 83 of Probability Theory and Stochastic Modelling. Springer International Publishing, Cham, 2018.
[10] R. Carmona and F. Delarue. Probabilistic Theory of Mean Field Games with Applications II, volume 84 of Probability Theory and Stochastic Modelling. Springer International Publishing, Cham, 2018.
[11] J. A. Carrillo, R. S. Gvalani, G. A. Pavliotis, and A. Schlichting. Long-Time Behaviour and Phase Transitions for the Mckean–Vlasov Equation on the Torus. Archive for Rational Mechanics and Analysis, 2019.
[12] J. A. Carrillo, R. J. McCann, and C. Villani. Kinetic equilibration rates for granular media and related equations: Entropy dissipation and mass transportation estimates. Revista Matematica Iberoamericana, 19:971–1018, 2003. [13] J. A. Carrillo, R. J. McCann, and C. Villani. Contractions in the 2-Wasserstein length space and thermalization of
granular media. Archive for Rational Mechanics and Analysis, 179:217–263, 2006.
[14] A. Cesaroni and M. Cirant. Concentration of ground states in stationary mean-field games systems. Analysis and PDE, 12:737–787, 2018.
[15] L. Chayes and V. Panferov. The McKean-Vlasov equation in finite volume. Journal of Statistical Physics, 138:351–380, 2010.
[16] M. Cirant. Multi-population Mean Field Games systems with Neumann boundary conditions. Journal de Math´ematiques Pures et Appliqu´ees, 103:1294–1315, 2015.
[17] M. Cirant. Stationary focusing mean-field games. Communications in Partial Differential Equations, 41:1324–1346, 2016.
[18] P. Degond, M. Herty, and J. G. Liu. Meanfield games and model predictive control. Communications in Mathematical Sciences, 15:1403–1422, 2017.
[19] P. Degond, J. G. Liu, and C. Ringhofer. Large-scale dynamics of mean-field games driven by local nash equilibria. Journal of Nonlinear Science, 24:93–115, 2014.
[20] D. Evangelista, R. Ferreira, D. A. Gomes, L. Nurbekyan, and V. Voskanyan. First-order, stationary mean-field games with congestion. Nonlinear Analysis, Theory, Methods and Applications, 173:37–74, 2018.
[21] D. Evangelista and D. A. Gomes. On the Existence of Solutions for Stationary Mean-Field Games with Congestion. Journal of Dynamics and Differential Equations, 30:1365–1388, 2018.
[22] L. C. Evans. Partial Differential Equations. American Mathematical Society, USA, 1998.
[23] R. Ferreira and D. Gomes. Existence of Weak Solutions to Stationary Mean-Field Games through Variational Inequal-ities. SIAM Journal on Mathematical Analysis, 50:5969–6006, 2018.
[24] R. Ferreira, D. Gomes, and T. Tada. Existence of weak solutions to first-order stationary mean-field games with Dirichlet conditions. Proceedings of the American Mathematical Society, 147:4713–4731, 2019.
[25] D. Gomes and H. S´anchez Morgado. A stochastic Evans-Aronsson problem. Transactions of the American Mathematical Society, 366:903–929, 2013.
[26] D. A. Gomes, S. Patrizi, and V. Voskanyan. On the existence of classical solutions for stationary extended mean field games. Nonlinear Analysis, Theory, Methods and Applications, 99:49–79, 2014.
[27] D. A. Gomes, E. A. Pimentel, and V. Voskanyan. Regularity Theory for Mean-Field Game Systems. SpringerBriefs in Mathematics. Springer International Publishing, Cham, 2016.
[28] D. A. Gomes and V. K. Voskanyan. Extended deterministic mean-field games. SIAM Journal on Control and Opti-mization, 54:1030–1055, 2016.
[29] O. Gu´eant. A reference case for mean field games models. Journal de Math´ematiques Pures et Appliqu´ees, 92:276–294, 2009.
[30] O. Gu´eant. Mean field games equations with quadratic Hamiltonian: A specific approach, volume 22. 2012.
[31] M. Huang, P. E. Caines, and R. P. Malham´e. An invariance principle in large population stochastic dynamic games. Journal of Systems Science and Complexity, 20:162–172, 2007.
[33] M. Huang, R. Malham´e, and P. Caines. Large population stochastic dynamic games: closed-loop McKean-Vlasov systems and the Nash certainty equivalence principle. Communications in Information & Systems, 6:221–251, 2006. [34] J.-M. Lasry and P.-L. Lions. Jeux `a champ moyen. I – Le cas stationnaire. Comptes Rendus Mathematique, 343:619–625,
2006.
[35] J.-M. Lasry and P.-L. Lions. Jeux `a champ moyen. II – Horizon fini et contrˆole optimal. Comptes Rendus Mathematique, 343:679–684, 2006.
[36] J. M. Lasry and P. L. Lions. Mean field games. Japanese Journal of Mathematics, 2:229–260, 2007. [37] R. J. McCann. A convexity principle for interacting gases. Advances in Mathematics, 128:153–179, 1997.
[38] K. Schmitt. Boundary value problems for quasilinear second order elliptic equations. Nonlinear Analysis, Theory, Methods & Applications, 2:263–309, 1978.
[39] Y. Tamura. On asymptotic behaviours of the solution of a non-linear diffusion process. J. Fac. Sci. Univ. Tokyo Sect. 1A, Math, 31:195–221, 1984.