McKean–Vlasov Optimal Control: Limit Theory and Equivalence Between Different Formulations

Mao Fabrice Djete
[email protected]
Centre de Mathématiques Appliquées, École Polytechnique, 91120 Palaiseau, France;
Dylan Possamaï
[email protected]
https://orcid.org/0000-0002-9364-0124
Department of Mathematics, Eidgenössische Technische Hochschule Zürich, 8092 Zürich, Switzerland;
Xiaolu Tan
[email protected]
Department of Mathematics, The Chinese University of Hong Kong, Hong Kong

Published Online: 14 Feb 2022
https://doi.org/10.1287/moor.2021.1232

Mathematics of Operations Research

Volume 47, Issue 4

November 2022

Pages 2547-3399, C2

Information

Received: July 01, 2021
Accepted: August 25, 2021
Published Online: February 14, 2022

Cite as

Mao Fabrice Djete, Dylan Possamaï, Xiaolu Tan (2022) McKean–Vlasov Optimal Control: Limit Theory and Equivalence Between Different Formulations. Mathematics of Operations Research 47(4):2891-2930.

https://doi.org/10.1287/moor.2021.1232

Acknowledgments

The authors thank Daniel Lacker and three anonymous reviewers for their helpful comments and suggestions.
