McKean–Vlasov Optimal Control: Limit Theory and Equivalence Between Different Formulations

Published Online:https://doi.org/10.1287/moor.2021.1232

References

  • [1] Acciaio B, Backhoff-Veraguas J, Carmona R (2019) Extended mean field control problems: Stochastic maximum principle and transport perspective. SIAM J. Control Optim. 57(6):3666–3693.CrossrefGoogle Scholar
  • [2] Andersson D, Djehiche B (2011) A maximum principle for SDEs of mean–field type. Appl. Math. Optim. 63(3):341–356.CrossrefGoogle Scholar
  • [3] Bahlali K, Mezerdi M, Mezerdi B (2014) Existence of optimal controls for systems governed by mean–field stochastic differential equations. Afrika Statistika 9(1):627–645.CrossrefGoogle Scholar
  • [4] Bahlali K, Mezerdi M, Mezerdi B (2017) Existence and optimality conditions for relaxed mean–field stochastic control problems. Systems Control Lett. 102:1–8.CrossrefGoogle Scholar
  • [5] Bahlali K, Mezerdi M, Mezerdi B (2018) On the relaxed mean–field stochastic control problem. Stochastics Dynamics 18(3):1850024.CrossrefGoogle Scholar
  • [6] Bahlali K, Mezerdi M, Mezerdi B (2020) Stability of McKean–Vlasov stochastic differential equations and applications. Stochastic Dynamics 20(01):2050007.CrossrefGoogle Scholar
  • [7] Balata A, Huré C, Laurière M, Pham H, Pimentel I (2019) A class of finite–dimensional numerically solvable McKean–Vlasov control problems. ESAIM Proc. Surveys 65:114–144.CrossrefGoogle Scholar
  • [8] Bandini E, Cosso A, Fuhrman M, Pham H (2018) Randomization method and backward SDEs for optimal control of partially observed path–dependent stochastic systems. Ann. Appl. Probab. 28(3):1634–1678.CrossrefGoogle Scholar
  • [9] Basei M, Pham H (2019) A weak martingale approach to linear–quadratic McKean–Vlasov stochastic control problems. J. Optim. Theory Appl. 181(2):347–382.CrossrefGoogle Scholar
  • [10] Bayraktar E, Cosso A, Pham H (2018) Randomized dynamic programming principle and Feynman–Kac representation for optimal control of McKean–Vlasov dynamics. Trans. Amer. Math. Soc. 370(3):2115–2160.CrossrefGoogle Scholar
  • [11] Bensoussan A, Frehse J, Yam S (2015) The master equation in mean field theory. J. Mathématiques Pures Appliquées 103(6):1441–1474.CrossrefGoogle Scholar
  • [12] Björk T, Murgoci A (2014) A theory of Markovian time–inconsistent stochastic control in discrete time. Finance Stochastics 18(3):545–592.CrossrefGoogle Scholar
  • [13] Björk T, Khapko M, Murgoci A (2017) On time–inconsistent stochastic control in continuous time. Finance Stochastics 21(2):331–360.CrossrefGoogle Scholar
  • [14] Buckdahn R, Djehiche B, Li J (2011) A general stochastic maximum principle for SDEs of mean–field type. Appl. Math. Optim. 64(2):197–216.CrossrefGoogle Scholar
  • [15] Budhiraja A, Dupuis P, Fischer M (2012) Large deviation properties of weakly interacting processes via weak convergence methods. Ann. Probab. 40(1):74–102.CrossrefGoogle Scholar
  • [16] Carmona R, Delarue F (2015) Forward–backward stochastic differential equations and controlled McKean–Vlasov dynamics. Ann. Probab. 43(5):2647–2700.CrossrefGoogle Scholar
  • [17] Carmona R, Delarue F, Lachapelle A (2013) Control of McKean–Vlasov dynamics vs. mean field games. Math. Financial Econom. 7(2):131–166.CrossrefGoogle Scholar
  • [18] Carmona R, Delarue F, Lacker D (2016) Mean field games with common noise. Ann. Probab. 44(6):3740–3803.CrossrefGoogle Scholar
  • [19] Chala A (2014) The relaxed optimal control problem for mean–field SDEs systems and application. Automatica J. IFAC 50(3):924–930.CrossrefGoogle Scholar
  • [20] Dellacherie C, Meyer P-A (1978) Probabilities and Potential. Mathematics Studies, vol. 29 (North–Holland).Google Scholar
  • [21] Djete M (2020) Some results on the McKean–Vlasov optimal control and mean field games: Limit theorems, dynamic programming principle and numerical approximations. Unpublished PhD thesis, Université Paris Dauphine PSL, France.Google Scholar
  • [22] Djete M, Possamaï D, Tan X (2019) McKean–Vlasov optimal control: The dynamic programming principle. Preprint, submitted July 20, https://arxiv.org/abs/1907.08860.Google Scholar
  • [23] El Karoui N, Méléard S (1990) Martingale measures and stochastic calculus. Probab. Theory Related Fields 84(1):83–101.CrossrefGoogle Scholar
  • [24] El Karoui N, Tan X (2013) Capacities, measurable selection and dynamic programming part II: Application in stochastic control problems. Preprint, submitted October 12, https://arxiv.org/abs/1310.3364.Google Scholar
  • [25] El Karoui N, Huu Nguyen D, Jeanblanc-Picqué M (1987) Compactification methods in the control of degenerate diffusions: Existence of an optimal control. Stochastics 20(3):169–219.CrossrefGoogle Scholar
  • [26] Élie R, Mastrolia T, Possamaï D (2019) A tale of a principal and many many agents. Math. Oper. Res. 44(2):440–467.LinkGoogle Scholar
  • [27] Élie R, Hubert E, Mastrolia T, Possamaï D (2021) Mean–field moral hazard for optimal energy demand response management. Math. Finance 31(1):399–473.CrossrefGoogle Scholar
  • [28] Filippov A (1962) On certain questions in the theory of optimal control. J. Soc. Indust. Appl. Math. Ser. A Control 1(1):76–84.CrossrefGoogle Scholar
  • [29] Fischer M, Livieri G (2016) Continuous time mean–variance portfolio optimization through the mean field approach. ESAIM Probab. Statist. 20:30–44.CrossrefGoogle Scholar
  • [30] Gärtner J (1988) On the McKean–Vlasov limit for interacting diffusions. Mathematische Nachrichten 137(1):197–248.CrossrefGoogle Scholar
  • [31] Graber P (2016) Linear quadratic mean field type control and mean field games with common noise, with application to production of an exhaustible resource. Appl. Math. Optim. 74(3):459–486.CrossrefGoogle Scholar
  • [32] Graham C, Méléard S (1997) Stochastic particle approximations for generalized Boltzmann models and convergence estimates. Ann. Probab. 25(1):115–132.CrossrefGoogle Scholar
  • [33] Haussmann U, Lepeltier J-P (1990) On the existence of optimal controls. SIAM J. Control Optim. 28(4):851–902.CrossrefGoogle Scholar
  • [34] Hernández C, Possamaï D (2020) Me, myself and I: A general theory of non–Markovian time–inconsistent stochastic control for sophisticated agents. Preprint, submitted February 28, https://arxiv.org/abs/2002.12572.Google Scholar
  • [35] Huang M, Caines P, Malhamé R (2003) Individual and mass behaviour in large population stochastic wireless power control problems: Centralized and Nash equilibrium solutions. Abdallah C, Lewis F, eds. Proc. 42nd IEEE Conf. Decision Control (IEEE), 98–103.Google Scholar
  • [36] Huang M, Caines P, Malhamé R (2007) An invariance principle in large population stochastic dynamic games. J. Systems Sci. Complexity 20(2):162–172.CrossrefGoogle Scholar
  • [37] Huang M, Caines P, Malhamé R (2007) Large–population cost–coupled LQG problems with nonuniform agents: Individual–mass behavior and decentralized ε–Nash equilibria. IEEE Trans. Automatic Control 52(9):1560–1571.CrossrefGoogle Scholar
  • [38] Huang M, Caines P, Malhamé R (2007) The Nash certainty equivalence principle and McKean–Vlasov systems: An invariance principle and entry adaptation. Castanon D, Spall J, eds. 46th IEEE Conf. Decision Control (IEEE), 121–126.Google Scholar
  • [39] Huang M, Malhamé R, Caines P (2006) Large population stochastic dynamic games: closed–loop McKean–Vlasov systems and the Nash certainty equivalence principle. Comm. Inform. Systems 6(3):221–252.CrossrefGoogle Scholar
  • [40] Jacod J (1985) Grossissement initial, hypothèse (H′) et théorème de Girsanov. Jeulin T, Yor M, eds. Grossissements de Filtrations: Exemples et Applications. Lecture Notes in Mathematics, vol. 1118 (Springer–Verlag, Berlin, Heidelberg), 15–35.CrossrefGoogle Scholar
  • [41] Jourdain B, Méléard S (1998) Propagation of chaos and fluctuations for a moderate model with smooth initial data. Annales de l’institut Henri Poincaré, Probabilités et Statistiques (B) 34(6):727–766.CrossrefGoogle Scholar
  • [42] Jourdain B, Reygner J (2013) Propagation of chaos for rank–based interacting diffusions and long time behaviour of a scalar quasilinear parabolic equation. Stochastic Partial Differential Equations Anal. Comput. 1(3):455–506.CrossrefGoogle Scholar
  • [43] Kac J (1956) Foundations of kinetic theory. Neyman J, ed. Proc. Third Berkeley Sympos. Math. Statistics Probab., Vol. 3: Contributions Astronomy Phys. (University of California Press), 171–197.CrossrefGoogle Scholar
  • [44] Kurtz TG (2014) Weak and strong solutions of general stochastic models. Electronic Comm. Probab. 19(58):1–16.Google Scholar
  • [45] Lacker D (2015) Mean field games via controlled martingale problems: Existence of Markovian equilibria. Stochastic Processes Their Appl. 125(7):2856–2894.CrossrefGoogle Scholar
  • [46] Lacker D (2016) A general characterization of the mean field limit for stochastic differential games. Probab. Theory Related Fields 165(3–4):581–648.CrossrefGoogle Scholar
  • [47] Lacker D (2017) Limit theory for controlled McKean–Vlasov dynamics. SIAM J. Control Optim. 55(3):1641–1672.CrossrefGoogle Scholar
  • [48] Lacker D, Shkolnikov M, Zhang J (2020) Superposition and mimicking theorems for conditional McKean–Vlasov equations. Preprint, submitted March 20, https://arxiv.org/abs/2004.00099.Google Scholar
  • [49] Lasry J-M, Lions P-L (2006) Jeux à champ moyen. I–Le cas stationnaire. Comptes Rendus Mathématique 343(9):619–625.CrossrefGoogle Scholar
  • [50] Lasry J-M, Lions P-L (2006) Jeux à champ moyen. II–Horizon fini et contrôle optimal. Comptes Rendus Mathématique 343(10):679–684.CrossrefGoogle Scholar
  • [51] Lasry J-M, Lions P-L (2007) Mean field games. Japanese J. Math. 2(1):229–260.CrossrefGoogle Scholar
  • [52] Lauriére M, Pironneau O (2014) Dynamic programming for mean–field type control. Comptes Rendus Mathématique 352(9):707–713.CrossrefGoogle Scholar
  • [53] Lions P-L Théorie des jeux de champ moyen et applications. Cours du Collège de France. http://www.college-de-france.fr/default/EN/all/equder/audiovideo.jsp, 2006–2012.Google Scholar
  • [54] Liptser R, Shiryaev A (1977) Statistics of Random Processes (Springer–Verlag).CrossrefGoogle Scholar
  • [55] McKean H Jr (1969) Propagation of Chaos for a Class of Non–linear Parabolic Equations. Lecture Series on Differential Equations, Session 7: Stochastic Differential Equations (Fort Belvoir Defense Technical Information Center), 41–57.Google Scholar
  • [56] Méléard S (1992) Martingale measure approximation, application to the control of diffusions. Prépublication du laboratoire de probabilités, université Paris VI.Google Scholar
  • [57] Méléard S (1992) Representation and approximation of martingale measures. Rozovskii B, Sowers R, eds. Stochastic Partial Differential Equations Their Appl. Proc. IFIP WG 7/1 Internat. Conf. (Springer–Verlag, Berlin, Heidelberg, New York).CrossrefGoogle Scholar
  • [58] Méléard S (1996) Asymptotic behaviour of some interacting particle systems: McKean–Vlasov and Boltzmann models. Talay D, Tubaro L, eds. Probabilistic Models for Nonlinear Partial Differential Equations. Lecture Notes in Mathematics, vol. 1627 (Springer–Verlag, Berlin, Heidelberg), 42–95.CrossrefGoogle Scholar
  • [59] Méléard S, Roelly-Coppoletta S (1987) A propagation of chaos result for a system of particles with moderate interaction. Stochastic Processes Their Appl. 26:317–332.CrossrefGoogle Scholar
  • [60] Neufeld A, Nutz M (2014) Measurability of semimartingale characteristics with respect to the probability law. Stochastic Processes Their Appl. 124(11):3819–3845.CrossrefGoogle Scholar
  • [61] Oelschläger K (1984) A martingale approach to the law of large numbers for weakly interacting stochastic processes. Ann. Probab. 12(2):458–479.CrossrefGoogle Scholar
  • [62] Oelschläger K (1985) A law of large numbers for moderately interacting diffusion processes. Zeitschrift Wahrscheinlichkeitstheorie Verwandte Gebiete 69(2):279–322.CrossrefGoogle Scholar
  • [63] Pham H (2016) Linear quadratic optimal control of conditional McKean–Vlasov equation with random coefficients and applications. Probab. Uncertainty Quant. Risk 1(7):1–26.Google Scholar
  • [64] Pham H, Wei X (2017) Dynamic programming for optimal control of stochastic McKean–Vlasov dynamics. SIAM J. Control Optim. 55(2):1069–1101.CrossrefGoogle Scholar
  • [65] Pham H, Wei X (2018) Bellman equation and viscosity solutions for mean–field stochastic control problem. ESAIM Control Optim. Calculus Variations 24(1):437–461.CrossrefGoogle Scholar
  • [66] Possamaï D, Tan X, Zhou C (2018) Stochastic control for a class of nonlinear kernels and applications. Ann. Probab. 46(1):551–603.CrossrefGoogle Scholar
  • [67] Shkolnikov M (2012) Large systems of diffusions interacting through their ranks. Stochastic Processes Their Appl. 122(4):1730–1747.CrossrefGoogle Scholar
  • [68] Snitzman A-S (1991) Topics in propagation of chaos. Hennequin P, ed. École d’été de probabilités de Saint–Flour XIX – 1989. Lecture Notes in Mathematics, number 1464 (Springer, Berlin, Heidelberg), 165–251.Google Scholar
  • [69] Stroock D, Varadhan S (1997) Multidimensional Diffusion Processes. Grundlehren der Mathematischen Wissenschaften, vol. 233 (Springer–Verlag, Berlin, Heidelberg).Google Scholar
  • [70] Zalashko A (2017) Causal optimal transport: theory and applications. Unpublished PhD thesis, Universität Wien, Austria.Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.