McKean–Vlasov Optimal Control: Limit Theory and Equivalence Between Different Formulations
Published Online: 14 Feb 2022 https://doi.org/10.1287/moor.2021.1232
References
- [1] (2019) Extended mean field control problems: Stochastic maximum principle and transport perspective. SIAM J. Control Optim. 57(6):3666–3693.
- [2] (2011) A maximum principle for SDEs of mean–field type. Appl. Math. Optim. 63(3):341–356.
- [3] (2014) Existence of optimal controls for systems governed by mean–field stochastic differential equations. Afrika Statistika 9(1):627–645.
- [4] (2017) Existence and optimality conditions for relaxed mean–field stochastic control problems. Systems Control Lett. 102:1–8.
- [5] (2018) On the relaxed mean–field stochastic control problem. Stochastics Dynamics 18(3):1850024.
- [6] (2020) Stability of McKean–Vlasov stochastic differential equations and applications. Stochastics Dynamics 20(1):2050007.
- [7] (2019) A class of finite–dimensional numerically solvable McKean–Vlasov control problems. ESAIM Proc. Surveys 65:114–144.
- [8] (2018) Randomization method and backward SDEs for optimal control of partially observed path–dependent stochastic systems. Ann. Appl. Probab. 28(3):1634–1678.
- [9] (2019) A weak martingale approach to linear–quadratic McKean–Vlasov stochastic control problems. J. Optim. Theory Appl. 181(2):347–382.
- [10] (2018) Randomized dynamic programming principle and Feynman–Kac representation for optimal control of McKean–Vlasov dynamics. Trans. Amer. Math. Soc. 370(3):2115–2160.
- [11] (2015) The master equation in mean field theory. J. Mathématiques Pures Appliquées 103(6):1441–1474.
- [12] (2014) A theory of Markovian time–inconsistent stochastic control in discrete time. Finance Stochastics 18(3):545–592.
- [13] (2017) On time–inconsistent stochastic control in continuous time. Finance Stochastics 21(2):331–360.
- [14] (2011) A general stochastic maximum principle for SDEs of mean–field type. Appl. Math. Optim. 64(2):197–216.
- [15] (2012) Large deviation properties of weakly interacting processes via weak convergence methods. Ann. Probab. 40(1):74–102.
- [16] (2015) Forward–backward stochastic differential equations and controlled McKean–Vlasov dynamics. Ann. Probab. 43(5):2647–2700.
- [17] (2013) Control of McKean–Vlasov dynamics vs. mean field games. Math. Financial Econom. 7(2):131–166.
- [18] (2016) Mean field games with common noise. Ann. Probab. 44(6):3740–3803.
- [19] (2014) The relaxed optimal control problem for mean–field SDEs systems and application. Automatica J. IFAC 50(3):924–930.
- [20] (1978) Probabilities and Potential. Mathematics Studies, vol. 29 (North–Holland).
- [21] (2020) Some results on the McKean–Vlasov optimal control and mean field games: Limit theorems, dynamic programming principle and numerical approximations. Unpublished PhD thesis, Université Paris Dauphine PSL, France.
- [22] (2019) McKean–Vlasov optimal control: The dynamic programming principle. Preprint, submitted July 20, https://arxiv.org/abs/1907.08860.
- [23] (1990) Martingale measures and stochastic calculus. Probab. Theory Related Fields 84(1):83–101.
- [24] (2013) Capacities, measurable selection and dynamic programming part II: Application in stochastic control problems. Preprint, submitted October 12, https://arxiv.org/abs/1310.3364.
- [25] (1987) Compactification methods in the control of degenerate diffusions: Existence of an optimal control. Stochastics 20(3):169–219.
- [26] (2019) A tale of a principal and many many agents. Math. Oper. Res. 44(2):440–467.
- [27] (2021) Mean–field moral hazard for optimal energy demand response management. Math. Finance 31(1):399–473.
- [28] (1962) On certain questions in the theory of optimal control. J. Soc. Indust. Appl. Math. Ser. A Control 1(1):76–84.
- [29] (2016) Continuous time mean–variance portfolio optimization through the mean field approach. ESAIM Probab. Statist. 20:30–44.
- [30] (1988) On the McKean–Vlasov limit for interacting diffusions. Mathematische Nachrichten 137(1):197–248.
- [31] (2016) Linear quadratic mean field type control and mean field games with common noise, with application to production of an exhaustible resource. Appl. Math. Optim. 74(3):459–486.
- [32] (1997) Stochastic particle approximations for generalized Boltzmann models and convergence estimates. Ann. Probab. 25(1):115–132.
- [33] (1990) On the existence of optimal controls. SIAM J. Control Optim. 28(4):851–902.
- [34] (2020) Me, myself and I: A general theory of non–Markovian time–inconsistent stochastic control for sophisticated agents. Preprint, submitted February 28, https://arxiv.org/abs/2002.12572.
- [35] (2003) Individual and mass behaviour in large population stochastic wireless power control problems: Centralized and Nash equilibrium solutions. Abdallah C, Lewis F, eds. Proc. 42nd IEEE Conf. Decision Control (IEEE), 98–103.
- [36] (2007) An invariance principle in large population stochastic dynamic games. J. Systems Sci. Complexity 20(2):162–172.
- [37] (2007) Large–population cost–coupled LQG problems with nonuniform agents: Individual–mass behavior and decentralized ε–Nash equilibria. IEEE Trans. Automatic Control 52(9):1560–1571.
- [38] (2007) The Nash certainty equivalence principle and McKean–Vlasov systems: An invariance principle and entry adaptation. Castanon D, Spall J, eds. 46th IEEE Conf. Decision Control (IEEE), 121–126.
- [39] (2006) Large population stochastic dynamic games: closed–loop McKean–Vlasov systems and the Nash certainty equivalence principle. Comm. Inform. Systems 6(3):221–252.
- [40] (1985) Grossissement initial, hypothèse (H′) et théorème de Girsanov. Jeulin T, Yor M, eds. Grossissements de Filtrations: Exemples et Applications. Lecture Notes in Mathematics, vol. 1118 (Springer–Verlag, Berlin, Heidelberg), 15–35.
- [41] (1998) Propagation of chaos and fluctuations for a moderate model with smooth initial data. Annales de l’institut Henri Poincaré, Probabilités et Statistiques (B) 34(6):727–766.
- [42] (2013) Propagation of chaos for rank–based interacting diffusions and long time behaviour of a scalar quasilinear parabolic equation. Stochastic Partial Differential Equations Anal. Comput. 1(3):455–506.
- [43] (1956) Foundations of kinetic theory. Neyman J, ed. Proc. Third Berkeley Sympos. Math. Statistics Probab., Vol. 3: Contributions Astronomy Phys. (University of California Press), 171–197.
- [44] (2014) Weak and strong solutions of general stochastic models. Electronic Comm. Probab. 19(58):1–16.
- [45] (2015) Mean field games via controlled martingale problems: Existence of Markovian equilibria. Stochastic Processes Their Appl. 125(7):2856–2894.
- [46] (2016) A general characterization of the mean field limit for stochastic differential games. Probab. Theory Related Fields 165(3–4):581–648.
- [47] (2017) Limit theory for controlled McKean–Vlasov dynamics. SIAM J. Control Optim. 55(3):1641–1672.
- [48] (2020) Superposition and mimicking theorems for conditional McKean–Vlasov equations. Preprint, submitted March 20, https://arxiv.org/abs/2004.00099.
- [49] (2006) Jeux à champ moyen. I–Le cas stationnaire. Comptes Rendus Mathématique 343(9):619–625.
- [50] (2006) Jeux à champ moyen. II–Horizon fini et contrôle optimal. Comptes Rendus Mathématique 343(10):679–684.
- [51] (2007) Mean field games. Japanese J. Math. 2(1):229–260.
- [52] (2014) Dynamic programming for mean–field type control. Comptes Rendus Mathématique 352(9):707–713.
- [53] Théorie des jeux de champ moyen et applications. Cours du Collège de France. http://www.college-de-france.fr/default/EN/all/equder/audiovideo.jsp, 2006–2012.
- [54] (1977) Statistics of Random Processes (Springer–Verlag).
- [55] (1969) Propagation of Chaos for a Class of Non–linear Parabolic Equations. Lecture Series on Differential Equations, Session 7: Stochastic Differential Equations (Fort Belvoir Defense Technical Information Center), 41–57.
- [56] (1992) Martingale measure approximation, application to the control of diffusions. Prépublication du laboratoire de probabilités, université Paris VI.
- [57] (1992) Representation and approximation of martingale measures. Rozovskii B, Sowers R, eds. Stochastic Partial Differential Equations Their Appl. Proc. IFIP WG 7/1 Internat. Conf. (Springer–Verlag, Berlin, Heidelberg, New York).
- [58] (1996) Asymptotic behaviour of some interacting particle systems: McKean–Vlasov and Boltzmann models. Talay D, Tubaro L, eds. Probabilistic Models for Nonlinear Partial Differential Equations. Lecture Notes in Mathematics, vol. 1627 (Springer–Verlag, Berlin, Heidelberg), 42–95.
- [59] (1987) A propagation of chaos result for a system of particles with moderate interaction. Stochastic Processes Their Appl. 26:317–332.
- [60] (2014) Measurability of semimartingale characteristics with respect to the probability law. Stochastic Processes Their Appl. 124(11):3819–3845.
- [61] (1984) A martingale approach to the law of large numbers for weakly interacting stochastic processes. Ann. Probab. 12(2):458–479.
- [62] (1985) A law of large numbers for moderately interacting diffusion processes. Zeitschrift Wahrscheinlichkeitstheorie Verwandte Gebiete 69(2):279–322.
- [63] (2016) Linear quadratic optimal control of conditional McKean–Vlasov equation with random coefficients and applications. Probab. Uncertainty Quant. Risk 1(7):1–26.
- [64] (2017) Dynamic programming for optimal control of stochastic McKean–Vlasov dynamics. SIAM J. Control Optim. 55(2):1069–1101.
- [65] (2018) Bellman equation and viscosity solutions for mean–field stochastic control problem. ESAIM Control Optim. Calculus Variations 24(1):437–461.
- [66] (2018) Stochastic control for a class of nonlinear kernels and applications. Ann. Probab. 46(1):551–603.
- [67] (2012) Large systems of diffusions interacting through their ranks. Stochastic Processes Their Appl. 122(4):1730–1747.
- [68] (1991) Topics in propagation of chaos. Hennequin P, ed. École d’été de probabilités de Saint–Flour XIX – 1989. Lecture Notes in Mathematics, number 1464 (Springer, Berlin, Heidelberg), 165–251.
- [69] (1997) Multidimensional Diffusion Processes. Grundlehren der Mathematischen Wissenschaften, vol. 233 (Springer–Verlag, Berlin, Heidelberg).
- [70] (2017) Causal optimal transport: theory and applications. Unpublished PhD thesis, Universität Wien, Austria.

