Optimality of Symmetric Independent Policies Under Decentralized Mean-Field Information Sharing for Stochastic Teams and Equivalence with McKean−Vlasov Control of a Representative Agent
Published Online:12 May 2026https://doi.org/10.1287/moor.2024.0489
References
- [1] (2020) Mean field games and applications: Numerical aspects. Cardaliaguet P, Porretta A, eds. Mean Field Games, Lecture Notes in Mathematics, vol. 2281 (Springer, Cham, Switzerland), 249–307.Crossref, Google Scholar
- [2] (2017) Mean field control hierarchy. Appl. Math. Optim. 76(1):93–135.Crossref, Google Scholar
- [3] (2022) Moment-driven predictive control of mean-field collective dynamics. SIAM J. Control Optim. 60(2):814–841.Crossref, Google Scholar
- [4] (1985) Ecole d’Ete de Probabilites de Saint-Flour XIII, 1983, Lecture Notes in Mathematics, vol. 1117 (Springer, Berlin).Google Scholar
- [5] (2023) Learning mean-field games with discounted and average costs. J. Machine Learn. Res. 24(17):1–59.Google Scholar
- [6] (2014) Team optimal control of coupled subsystems with mean-field sharing. 53rd IEEE Conf. Decision Control, 1669–1674. Google Scholar
- [7] (2015) Team-optimal solution of finite number of mean-field coupled LQG subsystems. IEEE 54th Annual Conf. Decision Control (CDC), 5308–5313.Google Scholar
- [8] (1997) Consequences of denseness of Dirac Young measures. J. Math. Anal. Appl. 207(2):536–540.Crossref, Google Scholar
- [9] (2019) On non-uniqueness and uniqueness of solutions in finite-horizon mean field games. ESAIM COCV 25:44.Crossref, Google Scholar
- [10] (2023) Mean field Markov decision processes. Appl. Math. Optim. 88(1):12.Crossref, Google Scholar
- [11] (2020) On non-uniqueness in mean field games. Proc. Amer. Math. Soc. 148(9):4091–4106.Crossref, Google Scholar
- [12] (2025) Finite approximations for mean-field type multi-agent control and their near optimality. Appl. Math. Optim. 92(1):7.Crossref, Google Scholar
- [13] (2018) Randomized dynamic programming principle and Feynman–Kac representation for optimal control of McKean–Vlasov dynamics. Trans. Amer. Math. Soc. 370(3):2115–2160.Crossref, Google Scholar
- [14] (2018) Denseness of adapted processes among causal couplings. Preprint, submitted May 8, https://arxiv.org/abs/1805.03185.Google Scholar
- [15] (2015) The master equation in mean field theory. J. Mathématiques Pures Appliquées 103(6):1441–1474.Crossref, Google Scholar
- [16] (1988) The probabilistic structure of controlled diffusion processes. Acta Appl. Math. 11(1):19–48.Crossref, Google Scholar
- [17] (2017) Mean field games. Basar T, Zaccour G, eds. Theory Handbook of Dynamic Game Theory (Springer, Cham, Switzerland), 1–28.Crossref, Google Scholar
- [18] (2023) An algebraic convergence rate for the optimal control of McKean–Vlasov dynamics. SIAM J. Control Optim. 61(6):3341–3369.Crossref, Google Scholar
- [19] (2020) Applications of mean field games in financial engineering and economic theory. Preprint, submitted December 9, https://arxiv.org/abs/2012.05237.Google Scholar
- [20] (2015) Forward–backward stochastic differential equations and controlled McKean–Vlasov dynamics. Ann. Probab. 43(5):2647–2700.Crossref, Google Scholar
- [21] (2018) Probabilistic Theory of Mean Field Games with Applications II: Mean Field Games with Common Noise and Master Equations (Springer, Cham, Switzerland).Crossref, Google Scholar
- [22] (2016) Mean field games with common noise. Ann. Probab. 44(6):3740–3803.Crossref, Google Scholar
- [23] (2023) Model-free mean-field reinforcement learning: Mean-field MDP and mean-field Q-learning. Ann. Appl. Probab. 33(6B):5334–5381.Crossref, Google Scholar
- [24] (2022) Controlling swarms toward flocks and mills. SIAM J. Control Optim. 60(3):1863–1891.Crossref, Google Scholar
- [25] (2004) Young Measures on Topological Spaces: With Applications in Control Theory and Probability Theory, vol. 571 (Springer Science & Business Media, Dordrecht, Netherlands).Crossref, Google Scholar
- [26] (2021) Finite state N-agent and mean field control problems. ESAIM COCV 27:31.Crossref, Google Scholar
- [27] (2020) Probabilistic approach to finite state mean field games. Appl. Math. Optim. 81(2):253–300.Google Scholar
- [28] (2020) Selection of equilibria in a linear quadratic mean-field game. Stochastic Processes Appl. 130(2):1000–1040.Crossref, Google Scholar
- [29] (1980) Finite exchangeable sequences. Ann. Probab. 8(4):745–764.Crossref, Google Scholar
- [30] (2022) McKean–Vlasov optimal control: Limit theory and equivalence between different formulations. Math. Oper. Res. 47(4):2891–2930.Link, Google Scholar
- [31] (2013) Discrete time mean-field stochastic linear-quadratic optimal control problems. Automatica 49(11):3222–3233.Crossref, Google Scholar
- [32] (2017) On the connection between symmetric N-player games and mean field games. Ann. Appl. Probab. 27(2):757–810.Crossref, Google Scholar
- [33] (2019) Mean-field optimal control as gamma-limit of finite agent controls. Eur. J. Appl. Math. 30(6):1153–1186.Crossref, Google Scholar
- [34] (2019) On non-unique solutions in mean field games. 2019 IEEE 58th Conf. Decision Control (CDC), 1219–1224.Google Scholar
- [35] (1996) Discrete-Time Markov Control Processes: Basic Optimality Criteria (Springer, New York).Crossref, Google Scholar
- [36] (2007) A survey of recent results in networked control systems. Proc. IEEE 95(1):138–162.Crossref, Google Scholar
- [37] (1980) Team decision theory and information structures. Proc. IEEE 68(6):644–654.Crossref, Google Scholar
- [38] (2012) Social optima in mean field LQG control: Centralized and decentralized strategies. IEEE Trans. Automatic Control 57(7):1736–1751.Crossref, Google Scholar
- [39] (2006) Large population stochastic dynamic games: Closed-loop McKean–Vlasov systems and the Nash certainty equivalence principle. Comm. Inform. Systems 6(3):221–251.Crossref, Google Scholar
- [40] (2007) Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ϵ-Nash equilibria. IEEE Trans. Automatic Control 52(9):1560–1571.Crossref, Google Scholar
- [41] (2025) Approximately optimal distributed stochastic controls beyond the mean field setting. Ann. Appl. Probab. 35(1):251–308.Crossref, Google Scholar
- [42] (2006) Probabilistic Symmetries and Invariance Principles (Springer Science & Business Media, New York).Google Scholar
- [43] (2020) Robustness to incorrect system models in stochastic control. SIAM J. Control Optim. 58(2):1144–1182.Crossref, Google Scholar
- [44] (2023) Q-learning for MDPs with general spaces: Convergence and near optimality via quantization under weak continuity. J. Machine Learn. Res. 24(199):1–34.Google Scholar
- [45] (2015) Mean field games via controlled martingale problems: Existence of Markovian equilibria. Stochastic Processes Appl. 125(7):2856–2894.Crossref, Google Scholar
- [46] (2017) Limit theory for controlled McKean–Vlasov dynamics. SIAM J. Control Optim. 55(3):1641–1672.Crossref, Google Scholar
- [47] (2020) On the convergence of closed-loop Nash equilibria to the mean field game limit. Ann. Appl. Probab. 30(4):1693–1761.Crossref, Google Scholar
- [48] (1981) Convergence of dynamic programming models. Math. Oper. Res. 6(4):493–512.Link, Google Scholar
- [49] (2007) Mean field games. Japanese J. Math. 2(1):229–260.Crossref, Google Scholar
- [50] (2016) Dynamic programming for mean-field type control. J. Optim. Theory Appl. 169(3):902–924.Crossref, Google Scholar
- [51] (2012) Information structures in optimal decentralized control. IEEE Conf. Decision Control.Google Scholar
- [52] (1955) Elements for a theory of teams. Management Sci. 1(2):127–137.Link, Google Scholar
- [53] (1985) Distributional strategies for games with incomplete information. Math. Oper. Res. 10(4):619–632.Link, Google Scholar
- [54] (2022) Mean-field Markov decision processes with common noise and open-loop controls. Ann. Appl. Probab. 32(2):1421–1458.Crossref, Google Scholar
- [55] (2023) Quantitative propagation of chaos for mean field Markov decision process with common noise. Electronic J. Probab. 28:1–24.Crossref, Google Scholar
- [56] (2015) Discrete-time mean-field stochastic linear–quadratic optimal control problems, II: Infinite horizon case. Automatica 57:65–77.Crossref, Google Scholar
- [57] (2016) Discrete time McKean–Vlasov control problem: A dynamic programming approach. Appl. Math. Optim. 74(3):487–506.Crossref, Google Scholar
- [58] (2017) Dynamic programming for optimal control of stochastic McKean–Vlasov dynamics. SIAM J. Control Optim. 55(2):1069–1101.Crossref, Google Scholar
- [59] (1962) Team decision problems. Ann. Math. Statist. 33(3):857–881.Crossref, Google Scholar
- [60] (2018) Markov–Nash equilibria in mean-field games with discounted cost. SIAM J. Control Optim. 56(6):4256–4287.Crossref, Google Scholar
- [61] (2018) Finite Approximations in Discrete-Time Stochastic Control: Quantized Models and Asymptotic Optimality (Birkhäuser, Cham, Switzerland).Crossref, Google Scholar
- [62] (2017) On the asymptotic optimality of finite approximations to Markov decision processes with Borel spaces. Math. Oper. Res. 42(4):945–978.Link, Google Scholar
- [63] (2021) Optimal policies for convex symmetric stochastic dynamic teams and their mean-field limit. SIAM J. Control Optim. 59(2):777–804.Crossref, Google Scholar
- [64] (2021) Optimal solutions to infinite-player stochastic teams and mean-field teams. IEEE Trans. Automatic Control. 66(3):1071–1086.Crossref, Google Scholar
- [65] (2023) Optimality of independently randomized symmetric policies for exchangeable stochastic teams with infinitely many decision makers. Math. Oper. Res. 48(3):1254–1285.Link, Google Scholar
- [66] (2024) Nash equilibria for exchangeable team-against-team games, their mean-field limit, and the role of common randomness. SIAM J. Control Optim. 62(3):1437–1464.Crossref, Google Scholar
- [67] (1982) Convergence of Lebesgue integrals with varying measures. Sankhyā Indian J. Statist. Ser. A 44(3):380–402.Google Scholar
- [68] (2023) Mean-field games among teams. Preprint, submitted October 18, https://arxiv.org/abs/2310.12282.Google Scholar
- [69] (2024) A mean field game approach for a class of linear quadratic discrete choice problems with congestion avoidance. Automatica 160:111420.Crossref, Google Scholar
- [70] (1988) Decentralized detection by a large number of sensors. Math. Control. Signals Systems 1(2):167–182.Crossref, Google Scholar
- [71] (1975) The intrinsic model for discrete stochastic control: Some open problems. Bensoussan A, Lions JL, eds. Control Theory, Numerical Methods and Computer Systems Modelling, Lecture Notes in Economics and Mathematical Systems, vol. 107 (Springer, Berlin), 322–335.Crossref, Google Scholar
- [72] (2024) Mean-field games with finitely many players: Independent learning and subjectivity. J. Machine Learn. Res. 25(419):1–69.Google Scholar
- [73] (2024) On Borkar and Young relaxed control topologies and continuous dependence of invariant measures on control policy. SIAM J. Control Optim. 62(4):2367–2386.Crossref, Google Scholar
- [74] (2013) Stochastic Networked Control Systems: Stabilization and Optimization Under Information Constraints (Springer, New York).Crossref, Google Scholar
- [75] (2024) Stochastic Teams, Games, and Control Under Information Constraints (Springer, Cham, Switzerland).Crossref, Google Scholar

