Optimality of Symmetric Independent Policies Under Decentralized Mean-Field Information Sharing for Stochastic Teams and Equivalence with McKean–Vlasov Control of a Representative Agent

Sina Sanjari (Corresponding Author)
https://orcid.org/0000-0002-6409-7994
Department of Mathematics and Computer Science, Royal Military College, Kingston, Ontario K7K 7B4, Canada

Naci Saldi
Department of Mathematics, Bilkent University, Ankara 06800, Turkey

Serdar Yüksel
https://orcid.org/0000-0001-6099-5001
Department of Mathematics and Statistics, Queen’s University, Kingston, Ontario K7L 3N6, Canada

Published Online: 12 May 2026
https://doi.org/10.1287/moor.2024.0489


Received: April 23, 2024
Accepted: March 02, 2026
Published Online: May 12, 2026

Cite as

Sina Sanjari, Naci Saldi, Serdar Yüksel (2026) Optimality of Symmetric Independent Policies Under Decentralized Mean-Field Information Sharing for Stochastic Teams and Equivalence with McKean–Vlasov Control of a Representative Agent. Mathematics of Operations Research 0(0).

https://doi.org/10.1287/moor.2024.0489
