Optimality of Independently Randomized Symmetric Policies for Exchangeable Stochastic Teams with Infinitely Many Decision Makers

Sina Sanjari
Corresponding Author
Sina Sanjari
[email protected]
https://orcid.org/0000-0002-6409-7994
Department of Mathematics and Statistics, Queen’s University, Kingston, Ontario K7L 3N6, Canada;
Search for more papers by this author
,
Naci Saldi
Naci Saldi
[email protected]
https://orcid.org/0000-0002-2677-7366
Department of Mathematics, Bilkent University, 06800 Ankara, Turkey
Search for more papers by this author
,
Serdar Yüksel
Serdar Yüksel
[email protected]
https://orcid.org/0000-0001-6099-5001
Department of Mathematics and Statistics, Queen’s University, Kingston, Ontario K7L 3N6, Canada;
Search for more papers by this author

Sina Sanjari

Corresponding Author

Sina Sanjari

[email protected]

https://orcid.org/0000-0002-6409-7994

Department of Mathematics and Statistics, Queen’s University, Kingston, Ontario K7L 3N6, Canada;

Search for more papers by this author

Naci Saldi

[email protected]

https://orcid.org/0000-0002-2677-7366

Department of Mathematics, Bilkent University, 06800 Ankara, Turkey

Search for more papers by this author

Serdar Yüksel

[email protected]

https://orcid.org/0000-0001-6099-5001

Department of Mathematics and Statistics, Queen’s University, Kingston, Ontario K7L 3N6, Canada;

Search for more papers by this author

Published Online:24 Aug 2022https://doi.org/10.1287/moor.2022.1296

References

[1] Aldous DJ, Ibragimov IA, Jacod J (1985) Ecole d’Ete de Probabilites de Saint-Flour XIII, 1983, vol. 1117 (Springer, Berlin).Google Scholar
[2] Aliprantis CD, Border KC (2006) Infinite Dimensional Analysis: A Hitchhiker’s Guide, 3rd ed. (Springer, Berlin).Google Scholar
[3] Arabneydi J, Mahajan A (2015) Team-optimal solution of finite number of mean-field coupled LQG subsystems. Proc. 54th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 5308–5313.Google Scholar
[4] Arapostathis A, Biswas A, Carroll J (2017) On solutions of mean field games with ergodic cost. J. Math. Pures Appliquées 107(2):205–251.Crossref, Google Scholar
[5] Arrow KJ, Radner R (1979) Allocation of resources in large teams. Econometrica 47(2):361–385.Crossref, Google Scholar
[6] Banica T, Curran S, Speicher R (2012) De Finetti theorems for easy quantum groups. Ann. Probab. 40(1):401–435.Crossref, Google Scholar
[7] Bardi M, Fischer M (2019) On non-uniqueness and uniqueness of solutions in finite-horizon mean field games. ESAIM Control Optim. Calculus Variations 25:44.Crossref, Google Scholar
[8] Bardi M, Priuli FS (2014) Linear-quadratic N-person and mean-field games with ergodic cost. SIAM J. Control Optim. 52(5):3022–3052.Crossref, Google Scholar
[9] Bayraktar E, Zhang X (2020) On non-uniqueness in mean field games. Proc. Amer. Math. Soc. 148(9):4091–4106.Crossref, Google Scholar
[10] Beckmann MJ (1958) Decision and team problems in airline reservations. Econometrica 26(1):134–145.Crossref, Google Scholar
[11] Beneš VE (1971) Existence of optimal stochastic control laws. SIAM J. Control 9(3):446–472.Crossref, Google Scholar
[12] Bertsekas DP, Shreve S (1978) Stochastic Optimal Control: The Discrete Time Case (Academic Press, New York).Google Scholar
[13] Blackwell D (1964) Memoryless strategies in finite-stage dynamic programming. Ann. Math. Statist. 35:863–865.Crossref, Google Scholar
[14] Borkar V (2000) Average cost dynamic programming equations for controlled Markov chains with partial observations. SIAM J. Control Optim. 39(3):673–681.Crossref, Google Scholar
[15] Borkar V (2007) Dynamic programming for ergodic control of Markov chains under partial observations: A correction. SIAM J. Control Optim. 45(6):2299–2304.Crossref, Google Scholar
[16] Borkar VS (1993) White-noise representations in stochastic realization theory. SIAM J. Control Optim. 31:1093–1102.Crossref, Google Scholar
[17] Brandao FGSL, Harrow AW (2017) Quantum de Finetti theorems under local measurements with applications. Comm. Math. Phys. 353(2):469–506.Crossref, Google Scholar
[18] Brunner N, Cavalcanti D, Pironio S, Scarani V, Wehner S (2014) Bell nonlocality. Rev. Modern Phys. 86(2):419–478.Crossref, Google Scholar
[19] Caines P, Huang M, Malhamé R (2017) Mean field games. Başar T, Zaccour G, eds. Handbook of Dynamic Game Theory (Springer, Cham), 345–372.Crossref, Google Scholar
[20] Campi L, Fischer M (2022) Correlated equilibria and mean field games: A simple model. Math. Oper. Res., ePub ahead of print, February 10, https://doi.org/10.1287/moor.2021.1206.Link, Google Scholar
[21] Cardaliaguet P (2011) Notes on mean field games. (from P.-L. Lions’ lectures at College de France). Lecture notes, April–May 2010, Tor Vergata, Rome.Google Scholar
[22] Cardaliaguet P, Rainer C (2020) An example of multiple mean field limits in ergodic differential games. Nonlinear Differential Equations Appl. 27:25.Crossref, Google Scholar
[23] Cardaliaguet P, Delarue F, Lasry J, Lions P (2019) The Master Equation and the Convergence Problem in Mean Field Games. Annals of Mathematics Studies, vol. 201 (Princeton University Press, Princeton, NJ).Google Scholar
[24] Carmona R, Delarue F (2018) Probabilistic Theory of Mean Field Games with Applications, 2 vols. (Springer, Cham, Switzerland).Google Scholar
[25] Carmona R, Delarue F, Lacker D (2016) Mean field games with common noise. Ann. Probab. 44(6):3740–3803.Crossref, Google Scholar
[26] Caves CM, Fuchs CA, Schack R (2002) Unknown quantum states: The quantum de Finetti representation. J. Math. Phys. 43(9):4537–4559.Crossref, Google Scholar
[27] Cecchin A (2021) Finite state N-agent and mean field control problems. ESAIM Control Optim. Calculus Variations 27:31.Crossref, Google Scholar
[28] Cecchin A, Pra OD, Fischer M, Pelino G (2019) On the convergence problem in mean field games: A two state model without uniqueness. SIAM J. Control Optim. 57(4):2443–2466.Crossref, Google Scholar
[29] Charalambous CD (2016) Decentralized optimality conditions of stochastic differential decision problems via Girsanov’s measure transformation. Math. Control Signals Systems 28(3):1–55.Crossref, Google Scholar
[30] Christandl M, Toner B (2009) Finite de Finetti theorem for conditional probability distributions describing physical theories. J. Math. Phys. 50(4):042104.Crossref, Google Scholar
[31] Davison E, Rau N, Palmay F (1973) The optimal decentralized control of a power system consisting of a number of interconnected synchronous machines. Internat. J. Control 18(6):1313–1328.Crossref, Google Scholar
[32] Delarue F, Tchuendom R (2020) Selection of equilibria in a linear quadratic mean-field game. Stochastic Processes Their Appl. 130(2):1000–1040.Crossref, Google Scholar
[33] Diaconis P, Freedman D (1980) Finite exchangeable sequences. Ann. Probab. 8(4):745–764.Crossref, Google Scholar
[34] Filippov A (1962) On certain questions in the theory of optimal control. J. Soc. Indust. Appl. Math., Ser. A. Control 1(1):76–84.Crossref, Google Scholar
[35] Fischer M (2017) On the connection between symmetric N-player games and mean field games. Ann. Appl. Probab. 27(2):757–810.Crossref, Google Scholar
[36] Girsanov IV (1960) On transforming a certain class of stochastic processes by absolutely continuous substitution of measures. Theory Probab. Appl. 5(3):285–301.Crossref, Google Scholar
[37] Gupta A, Yüksel S, Başar T, Langbort C (2015) On the existence of optimal policies for a class of static and sequential dynamic teams. SIAM J. Control Optim. 53(3):1681–1712.Crossref, Google Scholar
[38] Hajek B, Livesay M (2019) On non-unique solutions in mean field games. Proc. 58th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 1219–1224.Google Scholar
[39] Hernández-Lerma O, Lasserre JB (1996) Discrete-Time Markov Control Processes: Basic Optimality Criteria (Springer, New York).Crossref, Google Scholar
[40] Hespanha J, Naghshtabrizi P, Xu Y (2007) A survey of recent results in networked control systems. Proc. IEEE 95(1):138–162.Crossref, Google Scholar
[41] Hewitt E, Savage LJ (1955) Symmetric measures on Cartesian products. Trans. Amer. Math. Soc. 80(2):470–501.Crossref, Google Scholar
[42] Ho Y (1980) Team decision theory and information structures. Proc. IEEE 68(6):644–654.Crossref, Google Scholar
[43] Ho YC, Chu KC (1972) Team decision theory and information structures in optimal control problems—Part I. IEEE Trans. Automatic Control 17(1):15–22.Crossref, Google Scholar
[44] Huang M, Nguyen SL (2016) Linear-quadratic mean field teams with a major agent. Proc. 55th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 6958–6963.Google Scholar
[45] Huang M, Caines PE, Malhamé RP (2006) Large population stochastic dynamic games: Closed-loop Mckean–Vlasov systems and the Nash certainty equivalence principle. Comm. Inform. Systems 6(3):221–251.Crossref, Google Scholar
[46] Huang M, Caines PE, Malhamé RP (2007) Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ϵ-Nash equilibria. IEEE Trans. Automatic Control 52(9):1560–1571.Crossref, Google Scholar
[47] Huang M, Caines PE, Malhamé RP (2012) Social optima in mean field LQG control: Centralized and decentralized strategies. IEEE Trans. Automatic Control 57(7):1736–1751.Crossref, Google Scholar
[48] Jovanovic B, Rosenthal RW (1988) Anonymous sequential games. J. Math. Econom. 17(1):77–87.Crossref, Google Scholar
[49] Kallenberg O (1973) Canonical representations and convergence criteria for processes with interchangeable increments. Z. Wahrscheinlichkeitstheorie verw. Gebiete 27(1):23–36.Crossref, Google Scholar
[50] Kallenberg O (2006) Probabilistic Symmetries and Invariance Principles (Springer, New York).Google Scholar
[51] Kingman JFC (1978) Uses of exchangeability. Ann. Probab. 6(2):183–197.Crossref, Google Scholar
[52] Krainak JC, Speyer JL, Marcus SI (1982) Static team problems—Part I: Sufficient conditions and the exponential cost criterion. IEEE Trans. Automatic Control 27:839–848.Crossref, Google Scholar
[53] Lacker D (2015) Mean field games via controlled martingale problems: Existence of Markovian equilibria. Stochastic Processes Their Appl. 125(7):2856–2894.Crossref, Google Scholar
[54] Lacker D (2016) A general characterization of the mean field limit for stochastic differential games. Probab. Theory Related Fields 165(3–4):581–648.Crossref, Google Scholar
[55] Lacker D (2017) Limit theory for controlled Mckean–Vlasov dynamics. SIAM J. Control Optim. 55(3):1641–1672.Crossref, Google Scholar
[56] Lacker D (2020) On the convergence of closed-loop Nash equilibria to the mean field game limit. Ann. Appl. Probab. 30(4):1693–1761.Crossref, Google Scholar
[57] Lasry JM, Lions PL (2007) Mean field games. Japanese J. Math. 2:229–260.Crossref, Google Scholar
[58] Light B, Weintraub GY (2022) Mean field equilibrium: Uniqueness, existence, and comparative statics. Oper. Res. 70(1):585–605.Google Scholar
[59] Mahajan A, Martins NC, Yüksel S (2013) Static LQG teams with countably infinite players. Proc. 52nd IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 6765–6770.Google Scholar
[60] Mahajan A, Martins N, Rotkowitz M, Yüksel S (2012) Information structures in optimal decentralized control. Proc. 51st IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 1291–1306.Google Scholar
[61] Marschak J (1955) Elements for a theory of teams. Management Sci. 1(2):127–137.Link, Google Scholar
[62] Mas-Colell A (1984) On a theorem of Schmeidler. J. Math. Econom. 13(3):201–206.Crossref, Google Scholar
[63] McGuire CB (1961) Some team models of a sales organization. Management Sci. 7(2):101–130.Link, Google Scholar
[64] Popescu S (2014) Nonlocality beyond quantum mechanics. Nature Phys. 10(4):264–270.Crossref, Google Scholar
[65] Radner R (1962) Team decision problems. Ann. Math. Statist. 33(3):857–881.Crossref, Google Scholar
[66] Renner R (2007) Symmetry of large physical systems implies independence of subsystems. Nature Phys. 3(9):645–649.Crossref, Google Scholar
[67] Saldi N (2019) A topology for team policies and existence of optimal team policies in stochastic team theory. IEEE Trans. Automatic Control 65(1):310–317.Crossref, Google Scholar
[68] Sandell N, Varaiya P, Athans M, Safonov M (1978) Survey of decentralized control methods for large scale systems. IEEE Trans. Automatic Control 23(2):108–128.Crossref, Google Scholar
[69] Sanjari S, Yüksel S (2021a) Optimal policies for convex symmetric stochastic dynamic teams and their mean-field limit. SIAM J. Control Optim. 59(2):777–804.Crossref, Google Scholar
[70] Sanjari S, Yüksel S (2021b) Optimal solutions to infinite-player stochastic teams and mean-field teams. IEEE Trans. Automatic Control 66(3):1071–1086.Crossref, Google Scholar
[71] Sanjari S, Saldi N, Yüksel S (2020) Optimality of independently randomized symmetric policies for exchangeable stochastic teams with infinitely many decision makers. Preprint, submitted August 26, https://arxiv.org/abs/2008.11570.Google Scholar
[72] Schäl M (1975) Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal. Z. Wahrscheinlichkeitstheorie verw. Gebiete 32:179–296.Crossref, Google Scholar
[73] Schmeidler D (1973) Equilibrium points of nonatomic games. J. Statist. Phys. 7(4):295–300.Crossref, Google Scholar
[74] Serfozo R (1982) Convergence of Lebesgue integrals with varying measures. Sankhyā: Indian J. Statist. Ser. A. 44(3):380–402.Google Scholar
[75] Tsitsiklis JN (1988) Decentralized detection by a large number of sensors. Math. Control Signals Systems 1(2):167–182.Crossref, Google Scholar
[76] Wang BC, Zhang JF (2017) Social optima in mean field linear-quadratic-Gaussian models with Markov jump parameters. SIAM J. Control Optim. 55(1):429–456.Crossref, Google Scholar
[77] Witsenhausen H (1968) A counterexample in stochastic optimal control. SIAM J. Control Optim. 6:131–147.Crossref, Google Scholar
[78] Witsenhausen H (1988) Equivalent stochastic control problems. Math. Control Signals Systems 1(1):3–11.Crossref, Google Scholar
[79] Witsenhausen HS (1975) The intrinsic model for discrete stochastic control: Some open problems. Bensoussan A, Lions JL, eds. Control Theory, Numerical Methods and Computer Systems Modelling. Lecture Notes in Economics and Mathematical Systems, vol. 107 (Springer, Berlin), 322–335.Crossref, Google Scholar
[80] Young L (1937) Generalized curves and the existence of an attained absolute minimum in the calculus of variations. Comptes Rendus de la Societe des Sci. et des Lettres de Varsovie 30:212–234.Google Scholar
[81] Yu X, Zhang Y, Zhou Z (2021) Teamwise mean field competitions. Appl. Math. Optim. 84:903–942.Crossref, Google Scholar
[82] Yüksel S (2017) On stochastic stability of a class of non-Markovian processes and applications in quantization. SIAM J. Control Optim. 55(2):1241–1260.Crossref, Google Scholar
[83] Yüksel S (2020) A universal dynamic program and refined existence results for decentralized stochastic control. SIAM J. Control Optim. 58(5):2711–2739.Crossref, Google Scholar
[84] Yüksel S, Başar T (2013) Stochastic Networked Control Systems: Stabilization and Optimization under Information Constraints (Springer, New York).Crossref, Google Scholar
[85] Yüksel S, Saldi N (2017) Convex analysis in decentralized stochastic control, strategic measures and optimal solutions. SIAM J. Control Optim. 55(1):1–28.Crossref, Google Scholar

cover image Mathematics of Operations Research

Volume 48, Issue 3

August 2023

Pages 1213-1809, C2

Article Information

Metrics

Information

Received:October 23, 2020
Accepted:June 05, 2022
Published Online:August 24, 2022

Cite as

Sina Sanjari, Naci Saldi, Serdar Yüksel (2022) Optimality of Independently Randomized Symmetric Policies for Exchangeable Stochastic Teams with Infinitely Many Decision Makers. Mathematics of Operations Research 48(3):1254-1285.

https://doi.org/10.1287/moor.2022.1296

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Optimality of Independently Randomized Symmetric Policies for Exchangeable Stochastic Teams with Infinitely Many Decision Makers

References

Volume 48, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News