Optimality of Independently Randomized Symmetric Policies for Exchangeable Stochastic Teams with Infinitely Many Decision Makers
Published Online:24 Aug 2022https://doi.org/10.1287/moor.2022.1296
References
- [1] (1985) Ecole d’Ete de Probabilites de Saint-Flour XIII, 1983, vol. 1117 (Springer, Berlin).Google Scholar
- [2] (2006) Infinite Dimensional Analysis: A Hitchhiker’s Guide, 3rd ed. (Springer, Berlin).Google Scholar
- [3] (2015) Team-optimal solution of finite number of mean-field coupled LQG subsystems. Proc. 54th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 5308–5313.Google Scholar
- [4] (2017) On solutions of mean field games with ergodic cost. J. Math. Pures Appliquées 107(2):205–251.Crossref, Google Scholar
- [5] (1979) Allocation of resources in large teams. Econometrica 47(2):361–385.Crossref, Google Scholar
- [6] (2012) De Finetti theorems for easy quantum groups. Ann. Probab. 40(1):401–435.Crossref, Google Scholar
- [7] (2019) On non-uniqueness and uniqueness of solutions in finite-horizon mean field games. ESAIM Control Optim. Calculus Variations 25:44.Crossref, Google Scholar
- [8] (2014) Linear-quadratic N-person and mean-field games with ergodic cost. SIAM J. Control Optim. 52(5):3022–3052.Crossref, Google Scholar
- [9] (2020) On non-uniqueness in mean field games. Proc. Amer. Math. Soc. 148(9):4091–4106.Crossref, Google Scholar
- [10] (1958) Decision and team problems in airline reservations. Econometrica 26(1):134–145.Crossref, Google Scholar
- [11] (1971) Existence of optimal stochastic control laws. SIAM J. Control 9(3):446–472.Crossref, Google Scholar
- [12] (1978) Stochastic Optimal Control: The Discrete Time Case (Academic Press, New York).Google Scholar
- [13] (1964) Memoryless strategies in finite-stage dynamic programming. Ann. Math. Statist. 35:863–865.Crossref, Google Scholar
- [14] (2000) Average cost dynamic programming equations for controlled Markov chains with partial observations. SIAM J. Control Optim. 39(3):673–681.Crossref, Google Scholar
- [15] (2007) Dynamic programming for ergodic control of Markov chains under partial observations: A correction. SIAM J. Control Optim. 45(6):2299–2304.Crossref, Google Scholar
- [16] (1993) White-noise representations in stochastic realization theory. SIAM J. Control Optim. 31:1093–1102.Crossref, Google Scholar
- [17] (2017) Quantum de Finetti theorems under local measurements with applications. Comm. Math. Phys. 353(2):469–506.Crossref, Google Scholar
- [18] (2014) Bell nonlocality. Rev. Modern Phys. 86(2):419–478.Crossref, Google Scholar
- [19] (2017) Mean field games. Başar T, Zaccour G, eds. Handbook of Dynamic Game Theory (Springer, Cham), 345–372.Crossref, Google Scholar
- [20] (2022) Correlated equilibria and mean field games: A simple model. Math. Oper. Res., ePub ahead of print, February 10, https://doi.org/10.1287/moor.2021.1206.Link, Google Scholar
- [21] (2011) Notes on mean field games. (from P.-L. Lions’ lectures at College de France). Lecture notes, April–May 2010, Tor Vergata, Rome.Google Scholar
- [22] (2020) An example of multiple mean field limits in ergodic differential games. Nonlinear Differential Equations Appl. 27:25.Crossref, Google Scholar
- [23] (2019) The Master Equation and the Convergence Problem in Mean Field Games. Annals of Mathematics Studies, vol. 201 (Princeton University Press, Princeton, NJ).Google Scholar
- [24] (2018) Probabilistic Theory of Mean Field Games with Applications, 2 vols. (Springer, Cham, Switzerland).Google Scholar
- [25] (2016) Mean field games with common noise. Ann. Probab. 44(6):3740–3803.Crossref, Google Scholar
- [26] (2002) Unknown quantum states: The quantum de Finetti representation. J. Math. Phys. 43(9):4537–4559.Crossref, Google Scholar
- [27] (2021) Finite state N-agent and mean field control problems. ESAIM Control Optim. Calculus Variations 27:31.Crossref, Google Scholar
- [28] (2019) On the convergence problem in mean field games: A two state model without uniqueness. SIAM J. Control Optim. 57(4):2443–2466.Crossref, Google Scholar
- [29] (2016) Decentralized optimality conditions of stochastic differential decision problems via Girsanov’s measure transformation. Math. Control Signals Systems 28(3):1–55.Crossref, Google Scholar
- [30] (2009) Finite de Finetti theorem for conditional probability distributions describing physical theories. J. Math. Phys. 50(4):042104.Crossref, Google Scholar
- [31] (1973) The optimal decentralized control of a power system consisting of a number of interconnected synchronous machines. Internat. J. Control 18(6):1313–1328.Crossref, Google Scholar
- [32] (2020) Selection of equilibria in a linear quadratic mean-field game. Stochastic Processes Their Appl. 130(2):1000–1040.Crossref, Google Scholar
- [33] (1980) Finite exchangeable sequences. Ann. Probab. 8(4):745–764.Crossref, Google Scholar
- [34] (1962) On certain questions in the theory of optimal control. J. Soc. Indust. Appl. Math., Ser. A. Control 1(1):76–84.Crossref, Google Scholar
- [35] (2017) On the connection between symmetric N-player games and mean field games. Ann. Appl. Probab. 27(2):757–810.Crossref, Google Scholar
- [36] (1960) On transforming a certain class of stochastic processes by absolutely continuous substitution of measures. Theory Probab. Appl. 5(3):285–301.Crossref, Google Scholar
- [37] (2015) On the existence of optimal policies for a class of static and sequential dynamic teams. SIAM J. Control Optim. 53(3):1681–1712.Crossref, Google Scholar
- [38] (2019) On non-unique solutions in mean field games. Proc. 58th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 1219–1224.Google Scholar
- [39] (1996) Discrete-Time Markov Control Processes: Basic Optimality Criteria (Springer, New York).Crossref, Google Scholar
- [40] (2007) A survey of recent results in networked control systems. Proc. IEEE 95(1):138–162.Crossref, Google Scholar
- [41] (1955) Symmetric measures on Cartesian products. Trans. Amer. Math. Soc. 80(2):470–501.Crossref, Google Scholar
- [42] (1980) Team decision theory and information structures. Proc. IEEE 68(6):644–654.Crossref, Google Scholar
- [43] (1972) Team decision theory and information structures in optimal control problems—Part I. IEEE Trans. Automatic Control 17(1):15–22.Crossref, Google Scholar
- [44] (2016) Linear-quadratic mean field teams with a major agent. Proc. 55th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 6958–6963.Google Scholar
- [45] (2006) Large population stochastic dynamic games: Closed-loop Mckean–Vlasov systems and the Nash certainty equivalence principle. Comm. Inform. Systems 6(3):221–251.Crossref, Google Scholar
- [46] (2007) Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ϵ-Nash equilibria. IEEE Trans. Automatic Control 52(9):1560–1571.Crossref, Google Scholar
- [47] (2012) Social optima in mean field LQG control: Centralized and decentralized strategies. IEEE Trans. Automatic Control 57(7):1736–1751.Crossref, Google Scholar
- [48] (1988) Anonymous sequential games. J. Math. Econom. 17(1):77–87.Crossref, Google Scholar
- [49] (1973) Canonical representations and convergence criteria for processes with interchangeable increments. Z. Wahrscheinlichkeitstheorie verw. Gebiete 27(1):23–36.Crossref, Google Scholar
- [50] (2006) Probabilistic Symmetries and Invariance Principles (Springer, New York).Google Scholar
- [51] (1978) Uses of exchangeability. Ann. Probab. 6(2):183–197.Crossref, Google Scholar
- [52] (1982) Static team problems—Part I: Sufficient conditions and the exponential cost criterion. IEEE Trans. Automatic Control 27:839–848.Crossref, Google Scholar
- [53] (2015) Mean field games via controlled martingale problems: Existence of Markovian equilibria. Stochastic Processes Their Appl. 125(7):2856–2894.Crossref, Google Scholar
- [54] (2016) A general characterization of the mean field limit for stochastic differential games. Probab. Theory Related Fields 165(3–4):581–648.Crossref, Google Scholar
- [55] (2017) Limit theory for controlled Mckean–Vlasov dynamics. SIAM J. Control Optim. 55(3):1641–1672.Crossref, Google Scholar
- [56] (2020) On the convergence of closed-loop Nash equilibria to the mean field game limit. Ann. Appl. Probab. 30(4):1693–1761.Crossref, Google Scholar
- [57] (2007) Mean field games. Japanese J. Math. 2:229–260.Crossref, Google Scholar
- [58] (2022) Mean field equilibrium: Uniqueness, existence, and comparative statics. Oper. Res. 70(1):585–605.Google Scholar
- [59] (2013) Static LQG teams with countably infinite players. Proc. 52nd IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 6765–6770.Google Scholar
- [60] (2012) Information structures in optimal decentralized control. Proc. 51st IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 1291–1306.Google Scholar
- [61] (1955) Elements for a theory of teams. Management Sci. 1(2):127–137.Link, Google Scholar
- [62] (1984) On a theorem of Schmeidler. J. Math. Econom. 13(3):201–206.Crossref, Google Scholar
- [63] (1961) Some team models of a sales organization. Management Sci. 7(2):101–130.Link, Google Scholar
- [64] (2014) Nonlocality beyond quantum mechanics. Nature Phys. 10(4):264–270.Crossref, Google Scholar
- [65] (1962) Team decision problems. Ann. Math. Statist. 33(3):857–881.Crossref, Google Scholar
- [66] (2007) Symmetry of large physical systems implies independence of subsystems. Nature Phys. 3(9):645–649.Crossref, Google Scholar
- [67] (2019) A topology for team policies and existence of optimal team policies in stochastic team theory. IEEE Trans. Automatic Control 65(1):310–317.Crossref, Google Scholar
- [68] (1978) Survey of decentralized control methods for large scale systems. IEEE Trans. Automatic Control 23(2):108–128.Crossref, Google Scholar
- [69] (2021a) Optimal policies for convex symmetric stochastic dynamic teams and their mean-field limit. SIAM J. Control Optim. 59(2):777–804.Crossref, Google Scholar
- [70] (2021b) Optimal solutions to infinite-player stochastic teams and mean-field teams. IEEE Trans. Automatic Control 66(3):1071–1086.Crossref, Google Scholar
- [71] (2020) Optimality of independently randomized symmetric policies for exchangeable stochastic teams with infinitely many decision makers. Preprint, submitted August 26, https://arxiv.org/abs/2008.11570.Google Scholar
- [72] (1975) Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal. Z. Wahrscheinlichkeitstheorie verw. Gebiete 32:179–296.Crossref, Google Scholar
- [73] (1973) Equilibrium points of nonatomic games. J. Statist. Phys. 7(4):295–300.Crossref, Google Scholar
- [74] (1982) Convergence of Lebesgue integrals with varying measures. Sankhyā: Indian J. Statist. Ser. A. 44(3):380–402.Google Scholar
- [75] (1988) Decentralized detection by a large number of sensors. Math. Control Signals Systems 1(2):167–182.Crossref, Google Scholar
- [76] (2017) Social optima in mean field linear-quadratic-Gaussian models with Markov jump parameters. SIAM J. Control Optim. 55(1):429–456.Crossref, Google Scholar
- [77] (1968) A counterexample in stochastic optimal control. SIAM J. Control Optim. 6:131–147.Crossref, Google Scholar
- [78] (1988) Equivalent stochastic control problems. Math. Control Signals Systems 1(1):3–11.Crossref, Google Scholar
- [79] (1975) The intrinsic model for discrete stochastic control: Some open problems. Bensoussan A, Lions JL, eds. Control Theory, Numerical Methods and Computer Systems Modelling. Lecture Notes in Economics and Mathematical Systems, vol. 107 (Springer, Berlin), 322–335.Crossref, Google Scholar
- [80] (1937) Generalized curves and the existence of an attained absolute minimum in the calculus of variations. Comptes Rendus de la Societe des Sci. et des Lettres de Varsovie 30:212–234.Google Scholar
- [81] (2021) Teamwise mean field competitions. Appl. Math. Optim. 84:903–942.Crossref, Google Scholar
- [82] (2017) On stochastic stability of a class of non-Markovian processes and applications in quantization. SIAM J. Control Optim. 55(2):1241–1260.Crossref, Google Scholar
- [83] (2020) A universal dynamic program and refined existence results for decentralized stochastic control. SIAM J. Control Optim. 58(5):2711–2739.Crossref, Google Scholar
- [84] (2013) Stochastic Networked Control Systems: Stabilization and Optimization under Information Constraints (Springer, New York).Crossref, Google Scholar
- [85] (2017) Convex analysis in decentralized stochastic control, strategic measures and optimal solutions. SIAM J. Control Optim. 55(1):1–28.Crossref, Google Scholar

