Optimality of Independently Randomized Symmetric Policies for Exchangeable Stochastic Teams with Infinitely Many Decision Makers

Published Online:https://doi.org/10.1287/moor.2022.1296

References

  • [1] Aldous DJ, Ibragimov IA, Jacod J (1985) Ecole d’Ete de Probabilites de Saint-Flour XIII, 1983, vol. 1117 (Springer, Berlin).Google Scholar
  • [2] Aliprantis CD, Border KC (2006) Infinite Dimensional Analysis: A Hitchhiker’s Guide, 3rd ed. (Springer, Berlin).Google Scholar
  • [3] Arabneydi J, Mahajan A (2015) Team-optimal solution of finite number of mean-field coupled LQG subsystems. Proc. 54th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 5308–5313.Google Scholar
  • [4] Arapostathis A, Biswas A, Carroll J (2017) On solutions of mean field games with ergodic cost. J. Math. Pures Appliquées 107(2):205–251.CrossrefGoogle Scholar
  • [5] Arrow KJ, Radner R (1979) Allocation of resources in large teams. Econometrica 47(2):361–385.CrossrefGoogle Scholar
  • [6] Banica T, Curran S, Speicher R (2012) De Finetti theorems for easy quantum groups. Ann. Probab. 40(1):401–435.CrossrefGoogle Scholar
  • [7] Bardi M, Fischer M (2019) On non-uniqueness and uniqueness of solutions in finite-horizon mean field games. ESAIM Control Optim. Calculus Variations 25:44.CrossrefGoogle Scholar
  • [8] Bardi M, Priuli FS (2014) Linear-quadratic N-person and mean-field games with ergodic cost. SIAM J. Control Optim. 52(5):3022–3052.CrossrefGoogle Scholar
  • [9] Bayraktar E, Zhang X (2020) On non-uniqueness in mean field games. Proc. Amer. Math. Soc. 148(9):4091–4106.CrossrefGoogle Scholar
  • [10] Beckmann MJ (1958) Decision and team problems in airline reservations. Econometrica 26(1):134–145.CrossrefGoogle Scholar
  • [11] Beneš VE (1971) Existence of optimal stochastic control laws. SIAM J. Control 9(3):446–472.CrossrefGoogle Scholar
  • [12] Bertsekas DP, Shreve S (1978) Stochastic Optimal Control: The Discrete Time Case (Academic Press, New York).Google Scholar
  • [13] Blackwell D (1964) Memoryless strategies in finite-stage dynamic programming. Ann. Math. Statist. 35:863–865.CrossrefGoogle Scholar
  • [14] Borkar V (2000) Average cost dynamic programming equations for controlled Markov chains with partial observations. SIAM J. Control Optim. 39(3):673–681.CrossrefGoogle Scholar
  • [15] Borkar V (2007) Dynamic programming for ergodic control of Markov chains under partial observations: A correction. SIAM J. Control Optim. 45(6):2299–2304.CrossrefGoogle Scholar
  • [16] Borkar VS (1993) White-noise representations in stochastic realization theory. SIAM J. Control Optim. 31:1093–1102.CrossrefGoogle Scholar
  • [17] Brandao FGSL, Harrow AW (2017) Quantum de Finetti theorems under local measurements with applications. Comm. Math. Phys. 353(2):469–506.CrossrefGoogle Scholar
  • [18] Brunner N, Cavalcanti D, Pironio S, Scarani V, Wehner S (2014) Bell nonlocality. Rev. Modern Phys. 86(2):419–478.CrossrefGoogle Scholar
  • [19] Caines P, Huang M, Malhamé R (2017) Mean field games. Başar T, Zaccour G, eds. Handbook of Dynamic Game Theory (Springer, Cham), 345–372.CrossrefGoogle Scholar
  • [20] Campi L, Fischer M (2022) Correlated equilibria and mean field games: A simple model. Math. Oper. Res., ePub ahead of print, February 10, https://doi.org/10.1287/moor.2021.1206.LinkGoogle Scholar
  • [21] Cardaliaguet P (2011) Notes on mean field games. (from P.-L. Lions’ lectures at College de France). Lecture notes, April–May 2010, Tor Vergata, Rome.Google Scholar
  • [22] Cardaliaguet P, Rainer C (2020) An example of multiple mean field limits in ergodic differential games. Nonlinear Differential Equations Appl. 27:25.CrossrefGoogle Scholar
  • [23] Cardaliaguet P, Delarue F, Lasry J, Lions P (2019) The Master Equation and the Convergence Problem in Mean Field Games. Annals of Mathematics Studies, vol. 201 (Princeton University Press, Princeton, NJ).Google Scholar
  • [24] Carmona R, Delarue F (2018) Probabilistic Theory of Mean Field Games with Applications, 2 vols. (Springer, Cham, Switzerland).Google Scholar
  • [25] Carmona R, Delarue F, Lacker D (2016) Mean field games with common noise. Ann. Probab. 44(6):3740–3803.CrossrefGoogle Scholar
  • [26] Caves CM, Fuchs CA, Schack R (2002) Unknown quantum states: The quantum de Finetti representation. J. Math. Phys. 43(9):4537–4559.CrossrefGoogle Scholar
  • [27] Cecchin A (2021) Finite state N-agent and mean field control problems. ESAIM Control Optim. Calculus Variations 27:31.CrossrefGoogle Scholar
  • [28] Cecchin A, Pra OD, Fischer M, Pelino G (2019) On the convergence problem in mean field games: A two state model without uniqueness. SIAM J. Control Optim. 57(4):2443–2466.CrossrefGoogle Scholar
  • [29] Charalambous CD (2016) Decentralized optimality conditions of stochastic differential decision problems via Girsanov’s measure transformation. Math. Control Signals Systems 28(3):1–55.CrossrefGoogle Scholar
  • [30] Christandl M, Toner B (2009) Finite de Finetti theorem for conditional probability distributions describing physical theories. J. Math. Phys. 50(4):042104.CrossrefGoogle Scholar
  • [31] Davison E, Rau N, Palmay F (1973) The optimal decentralized control of a power system consisting of a number of interconnected synchronous machines. Internat. J. Control 18(6):1313–1328.CrossrefGoogle Scholar
  • [32] Delarue F, Tchuendom R (2020) Selection of equilibria in a linear quadratic mean-field game. Stochastic Processes Their Appl. 130(2):1000–1040.CrossrefGoogle Scholar
  • [33] Diaconis P, Freedman D (1980) Finite exchangeable sequences. Ann. Probab. 8(4):745–764.CrossrefGoogle Scholar
  • [34] Filippov A (1962) On certain questions in the theory of optimal control. J. Soc. Indust. Appl. Math., Ser. A. Control 1(1):76–84.CrossrefGoogle Scholar
  • [35] Fischer M (2017) On the connection between symmetric N-player games and mean field games. Ann. Appl. Probab. 27(2):757–810.CrossrefGoogle Scholar
  • [36] Girsanov IV (1960) On transforming a certain class of stochastic processes by absolutely continuous substitution of measures. Theory Probab. Appl. 5(3):285–301.CrossrefGoogle Scholar
  • [37] Gupta A, Yüksel S, Başar T, Langbort C (2015) On the existence of optimal policies for a class of static and sequential dynamic teams. SIAM J. Control Optim. 53(3):1681–1712.CrossrefGoogle Scholar
  • [38] Hajek B, Livesay M (2019) On non-unique solutions in mean field games. Proc. 58th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 1219–1224.Google Scholar
  • [39] Hernández-Lerma O, Lasserre JB (1996) Discrete-Time Markov Control Processes: Basic Optimality Criteria (Springer, New York).CrossrefGoogle Scholar
  • [40] Hespanha J, Naghshtabrizi P, Xu Y (2007) A survey of recent results in networked control systems. Proc. IEEE 95(1):138–162.CrossrefGoogle Scholar
  • [41] Hewitt E, Savage LJ (1955) Symmetric measures on Cartesian products. Trans. Amer. Math. Soc. 80(2):470–501.CrossrefGoogle Scholar
  • [42] Ho Y (1980) Team decision theory and information structures. Proc. IEEE 68(6):644–654.CrossrefGoogle Scholar
  • [43] Ho YC, Chu KC (1972) Team decision theory and information structures in optimal control problems—Part I. IEEE Trans. Automatic Control 17(1):15–22.CrossrefGoogle Scholar
  • [44] Huang M, Nguyen SL (2016) Linear-quadratic mean field teams with a major agent. Proc. 55th IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 6958–6963.Google Scholar
  • [45] Huang M, Caines PE, Malhamé RP (2006) Large population stochastic dynamic games: Closed-loop Mckean–Vlasov systems and the Nash certainty equivalence principle. Comm. Inform. Systems 6(3):221–251.CrossrefGoogle Scholar
  • [46] Huang M, Caines PE, Malhamé RP (2007) Large-population cost-coupled LQG problems with nonuniform agents: Individual-mass behavior and decentralized ϵ-Nash equilibria. IEEE Trans. Automatic Control 52(9):1560–1571.CrossrefGoogle Scholar
  • [47] Huang M, Caines PE, Malhamé RP (2012) Social optima in mean field LQG control: Centralized and decentralized strategies. IEEE Trans. Automatic Control 57(7):1736–1751.CrossrefGoogle Scholar
  • [48] Jovanovic B, Rosenthal RW (1988) Anonymous sequential games. J. Math. Econom. 17(1):77–87.CrossrefGoogle Scholar
  • [49] Kallenberg O (1973) Canonical representations and convergence criteria for processes with interchangeable increments. Z. Wahrscheinlichkeitstheorie verw. Gebiete 27(1):23–36.CrossrefGoogle Scholar
  • [50] Kallenberg O (2006) Probabilistic Symmetries and Invariance Principles (Springer, New York).Google Scholar
  • [51] Kingman JFC (1978) Uses of exchangeability. Ann. Probab. 6(2):183–197.CrossrefGoogle Scholar
  • [52] Krainak JC, Speyer JL, Marcus SI (1982) Static team problems—Part I: Sufficient conditions and the exponential cost criterion. IEEE Trans. Automatic Control 27:839–848.CrossrefGoogle Scholar
  • [53] Lacker D (2015) Mean field games via controlled martingale problems: Existence of Markovian equilibria. Stochastic Processes Their Appl. 125(7):2856–2894.CrossrefGoogle Scholar
  • [54] Lacker D (2016) A general characterization of the mean field limit for stochastic differential games. Probab. Theory Related Fields 165(3–4):581–648.CrossrefGoogle Scholar
  • [55] Lacker D (2017) Limit theory for controlled Mckean–Vlasov dynamics. SIAM J. Control Optim. 55(3):1641–1672.CrossrefGoogle Scholar
  • [56] Lacker D (2020) On the convergence of closed-loop Nash equilibria to the mean field game limit. Ann. Appl. Probab. 30(4):1693–1761.CrossrefGoogle Scholar
  • [57] Lasry JM, Lions PL (2007) Mean field games. Japanese J. Math. 2:229–260.CrossrefGoogle Scholar
  • [58] Light B, Weintraub GY (2022) Mean field equilibrium: Uniqueness, existence, and comparative statics. Oper. Res. 70(1):585–605.Google Scholar
  • [59] Mahajan A, Martins NC, Yüksel S (2013) Static LQG teams with countably infinite players. Proc. 52nd IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 6765–6770.Google Scholar
  • [60] Mahajan A, Martins N, Rotkowitz M, Yüksel S (2012) Information structures in optimal decentralized control. Proc. 51st IEEE Conf. Decision Control (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 1291–1306.Google Scholar
  • [61] Marschak J (1955) Elements for a theory of teams. Management Sci. 1(2):127–137.LinkGoogle Scholar
  • [62] Mas-Colell A (1984) On a theorem of Schmeidler. J. Math. Econom. 13(3):201–206.CrossrefGoogle Scholar
  • [63] McGuire CB (1961) Some team models of a sales organization. Management Sci. 7(2):101–130.LinkGoogle Scholar
  • [64] Popescu S (2014) Nonlocality beyond quantum mechanics. Nature Phys. 10(4):264–270.CrossrefGoogle Scholar
  • [65] Radner R (1962) Team decision problems. Ann. Math. Statist. 33(3):857–881.CrossrefGoogle Scholar
  • [66] Renner R (2007) Symmetry of large physical systems implies independence of subsystems. Nature Phys. 3(9):645–649.CrossrefGoogle Scholar
  • [67] Saldi N (2019) A topology for team policies and existence of optimal team policies in stochastic team theory. IEEE Trans. Automatic Control 65(1):310–317.CrossrefGoogle Scholar
  • [68] Sandell N, Varaiya P, Athans M, Safonov M (1978) Survey of decentralized control methods for large scale systems. IEEE Trans. Automatic Control 23(2):108–128.CrossrefGoogle Scholar
  • [69] Sanjari S, Yüksel S (2021a) Optimal policies for convex symmetric stochastic dynamic teams and their mean-field limit. SIAM J. Control Optim. 59(2):777–804.CrossrefGoogle Scholar
  • [70] Sanjari S, Yüksel S (2021b) Optimal solutions to infinite-player stochastic teams and mean-field teams. IEEE Trans. Automatic Control 66(3):1071–1086.CrossrefGoogle Scholar
  • [71] Sanjari S, Saldi N, Yüksel S (2020) Optimality of independently randomized symmetric policies for exchangeable stochastic teams with infinitely many decision makers. Preprint, submitted August 26, https://arxiv.org/abs/2008.11570.Google Scholar
  • [72] Schäl M (1975) Conditions for optimality in dynamic programming and for the limit of n-stage optimal policies to be optimal. Z. Wahrscheinlichkeitstheorie verw. Gebiete 32:179–296.CrossrefGoogle Scholar
  • [73] Schmeidler D (1973) Equilibrium points of nonatomic games. J. Statist. Phys. 7(4):295–300.CrossrefGoogle Scholar
  • [74] Serfozo R (1982) Convergence of Lebesgue integrals with varying measures. Sankhyā: Indian J. Statist. Ser. A. 44(3):380–402.Google Scholar
  • [75] Tsitsiklis JN (1988) Decentralized detection by a large number of sensors. Math. Control Signals Systems 1(2):167–182.CrossrefGoogle Scholar
  • [76] Wang BC, Zhang JF (2017) Social optima in mean field linear-quadratic-Gaussian models with Markov jump parameters. SIAM J. Control Optim. 55(1):429–456.CrossrefGoogle Scholar
  • [77] Witsenhausen H (1968) A counterexample in stochastic optimal control. SIAM J. Control Optim. 6:131–147.CrossrefGoogle Scholar
  • [78] Witsenhausen H (1988) Equivalent stochastic control problems. Math. Control Signals Systems 1(1):3–11.CrossrefGoogle Scholar
  • [79] Witsenhausen HS (1975) The intrinsic model for discrete stochastic control: Some open problems. Bensoussan A, Lions JL, eds. Control Theory, Numerical Methods and Computer Systems Modelling. Lecture Notes in Economics and Mathematical Systems, vol. 107 (Springer, Berlin), 322–335.CrossrefGoogle Scholar
  • [80] Young L (1937) Generalized curves and the existence of an attained absolute minimum in the calculus of variations. Comptes Rendus de la Societe des Sci. et des Lettres de Varsovie 30:212–234.Google Scholar
  • [81] Yu X, Zhang Y, Zhou Z (2021) Teamwise mean field competitions. Appl. Math. Optim. 84:903–942.CrossrefGoogle Scholar
  • [82] Yüksel S (2017) On stochastic stability of a class of non-Markovian processes and applications in quantization. SIAM J. Control Optim. 55(2):1241–1260.CrossrefGoogle Scholar
  • [83] Yüksel S (2020) A universal dynamic program and refined existence results for decentralized stochastic control. SIAM J. Control Optim. 58(5):2711–2739.CrossrefGoogle Scholar
  • [84] Yüksel S, Başar T (2013) Stochastic Networked Control Systems: Stabilization and Optimization under Information Constraints (Springer, New York).CrossrefGoogle Scholar
  • [85] Yüksel S, Saldi N (2017) Convex analysis in decentralized stochastic control, strategic measures and optimal solutions. SIAM J. Control Optim. 55(1):1–28.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.