Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise
Published Online:21 Jul 2020https://doi.org/10.1287/moor.2019.1037
References
- [1] (2005) Stability of stochastic approximation under verifiable conditions. SIAM J. Control Optim. 44(1):283–312.Crossref, Google Scholar
- [2] (2012) Differential Inclusions: Set-Valued Maps and Viability Theory, vol. 264 (Springer Science & Business Media, New York).Google Scholar
- [3] (1996) A dynamical system approach to stochastic approximations. SIAM J. Control Optim. 34(2):437–472.Crossref, Google Scholar
- [4] (2005) Stochastic approximations and differential inclusions. SIAM J. Control Optim. 44(1):328–348.Crossref, Google Scholar
- [5] (2005) Numerical solution of saddle point problems. Acta Numerica 14:1–137.Crossref, Google Scholar
- [6] (2009) Convex Optimization Theory (Athena Scientific, Belmont, MA).Google Scholar
- [7] (2015) Simultaneous perturbation newton algorithms for simulation optimization. J. Optim. Theory Appl. 164(2):621–643.Crossref, Google Scholar
- [8] (1989) Optimal Control of Diffusion Processes. Pitman Lecture Notes in Mathematics, vol. 203 (Longman Scientific and Technical, Harlow, UK).Google Scholar
- [9] (1997) Stochastic approximation with two time scales. Systems Control Lett. 29(5):291–294.Crossref, Google Scholar
- [10] (2006) Stochastic approximation with controlled Markov noise. Systems Control Lett. 55(2):139–145.Crossref, Google Scholar
- [11] (2008) Stochastic Approximation: A Dynamical Systems Viewpoint (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- [12] (2012) Probability Theory: An Advanced Course (Springer Science & Business Media, New York).Google Scholar
- [13] (2000) The ODE method for convergence of stochastic approximation and reinforcement learning. SIAM J. Control Optim. 38(2):447–469.Crossref, Google Scholar
- [14] (1987) Convergence and robustness of the Robbins-Monro algorithm truncated at randomly varying bounds. Stochastic Processes Their Appl. 27:217–231.Crossref, Google Scholar
- [15] (1986) Stochastic approximation procedures with randomly varying truncations. Sci. China Ser. A: Math., Phys., Astron., Tech. Sci. 29(9):914–926.Google Scholar
- [16] (1974) Two models analyzing the dynamics of adaptation algorithms. Avtomatika i Telemekhanika (1):67–75.Google Scholar
- [17] (2013) Random Iterative Models, vol. 34 (Springer Science & Business Media, New York).Google Scholar
- [18] (2016) Convergence of markovian stochastic approximation with discontinuous dynamics. SIAM J. Control Optim. 54(2):866–893.Crossref, Google Scholar
- [19] (2017) Extragradient method with variance reduction for stochastic variational inequalities. SIAM J. Optim. 27(2):686–724.Crossref, Google Scholar
- [20] (2008) Stochastic approximation approaches to the stochastic variational inequality problem. IEEE Trans. Automatic Control 53(6):1462–1475.Crossref, Google Scholar
- [21] (2017) Two time-scale stochastic approximation with controlled markov noise and off-policy temporal-difference learning. Math. Oper. Res. 43(1):130–151.Link, Google Scholar
- [22] (2000) The inflation of attractors and their discretization: The autonomous case. Nonlinear Anal.: Theory Methods Appl. 40(1–8):333–344.Google Scholar
- [23] (2013) Regularized iterative stochastic approximation methods for stochastic variational inequality problems. IEEE Trans. Automatic Control 58(3):594–609.Crossref, Google Scholar
- [24] (2005) Topology of Metric Spaces (Alpha Science International, Oxford, UK).Google Scholar
- [25] (2003) Stochastic approximation and recursive algorithms and applications (Springer, New York).Google Scholar
- [26] (2002) On dynamical properties of general dynamical systems and differential inclusions. J. Math. Anal. Appl. 274(2):705–724.Crossref, Google Scholar
- [27] (2013) Limit Theorems and Applications of Set-Valued and Fuzzy Set-Valued Random Variables, vol. 43 (Springer Science & Business Media, New York).Google Scholar
- [28] (1984) Applications of a Kushner and Clark lemma to general classes of stochastic algorithms. IEEE Trans. Inform. Theory 30(2):140–151.Crossref, Google Scholar
- [29] (2012) Markov Chains and Stochastic Stability (Springer Science & Business Media, New York).Google Scholar
- [30] (2002) Envelope theorems for arbitrary choice sets. Econometrica 70(2):583–601.Google Scholar
- [31] (2012) Projected Dynamical Systems and Variational Inequalities with Applications, vol. 2 (Kluwer Academic Publishers, Norwell, MA).Google Scholar
- [32] (2009) Subgradient methods for saddle-point problems. J. Optim. Theory Appl. 142(1):205–228.Crossref, Google Scholar
- [33] (1967) Probability Measures on Metric Spaces, vol. 352 (American Mathematical Society, Providence, RI).Crossref, Google Scholar
- [34] (2012) Asynchronous stochastic approximation with differential inclusions. Stochastic Systems 2(2):409–446.Link, Google Scholar
- [35] (2016) A generalization of the borkar-meyn theorem for stochastic recursive inclusions. Math. Oper. Res. 42(3):648–661.Link, Google Scholar
- [36] (2016) Stochastic recursive inclusion in two timescales with an application to the Lagrangian dual problem. Stochastics 88(8):1173–1187.Google Scholar
- [37] (1951) A stochastic approximation method. Ann. Math. Statist. 22(3):400–407.Crossref, Google Scholar
- [38] (1976) Principles of mathematical analysis. International Series in Pure and Applied Mathematics (McGraw-Hill, New York).Google Scholar
- [39] (1992) Multivariate stochastic approximation using a simultaneous perturbation gradient approximation. IEEE Trans. Automatic Control 37(3):332–341.Crossref, Google Scholar
- [40] (2009) Fast gradient-descent methods for temporal-difference learning with linear function approximation. Proc. 26th Annual Internat. Conf. Machine Learning (ACM, New York), 993–1000.Google Scholar
- [41] (2018) Stochastic recursive inclusions with non-additive iterate-dependent markov noise. Stochastics 90(3):330–363.Crossref, Google Scholar
- [42] (2020) Analysis of stochastic approximation schemes with set-valued maps in the absence of a stability guarantee and their stabilization. IEEE Trans. Automatic Control 65(3):1100–1115.Google Scholar

