Near-Optimal Performance of Stochastic Model Predictive Control

Published Online:https://doi.org/10.1287/moor.2023.0159

References

  • [1] Anderson BD, Moore JB (2007) Optimal Control: Linear Quadratic Methods (Courier Corporation, North Chelmsford, MA), 1–451.Google Scholar
  • [2] Angeli D, Amrit R, Rawlings JB (2011) On average performance and stability of economic model predictive control. IEEE Trans. Automatic Control 57(7):1615–1626.CrossrefGoogle Scholar
  • [3] Åström KJ (2012) Introduction to Stochastic Control Theory (Courier Corporation, North Chelmsford, MA), 320.Google Scholar
  • [4] Bernardini D, Bemporad A (2009) Scenario-based model predictive control of stochastic constrained linear systems. Proc. 48th IEEE Conf. Decision Control (CDC) Held Jointly 2009 28th Chinese Control Conf. (IEEE, New York), 6333–6338.Google Scholar
  • [5] Bernardini D, Bemporad A (2011) Stabilizing model predictive control of stochastic constrained linear systems. IEEE Trans. Automatic Control 57(6):1468–1480.CrossrefGoogle Scholar
  • [6] Bertsekas D (2012) Dynamic Programming and Optimal Control, vol. 1 (Athena Scientific, Nashua, NH), 1–159.Google Scholar
  • [7] Bertsekas D (2019) Reinforcement Learning and Optimal Control (Athena Scientific, Nashua, NH), 1–15.Google Scholar
  • [8] Birge JR (1985) Decomposition and partitioning methods for multistage stochastic linear programs. Oper. Res. 33(5):989–1007.LinkGoogle Scholar
  • [9] Birge JR, Louveaux F (2011) Introduction to Stochastic Programming, 2nd ed. (Springer Science & Business Media, Berlin), 1–512.CrossrefGoogle Scholar
  • [10] Chatterjee D, Lygeros J (2014) On stability and performance of stochastic predictive control techniques. IEEE Trans. Automatic Control 60(2):509–514.CrossrefGoogle Scholar
  • [11] Chen X, Qu G, Tang Y, Low S, Li N (2022) Reinforcement learning for selective key applications in power systems: Recent advances and future challenges. IEEE Trans. Smart Grid 13(4):2935–2958.CrossrefGoogle Scholar
  • [12] Dantzig GB, Infanger G (1993) Multi-stage stochastic linear programs for portfolio optimization. Ann. Oper. Res. 45(1):59–76.CrossrefGoogle Scholar
  • [13] Dean S, Mania H, Matni N, Recht B, Tu S (2020) On the sample complexity of the linear quadratic regulator. Foundations Comput. Math. 20(4):633–679.CrossrefGoogle Scholar
  • [14] De la Penad D, Bemporad A, Alamo T (2005) Stochastic programming applied to model predictive control. Proc. 44th IEEE Conf. Decision Control (IEEE, New York), 1361–1366.Google Scholar
  • [15] Demko S (1977) Inverses of band matrices and local convergence of spline projections. SIAM J. Numer. Anal. 14(4):616–619.CrossrefGoogle Scholar
  • [16] Faulwasser T, Grüne L, Müller MA (2018) Economic nonlinear model predictive control. Foundations Trends Systems Control 5(1):1–98.CrossrefGoogle Scholar
  • [17] Fazel M, Ge R, Kakade S, Mesbahi M (2018) Global convergence of policy gradient methods for the linear quadratic regulator. Dy J, Krause A, eds. Proc. 35th Internat. Conf. Machine Learn., 1467–1476.Google Scholar
  • [18] Frison G, Diehl M (2020) HPIPM: A high-performance quadratic programming framework for model predictive control. IFAC-PapersOnLine 53(2):6563–6569.CrossrefGoogle Scholar
  • [19] Goulart PJ, Kerrigan EC (2008) Input-to-state stability of robust receding horizon control with an expected value cost. Automatica 44(4):1171–1174.CrossrefGoogle Scholar
  • [20] Green M, Limebeer DJ (2012) Linear Robust Control (Courier Corporation, North Chelmsford, MA), 1–538.Google Scholar
  • [21] Grüne L, Pirkelmann S (2020) Economic model predictive control for time-varying system: Performance and stability results. Optim. Control Appl. Methods 41(1):42–64.CrossrefGoogle Scholar
  • [22] Grune L, Rantzer A (2008) On the infinite horizon performance of receding horizon controllers. IEEE Trans. Automatic Control 53(9):2100–2111.CrossrefGoogle Scholar
  • [23] Hans CA, Sopasakis P, Bemporad A, Raisch J, Reincke-Collon C (2015) Scenario-based model predictive operation control of islanded microgrids. 2015 54th IEEE Conf. Decision Control (CDC) (IEEE, New York), 3272–3277.Google Scholar
  • [24] Ho JK, Manne AS (1974) Nested decomposition for dynamic models. Math. Programming 6(1):121–140.CrossrefGoogle Scholar
  • [25] Hu B, Linnemann A (2002) Toward infinite-horizon optimality in nonlinear model predictive control. IEEE Trans. Automatic Control 47(4):679–682.CrossrefGoogle Scholar
  • [26] Jacobson D (1973) Optimal stochastic linear systems with exponential performance criteria and their relation to deterministic differential games. IEEE Trans. Automatic Control 18(2):124–131.CrossrefGoogle Scholar
  • [27] Keerthi SS, Gilbert EG (1988) Optimal infinite-horizon feedback laws for a general class of constrained discrete-time systems: Stability and moving-horizon approximations. J. Optim. Theory Appl. 57(2):265–293.CrossrefGoogle Scholar
  • [28] Kleywegt AJ, Shapiro A, Homem-de Mello T (2002) The sample average approximation method for stochastic discrete optimization. SIAM J. Optim. 12(2):479–502.CrossrefGoogle Scholar
  • [29] Kouvaritakis B, Cannon M, Raković SV, Cheng Q (2010) Explicit use of probabilistic distributions in linear predictive control. Automatica 46(10):1719–1724.CrossrefGoogle Scholar
  • [30] Kumar R, Wenzel MJ, ElBsat MN, Risbeck MJ, Drees KH, Zavala VM (2020) Stochastic model predictive control for central HVAC plants. J. Process Control 90:1–17.CrossrefGoogle Scholar
  • [31] Kumar R, Wenzel MJ, Ellis MJ, ElBsat MN, Drees KH, Zavala VM (2018) A stochastic model predictive control framework for stationary battery systems. IEEE Trans. Power Systems 33(4):4397–4406.CrossrefGoogle Scholar
  • [32] Kumar R, Wenzel MJ, Ellis MJ, ElBsat MN, Drees KH, Zavala VM (2019) Hierarchical MPC schemes for periodic systems using stochastic programming. Automatica 107:306–316.CrossrefGoogle Scholar
  • [33] Li Y, Chen X, Li N (2019) Online optimal control with linear dynamics and predictions: Algorithms and regret analysis. Wallach H, Larochelle H, Beygelzimer A, d’Alché-Buc F, Fox E, Garnett R, eds. Adv. Neural Inform. Processing Systems 32 (NeurIPS 2019) (Curran Associates Inc., Red Hook, NY), 14887–14899.Google Scholar
  • [34] Lin Y, Hu Y, Qu G, Li T, Wierman A (2022) Bounded-regret MPC via perturbation analysis: Prediction error, constraints, and nonlinearity. Koyejo S, Mohamed S, Agarwal A, Belgrave D, Cho K, Oh A, eds. Adv. Neural Inform. Processing Systems 35 (NeurIPS 2022) (Curran Associates Inc., Red Hook, NY), 36174–36187.Google Scholar
  • [35] Lin Y, Hu Y, Shi G, Sun H, Qu G, Wierman A (2021) Perturbation-based regret analysis of predictive control in linear time varying systems. Ranzato M, Beygelzimer A, Dauphin Y, Liang PS, Wortman Vaughan J, eds. Adv. Neural Inform. Processing Systems 34 (NeurIPS 2021) (Curran Associates Inc., Red Hook, NY), 5174–5185.Google Scholar
  • [36] Lorenzen M, Dabbene F, Tempo R, Allgöwer F (2017) Constraint-tightening and stability in stochastic model predictive control. IEEE Trans. Automatic Control 62(7):3165–3177.CrossrefGoogle Scholar
  • [37] Lucia S, Finkler T, Engell S (2013) Multi-stage nonlinear model predictive control applied to a semi-batch polymerization reactor under uncertainty. J. Process Control 23(9):1306–1319.CrossrefGoogle Scholar
  • [38] Lucia S, Subramanian S, Limon D, Engell S (2020) Stability properties of multi-stage nonlinear model predictive control. Systems Control Lett. 143:104743.CrossrefGoogle Scholar
  • [39] Luo Y, Gupta V, Kolar M (2022) Dynamic regret minimization for control of non-stationary linear dynamical systems. Proc. ACM Measurement Anal. Comput. Systems 6(1):9:1–9:72.Google Scholar
  • [40] Mania H, Guy A, Recht B (2018) Simple random search of static linear policies is competitive for reinforcement learning. Bengio S, Wallach H, Larochelle H, Grauman K, Cesa-Bianchi N, Garnett R, eds. Adv. Neural Inform. Processing Systems 31 (NeurIPS 2018) (Curran Associates Inc., Red Hook, NY), 1805–1814.Google Scholar
  • [41] Mania H, Tu S, Recht B (2019) Certainty equivalence is efficient for linear quadratic control. Wallach H, Larochelle H, Beygelzimer A, d’Alché-Buc F, Fox E, eds. Adv. Neural Inform. Processing Systems 32 (NeurIPS 2019) (Curran Associates Inc., Red Hook, NY), 10154–10164.Google Scholar
  • [42] Mayne DQ, Falugi P (2019) Stabilizing conditions for model predictive control. Internat. J. Robust Nonlinear Control 29(4):894–903.CrossrefGoogle Scholar
  • [43] McAllister RD, Rawlings JB (2023) Nonlinear stochastic model predictive control: Existence, measurability, and stochastic asymptotic stability. IEEE Trans. Automatic Control 68(3):1524–1536.CrossrefGoogle Scholar
  • [44] McAllister RD, Rawlings JB (2023) The stochastic robustness of nominal and stochastic model predictive control. IEEE Trans. Automatic Control 68(10):5810–5822.CrossrefGoogle Scholar
  • [45] McAllister RD, Rawlings JB (2024) On the inherent distributional robustness of stochastic and nominal model predictive control. IEEE Trans. Automatic Control 69(2):741–754.CrossrefGoogle Scholar
  • [46] Mesbah A (2016) Stochastic model predictive control: An overview and perspectives for future research. IEEE Control Systems Magazine 36(6):30–44.CrossrefGoogle Scholar
  • [47] Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, et al. (2015) Human-level control through deep reinforcement learning. Nature 518(7540):529–533.CrossrefGoogle Scholar
  • [48] Morari M, Lee JH (1999) Model predictive control: Past, present and future. Comput. Chemical Engrg. 23(4–5):667–682.CrossrefGoogle Scholar
  • [49] Mulvey JM, Vladimirou H (1991) Applying the progressive hedging algorithm to stochastic generalized networks. Ann. Oper. Res. 31(1):399–424.CrossrefGoogle Scholar
  • [50] Na S (2021) Global convergence of online optimization for nonlinear model predictive control. Ranzato M, Beygelzimer A, Dauphin Y, Liang PS, Wortman Vaughan J, eds. Adv. Neural Inform. Processing Systems 34 (NeurIPS 2021) (Curran Associates Inc., Red Hook, NY), 12441–12453.Google Scholar
  • [51] Na S, Anitescu M (2023) Superconvergence of online optimization for model predictive control. IEEE Trans. Automatic Control 68(3):1383–1398.CrossrefGoogle Scholar
  • [52] Nocedal J, Wright SJ (1999) Numerical Optimization (Springer, New York), 1–684.CrossrefGoogle Scholar
  • [53] Pedersen TH, Petersen S (2018) Investigating the performance of scenario-based model predictive control of space heating in residential buildings. J. Building Performance Simulation 11(4):485–498.CrossrefGoogle Scholar
  • [54] Pereira MV, Pinto LM (1991) Multi-stage stochastic optimization applied to energy planning. Math. Programming52(1):359–375.CrossrefGoogle Scholar
  • [55] Primbs JA, Sung CH (2009) Stochastic receding horizon control of constrained linear systems with state and control multiplicative noise. IEEE Trans. Automatic Control 54(2):221–230.CrossrefGoogle Scholar
  • [56] Rawlings JB, Mayne DQ, Diehl M (2017) Model Predictive Control: Theory, Computation, and Design, vol. 2 (Nob Hill Publishing, Madison, WI), 1–819.Google Scholar
  • [57] Rockafellar RT, Wets RJB (1991) Scenarios and policy aggregation in optimization under uncertainty. Math. Oper. Res. 16(1):119–147.LinkGoogle Scholar
  • [58] Schießl J, Ou R, Faulwasser T, Baumann MH, Grüne L (2024) Near-optimal performance of stochastic economic MPC. 2024 IEEE 63rd Conf. Decision Control (CDC) (IEEE, New York), 2565–2571.Google Scholar
  • [59] Schildbach G, Fagiano L, Frei C, Morari M (2014) The scenario approach for stochastic model predictive control with bounds on closed-loop constraint violations. Automatica 50(12):3009–3018.CrossrefGoogle Scholar
  • [60] Schulman J, Levine S, Abbeel P, Jordan M, Moritz P (2015) Trust region policy optimization. Bach F, Blei D, eds. Internat. Conf. Machine Learn., 1889–1897.Google Scholar
  • [61] Shapiro A (2006) On complexity of multistage stochastic programs. Oper. Res. Lett. 34(1):1–8.CrossrefGoogle Scholar
  • [62] Shapiro A (2011) Analysis of stochastic dual dynamic programming method. Eur. J. Oper. Res. 209(1):63–72.CrossrefGoogle Scholar
  • [63] Shapiro A, Ding L (2020) Periodical multistage stochastic programs. SIAM J. Optim. 30(3):2083–2102.CrossrefGoogle Scholar
  • [64] Shapiro A, Dentcheva D, Ruszczynski A (2021) Lectures on Stochastic Programming: Modeling and Theory (SIAM, Philadelphia), 1–512.CrossrefGoogle Scholar
  • [65] Shin S, Zavala VM (2021) Controllability and observability imply exponential decay of sensitivity in dynamic optimization. IFAC-PapersOnLine 54(6):179–184.CrossrefGoogle Scholar
  • [66] Shin S, Anitescu M, Zavala VM (2020) Overlapping Schwarz decomposition for constrained quadratic programs. 2020 59th IEEE Conf. Decision Control (CDC) (IEEE, New York), 3004–3009.Google Scholar
  • [67] Shin S, Anitescu M, Zavala VM (2022) Exponential decay of sensitivity in graph-structured nonlinear programs. SIAM J. Optim. 32(2):1156–1183.CrossrefGoogle Scholar
  • [68] Shin S, Lin Y, Qu G, Wierman A, Anitescu M (2023) Near-optimal distributed linear-quadratic regulator for networked systems. SIAM J. Control Optim. 61(3):1113–1135.CrossrefGoogle Scholar
  • [69] Simon HA (1956) Dynamic programming under uncertainty with a quadratic criterion function. Econometrica 24(1):74–81.CrossrefGoogle Scholar
  • [70] Thangavel S, Lucia S, Paulen R, Engell S (2018) Dual robust nonlinear model predictive control: A multi-stage approach. J. Process Control 72:39–51.CrossrefGoogle Scholar
  • [71] Theil H (1957) A note on certainty equivalence in dynamic planning. Econometrica 25(2):346–349.CrossrefGoogle Scholar
  • [72] Xu W, Anitescu M (2018) Exponentially accurate temporal decomposition for long-horizon linear-quadratic dynamic optimization. SIAM J. Optim. 28(3):2541–2573.CrossrefGoogle Scholar
  • [73] Yu ZJ, Biegler LT (2019) Advanced-step multistage nonlinear model predictive control: Robustness and stability. J. Process Control 84:192–206.CrossrefGoogle Scholar
  • [74] Yu C, Shi G, Chung SJ, Yue Y, Wierman A (2020) The power of predictions in online control. Larochelle H, Ranzato M, Hadsell R, Balcan MF, Lin H, eds. Adv. Neural Inform. Processing Systems 33 (NeurIPS 2020) (Curran Associates Inc., Red Hook, NY), 1994–2004.Google Scholar
  • [75] Zhang R, Li Y, Li N (2021) On the regret analysis of online LQR control with predictions. 2021 Amer. Control Conf. (ACC) (IEEE, New York), 697–703.Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.