Learning-Based Online Optimization for Autonomous Mobility-on-Demand Fleet Control

Published Online:https://doi.org/10.1287/ijoc.2024.0637

References

  • Alonso-Mora J, Wallar A, Rus D (2017) Predictive routing for autonomous mobility-on-demand systems with ride-sharing. 2017 IEEE/RSJ Internat. Conf. Intelligent Robots Systems (IROS) (IEEE, New York), 3583–3590.Google Scholar
  • Bengio Y, Lodi A, Prouvost A (2021) Machine learning for combinatorial optimization: A methodological tour d’horizon. Eur. J. Oper. Res. 290(2):405–421.CrossrefGoogle Scholar
  • Berthet Q, Blondel M, Teboul O, Cuturi M, Vert JP, Bach F (2020) Learning with differentiable perturbed optimizers. Larochelle H, Ranzato M, Hadsell R, Balcan M, Lin H, eds. Advances in Neural Information Processing Systems, vol. 33 (Curran Associates, Inc., Red Hook, NY), 9508–9519.Google Scholar
  • Bertsimas D, Jaillet P, Martin S (2019) Online vehicle routing: The edge of optimization in large-scale applications. Oper. Res. 67(1):143–162.LinkGoogle Scholar
  • Bösch PM, Becker F, Becker H, Axhausen KW (2018) Cost-based analysis of autonomous mobility services. Transport Policy 64:76–91.CrossrefGoogle Scholar
  • Dalle G, Baty L, Bouvier L, Parmentier A (2022) Learning with combinatorial optimization layers: A probabilistic approach. Preprint, submitted July 27, https://arxiv.org/abs/2207.13513.Google Scholar
  • Donti P, Amos B, Kolter JZ (2017) Task-based end-to-end model learning in stochastic optimization. Guyon I, Von Luxburg U, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R, eds. Advances in Neural Information Processing Systems, vol. 30 (Curran Associates, Inc., Red Hook, NY), 5484–5494.Google Scholar
  • Elmachtoub AN, Grigas P (2022) Smart “predict, then optimize.” Management Sci. 68(1):9–26.LinkGoogle Scholar
  • Enders T, Harrison J, Pavone M, Schiffer M (2023) Hybrid multi-agent deep reinforcement learning for autonomous mobility on demand systems. Matni N, Morari M, Pappas GJ, eds. Proc. 5th Annual Learn. Dynam. Control Conf., Proceedings of Machine Learning Research, vol. 211 (PMLR, New York), 1284–1296.Google Scholar
  • Gammelli D, Yang K, Harrison J, Rodrigues F, Pereira FC, Pavone M (2021) Graph neural network reinforcement learning for autonomous mobility-on-demand systems. 2021 60th IEEE Conf. Decision Control (CDC) (IEEE, New York), 2996–3003.Google Scholar
  • Iglesias R, Rossi F, Wang K, Hallac D, Leskovec J, Pavone M (2018) Data-driven model predictive control of autonomous mobility-on-demand systems. 2018 IEEE Internat. Conf. Robotics Automation (ICRA) (IEEE, New York), 6019–6025.Google Scholar
  • Jiao Y, Tang X, Qin ZT, Li S, Zhang F, Zhu H, Ye J (2021) Real-world ride-hailing vehicle repositioning using deep reinforcement learning. Transportation Res. Part C Emerging Tech. 130:103289.CrossrefGoogle Scholar
  • Jungel K, Parmentier A, Schiffer M, Vidal T (2025) Learning-based online optimization for autonomous mobility-on-demand fleet control. https://doi.org/10.1287/ijoc.2024.0637.cd, https://github.com/INFORMSJoC/2024.0637.Google Scholar
  • Kotary J, Fioretto F, Van Hentenryck P, Wilder B (2021) End-to-end constrained optimization learning: A survey. Zhou ZH, ed., Proc. 30th Internat. Joint Conf. Artificial Intelligence IJCAI-21 (International Joint Conferences on Artificial Intelligence Organization), 4475–4482.Google Scholar
  • Levy JI, Buonocore JJ, von Stackelberg K (2010) Evaluation of the public health impacts of traffic congestion: A health risk assessment. Environ. Health 9(1):65.CrossrefGoogle Scholar
  • Li M, Qin Z, Jiao Y, Yang Y, Wang J, Wang C, Wu G, Ye J (2019) Efficient ridesharing order dispatching with mean field multi-agent reinforcement learning. World Wide Web Conf. (Association for Computing Machinery, New York), 983–994.Google Scholar
  • Liang E, Wen K, Lam WHK, Sumalee A, Zhong R (2022) An integrated reinforcement learning and centralized programming approach for online taxi dispatching. IEEE Trans. Neural Networks Learn. Systems 33(9):4742–4756.CrossrefGoogle Scholar
  • Liu Y, Samaranayake S (2022) Proactive rebalancing and speed-up techniques for on-demand high capacity ridesourcing services. IEEE Trans. Intelligent Transportation Systems 23(2):819–826.CrossrefGoogle Scholar
  • New York City Taxi & Limousine Commission (2015) TLC trip record data. https://www1.nyc.gov/site/tlc/about/tlc-trip-record-data.page.Google Scholar
  • Nowozin S, Lampert CH (2011) Structured learning and prediction in computer vision. Foundations Trends Comput. Graphics Vision 6(3–4):185–365.CrossrefGoogle Scholar
  • Parmentier A (2021) Learning to approximate industrial problems by operations research classic problems. Oper. Res. 70(1):606–623.LinkGoogle Scholar
  • Parmentier A, T’Kindt V (2023) Structured learning based heuristics to solve the single machine scheduling problem with release times and sum of completion times. Eur. J. Oper. Res. 305(3):1032–1041.CrossrefGoogle Scholar
  • Pavone M (2015) Autonomous mobility-on-demand systems for future urban mobility. Maurer M, Gerdes J, Lenz B, Winner H, eds. Autonomes Fahren (Springer, Berlin), 399–416.CrossrefGoogle Scholar
  • Pavone M, Smith SL, Frazzoli E, Rus D (2012) Robotic load balancing for mobility-on-demand systems. Internat. J. Robotics Res. 31(7):839–854.CrossrefGoogle Scholar
  • Sadeghi Eshkevari S, Tang X, Qin Z, Mei J, Zhang C, Meng Q, Xu J (2022) Reinforcement learning in the wild: Scalable RL dispatching algorithm deployed in ridehailing marketplace. KDD ‘22 Proc. 28th ACM SIGKDD Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York), 3838–3848.Google Scholar
  • Schiffer M, Hiermann G, Rüdel F, Walther G (2021) A polynomial-time algorithm for user-based relocation in free-floating car sharing systems. Transportation Res. Part B Methodological 143:65–85.CrossrefGoogle Scholar
  • Schrank D, Eisele B, Lomax T (2019) 2019 urban mobility report. Technical report, Texas A&M Transportation Institute, College Station.Google Scholar
  • Skordilis E, Hou Y, Tripp C, Moniot M, Graf P, Biagioni D (2022) A modular and transferable reinforcement learning framework for the fleet rebalancing problem. IEEE Trans. Intelligent Transportation Systems 23(8):11903–11916.CrossrefGoogle Scholar
  • Statista (2021) Market share of the leading ride-hailing companies in the United States from September 2017 to July 2021. https://www.statista.com/statistics/910704/market-share-of-rideshare-companies-united-states/.Google Scholar
  • Tang X, Qin ZT, Zhang F, Wang Z, Xu Z, Ma Y, Zhu H, Ye J (2019) A deep value-network based approach for multi-driver order dispatching. KDD ‘19 Proc. 25th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York), 1780–1790.Google Scholar
  • Tsao M, Iglesias R, Pavone M (2018) Stochastic model predictive control for autonomous mobility on demand. 2018 21st Internat. Conf. Intelligent Transportation Systems (ITSC) (IEEE, New York), 3941–3948.Google Scholar
  • Vidal T, Laporte G, Matl P (2020) A concise guide to existing and emerging vehicle routing problem variants. Eur. J. Oper. Res. 286(2):401–416.CrossrefGoogle Scholar
  • White J (2020) Waymo opens driverless robo-taxi service to the public in Phoenix. Reuters (October 8), https://www.reuters.com/article/us-waymo-autonomous-phoenix-idUKKBN26T2Y3.Google Scholar
  • Xu Z, Li Z, Guan Q, Zhang D, Li Q, Nan J, Liu C, Bian W, Ye J (2018) Large-scale order dispatch in on-demand ride-hailing platforms: A learning and planning approach. KDD ‘18 Proc. 24th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York), 905–913.Google Scholar
  • Zardini G, Lanzetti N, Pavone M, Frazzoli E (2022) Analysis and control of autonomous mobility-on-demand systems. Annual Rev. Control Robotics Autonomous Systems 5(1):633–658.CrossrefGoogle Scholar
  • Zhang R, Pavone M (2016) Control of robotic mobility-on-demand systems: A queueing-theoretical perspective. Internat. J. Robotics Res. 35(1–3):186–203.CrossrefGoogle Scholar
  • Zhou M, Jin J, Zhang W, Qin Z, Jiao Y, Wang C, Wu G, Yu Y, Ye J (2019) Multi-agent reinforcement learning for order-dispatching via order-vehicle distribution matching. CIKM ‘19 Proc. 28th ACM Internat. Conf. Inform. Knowledge Management (Association for Computing Machinery, New York), 2645–2653.Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.