Learning-Based Online Optimization for Autonomous Mobility-on-Demand Fleet Control

Kai Jungel
https://orcid.org/0000-0002-6625-9942
TUM School of Management, Technical University of Munich, 80333 Munich, Germany

Axel Parmentier
https://orcid.org/0000-0003-1762-4947
CERMICS, École des Ponts, 77455 Marne-la-Vallée, France

Maximilian Schiffer (Corresponding Author)
https://orcid.org/0000-0003-2682-4975
TUM School of Management, Technical University of Munich, 80333 Munich, Germany; and Munich Data Science Institute, Technical University of Munich, 80333 Munich, Germany

Thibaut Vidal
https://orcid.org/0000-0001-5183-8485
CIRRELT & SCALE-AI Chair in Data-Driven Supply Chains, Department of Mathematics and Industrial Engineering, École Polytechnique de Montréal, Montréal, Quebec H3T 1J4, Canada

Published Online: 9 Jun 2025
https://doi.org/10.1287/ijoc.2024.0637

Jungel K, Parmentier A, Schiffer M, Vidal T (2025) Learning-based online optimization for autonomous mobility-on-demand fleet control. https://doi.org/10.1287/ijoc.2024.0637.cd, https://github.com/INFORMSJoC/2024.0637.
INFORMS Journal on Computing

Volume 38, Issue 3

May-June 2026

Pages 693-1031, iii

Received: February 19, 2024
Accepted: May 01, 2025
Published Online: June 09, 2025

Cite as

Kai Jungel, Axel Parmentier, Maximilian Schiffer, Thibaut Vidal (2025) Learning-Based Online Optimization for Autonomous Mobility-on-Demand Fleet Control. INFORMS Journal on Computing 38(3):745-765.

https://doi.org/10.1287/ijoc.2024.0637
