Multiagent Environments for Vehicle Routing Problems

Ricardo Gama
Corresponding Author
Ricardo Gama
[email protected]
https://orcid.org/0000-0002-7051-8310
Escola Superior de Tecnologia e Gestão de Lamego, Instituto Politécnico de Viseu, 5100-074 Lamego, Portugal
Search for more papers by this author
,
Ricardo Cunha
Ricardo Cunha
[email protected]
Escola Superior de Tecnologia e Gestão de Lamego, Instituto Politécnico de Viseu, 5100-074 Lamego, Portugal
Search for more papers by this author
,
Daniel Fuertes
Daniel Fuertes
[email protected]
https://orcid.org/0000-0002-5746-2199
Grupo de Tratamiento de Imágenes (GTI), Information Processing and Telecommunications Center, ETSI, Telecomunicación, Universidad Politécnica de Madrid, 28040 Madrid, Spain
Search for more papers by this author
,
Carlos R. del-Blanco
Carlos R. del-Blanco
[email protected]
https://orcid.org/0000-0003-0618-3488
Grupo de Tratamiento de Imágenes (GTI), Information Processing and Telecommunications Center, ETSI, Telecomunicación, Universidad Politécnica de Madrid, 28040 Madrid, Spain
Search for more papers by this author
,
Hugo L. Fernandes
Hugo L. Fernandes
[email protected]
Singuli, Brooklyn, New York 11217
Search for more papers by this author

Ricardo Gama

Corresponding Author

Ricardo Gama

[email protected]

https://orcid.org/0000-0002-7051-8310

Escola Superior de Tecnologia e Gestão de Lamego, Instituto Politécnico de Viseu, 5100-074 Lamego, Portugal

Search for more papers by this author

Ricardo Cunha

[email protected]

Escola Superior de Tecnologia e Gestão de Lamego, Instituto Politécnico de Viseu, 5100-074 Lamego, Portugal

Search for more papers by this author

Daniel Fuertes

[email protected]

https://orcid.org/0000-0002-5746-2199

Grupo de Tratamiento de Imágenes (GTI), Information Processing and Telecommunications Center, ETSI, Telecomunicación, Universidad Politécnica de Madrid, 28040 Madrid, Spain

Search for more papers by this author

Carlos R. del-Blanco

[email protected]

https://orcid.org/0000-0003-0618-3488

Grupo de Tratamiento de Imágenes (GTI), Information Processing and Telecommunications Center, ETSI, Telecomunicación, Universidad Politécnica de Madrid, 28040 Madrid, Spain

Search for more papers by this author

Hugo L. Fernandes

[email protected]

Singuli, Brooklyn, New York 11217

Search for more papers by this author

Published Online:23 Jun 2026https://doi.org/10.1287/ijoc.2025.1211

Abstract

Research on reinforcement learning (RL) approaches for discrete optimization problems has increased considerably, extending RL to areas classically dominated by operations research (OR). Vehicle routing problems are a good example of discrete optimization problems with high practical relevance for which RL techniques have achieved notable success. Despite these advances, open-source development frameworks remain scarce, hindering both algorithm testing and objective comparison of results. This situation ultimately slows down progress in the field and limits the exchange of ideas between the RL and OR communities. Here, we propose MAEnvs4VRP library, a unified framework for multiagent vehicle routing environments that supports classical, dynamic, stochastic, and multitask problem variants within a single modular design. The library, built on PyTorch, provides a flexible and modular architecture design that facilitates customization and the incorporation of new routing problems. It follows the agent environment cycle (“AEC”) games model and features an intuitive API, enabling rapid adoption and seamless integration into existing reinforcement learning frameworks.

History: Accepted by Ted Ralphs, Area Editor for Software Tools.

Funding: R. Gama gratefully acknowledges the Research Centre in Digital Services (CISeD), the Instituto Politécnico de Viseu, and the Foundation for Science and Technology, I.P. (FCT) for their support during the work [Grants UIDB/05583/2020 and 2023.13303.CPCA.A0].

Supplemental Material: The software that supports the findings of this study is available within the paper and its Supplemental Information (https://pubsonline.informs.org/doi/suppl/10.1287/ijoc.2025.1211) as well as from the IJOC GitHub software repository (https://github.com/INFORMSJoC/2025.1211). The complete IJOC Software and Data Repository is available at https://informsjoc.github.io/.

cover image INFORMS Journal on Computing

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Received:March 08, 2025
Accepted:May 17, 2026
Published Online:June 23, 2026

Cite as

Ricardo Gama, Ricardo Cunha, Daniel Fuertes, Carlos R. del-Blanco, Hugo L. Fernandes (2026) Multiagent Environments for Vehicle Routing Problems. INFORMS Journal on Computing 0(0).

https://doi.org/10.1287/ijoc.2025.1211

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Multiagent Environments for Vehicle Routing Problems

Abstract

Articles In Advance

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News