We analyze the efficiency of parallelization and restart mechanisms for stochastic simulations in model-free settings, where the underlying system dynamics are unknown. Such settings are common in Reinforcement Learning (RL) and rare-event estimation, where standard variance-reduction techniques like importance sampling are inapplicable. Focusing on the challenge of reaching rare states under a finite computational budget, we model exploration via random walks and Lévy processes. Based on rigorous probability analysis, our work reveals a phase transition in the success probability as a function of the number of parallel simulations: an optimal number $N^{*}$ exists, balancing exploration diversity and time allocation per simulation. Beyond this threshold, performance degrades exponentially. Furthermore, we demonstrate that a restart strategy, which reallocates resources from stagnant trajectories to promising regions, can yield an exponential improvement in success probability. In the context of RL, these strategies can improve policy gradient methods by enabling more efficient state-space exploration, leading to more accurate policy gradient estimates.
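The trade-off between the number of parallel simulations and the time given to each can be illustrated with a small Monte Carlo sketch. This is not the authors' model or code. It assumes a simple ±1 random walk with downward drift started at 0, a fixed total budget of steps split evenly among `n_walkers`, and success defined as any walker reaching `level`. The function names and all parameter values (`budget=200`, `level=10`, `p_up=0.4`) are hypothetical choices made for illustration, and the restart strategy is not implemented.

```python
import random


def walker_hits(level, steps, p_up, rng):
    """Return True if a +/-1 walk started at 0 reaches `level` within `steps` steps."""
    pos = 0
    for _ in range(steps):
        # Step up with probability p_up, otherwise step down.
        pos += 1 if rng.random() < p_up else -1
        if pos >= level:
            return True
    return False


def success_probability(n_walkers, budget, level, p_up, trials, seed=0):
    """Estimate P(at least one of n_walkers reaches `level`) under a shared step budget.

    Assumption (illustrative): the budget is split evenly, so each walker
    gets budget // n_walkers steps.
    """
    rng = random.Random(seed)
    steps = budget // n_walkers
    hits = 0
    for _ in range(trials):
        if any(walker_hits(level, steps, p_up, rng) for _ in range(n_walkers)):
            hits += 1
    return hits / trials


if __name__ == "__main__":
    # Sweep the number of parallel walkers for a fixed total budget.
    for n in (1, 2, 5, 10, 20, 50):
        estimate = success_probability(n, budget=200, level=10, p_up=0.4, trials=500)
        print(n, estimate)
```

With these assumed values, 50 walkers receive only 4 steps each and cannot reach level 10 at all, so the estimate for that case is exactly zero. That is the extreme end of the degradation that sets in once the number of simulations exceeds the optimum $N^{*}$.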

Funding: This research was supported by the SticAmsud LAGOON project, ANR EPLER, IRL-CNRS IFUMI-2030, and Action international CNRS.

Articles In Advance

Information

Received: March 26, 2025
Accepted: May 12, 2026
Published Online: June 15, 2026

Cite as

Ernesto Garcia, Paola Bermolen, Matthieu Jonckheere, Seva Shneer (2026) Efficiency of Parallel and Restart Exploration Strategies in Model-Free Stochastic Simulations. Stochastic Systems 0(0).

https://doi.org/10.1287/stsy.2025.0108

Acknowledgments

The authors are grateful to the anonymous referees for their thorough and insightful reviews; their suggestions and observations led to substantial improvements in this work. P.B. and E.G. also express their deep gratitude to professors E. Mordecki and J. R. León for very valuable discussions.

Efficiency of Parallel and Restart Exploration Strategies in Model-Free Stochastic Simulations
