Corruption-Robust Exploration in Episodic Reinforcement Learning
- Thodoris Lykouris ,
Corresponding Author
Thodoris Lykouris
[email protected]https://orcid.org/0000-0002-3375-5579
Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
- Max Simchowitz ,
Max Simchowitz
[email protected]https://orcid.org/0000-0001-9900-1238
Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
- Aleksandrs Slivkins ,
Aleksandrs Slivkins
[email protected]https://orcid.org/0000-0001-6899-6383
Microsoft Research Lab, New York, New York 10012
- Wen Sun
Wen Sun
[email protected]https://orcid.org/0000-0003-4322-5878
Department of Computer Science, Cornell University, Ithaca, New York 14850
Corresponding Author
Thodoris Lykouris
[email protected]https://orcid.org/0000-0002-3375-5579
Sloan School of Management, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Max Simchowitz
[email protected]https://orcid.org/0000-0001-9900-1238
Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139
Aleksandrs Slivkins
[email protected]https://orcid.org/0000-0001-6899-6383
Microsoft Research Lab, New York, New York 10012
Wen Sun
[email protected]https://orcid.org/0000-0003-4322-5878
Department of Computer Science, Cornell University, Ithaca, New York 14850

