Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization
- Shicong Cen,
Corresponding Author
Shicong Cen
[email protected]Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213;
- Chen Cheng,
Chen Cheng
[email protected]Department of Statistics, Stanford University, Stanford, California 94305;
- Yuxin Chen ,
Yuxin Chen
[email protected]https://orcid.org/0000-0001-9256-5815
Department of Electrical and Computer Engineering, Princeton University, Princeton, New Jersey 08544;
- Yuting Wei ,
Yuting Wei
[email protected]https://orcid.org/0000-0002-3041-3434
Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania 19104
- Yuejie Chi
Yuejie Chi
[email protected]https://orcid.org/0000-0002-6766-5459
Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213;
Corresponding Author
Shicong Cen
[email protected]Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213;
Chen Cheng
[email protected]Department of Statistics, Stanford University, Stanford, California 94305;
Yuxin Chen
[email protected]https://orcid.org/0000-0001-9256-5815
Department of Electrical and Computer Engineering, Princeton University, Princeton, New Jersey 08544;
Yuting Wei
[email protected]https://orcid.org/0000-0002-3041-3434
Department of Statistics and Data Science, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania 19104
Yuejie Chi
[email protected]https://orcid.org/0000-0002-6766-5459
Department of Electrical and Computer Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213;

