The authors thank department editor Omar Besbes, the associate editor, and two referees for constructive comments and suggestions that significantly improved both the content and the exposition of this paper. The authors also thank the MIT-IBM partnership in Artificial Intelligence and the MIT Data Science Lab for support. A preliminary version of this paper appeared in the 37th International Conference on Machine Learning (ICML 2020); the current paper substantially extends that version.