Trimmed Statistical Estimation via Variance Reduction

Aleksandr Aravkin
Corresponding Author
Aleksandr Aravkin
Department of Applied Mathematics, University of Washington, Seattle, Washington 98195
Search for more papers by this author
,
Damek Davis
Damek Davis
http://orcid.org/0000-0003-2105-4641
School of Operations Research and Information Engineering, Cornell University, Ithaca, New York 14850
Search for more papers by this author

Aleksandr Aravkin

Corresponding Author

Aleksandr Aravkin

Department of Applied Mathematics, University of Washington, Seattle, Washington 98195

Search for more papers by this author

Damek Davis

http://orcid.org/0000-0003-2105-4641

School of Operations Research and Information Engineering, Cornell University, Ithaca, New York 14850

Search for more papers by this author

Published Online:5 Jul 2019https://doi.org/10.1287/moor.2019.0992

Abstract

In this paper, we show how to transform any optimization problem that arises from fitting a machine learning model into one that (1) detects and removes contaminated data from the training set while (2) simultaneously fitting the trimmed model on the uncontaminated data that remains. To solve the resulting nonconvex optimization problem, we introduce a fast stochastic proximal-gradient algorithm that incorporates prior knowledge through nonsmooth regularization. For data sets of size n, our approach requires O(n^2/3/ℇ) gradient evaluations to reach ℇ-accuracy, and when a certain error bound holds, the complexity improves to O(κn^2/3 log(1/ℇ)), where κ is a “condition number.” These rates are n^1/3 times better than those achieved by typical, nonstochastic methods.

cover image Mathematics of Operations Research

Volume 45, Issue 1

February 2020

Pages 1-401, C2

Article Information

Metrics

Information

Received:February 20, 2018
Accepted:December 23, 2018
Published Online:July 05, 2019

Cite as

Aleksandr Aravkin, Damek Davis (2019) Trimmed Statistical Estimation via Variance Reduction. Mathematics of Operations Research 45(1):292-322.

https://doi.org/10.1287/moor.2019.0992

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Trimmed Statistical Estimation via Variance Reduction

Abstract

Volume 45, Issue 1

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News