Learning Markov Models Via Low-Rank Optimization

Ziwei Zhu
Ziwei Zhu
[email protected]
https://orcid.org/0000-0001-9536-0575
Department of Statistics, University of Michigan, Ann Arbor, Michigan 48109;
Search for more papers by this author
,
Xudong Li
Xudong Li
[email protected]
School of Data Science, Fudan University, Shanghai 200433, China;Shanghai Center for Mathematical Sciences, Fudan University, Shanghai 200433, China;
Search for more papers by this author
,
Mengdi Wang
Mengdi Wang
[email protected]
https://orcid.org/0000-0002-2101-9507
Department of Electrical Engineering, Princeton University, Princeton, New Jersey 08544;Center for Statistics and Machine Learning, Princeton University, Princeton, New Jersey 08544;
Search for more papers by this author
,
Anru Zhang
Anru Zhang
[email protected]
https://orcid.org/0000-0002-8721-5252
Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin 53706;Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina 27710
Search for more papers by this author

Department of Statistics, University of Michigan, Ann Arbor, Michigan 48109;

School of Data Science, Fudan University, Shanghai 200433, China;Shanghai Center for Mathematical Sciences, Fudan University, Shanghai 200433, China;

Search for more papers by this author

Mengdi Wang

[email protected]

https://orcid.org/0000-0002-2101-9507

Department of Electrical Engineering, Princeton University, Princeton, New Jersey 08544;Center for Statistics and Machine Learning, Princeton University, Princeton, New Jersey 08544;

Search for more papers by this author

Anru Zhang

[email protected]

https://orcid.org/0000-0002-8721-5252

Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin 53706;Department of Biostatistics and Bioinformatics, Duke University, Durham, North Carolina 27710

Search for more papers by this author

Published Online:23 Nov 2021https://doi.org/10.1287/opre.2021.2115

Abstract

Modeling unknown systems from data is a precursor of system optimization and sequential decision making. In this paper, we focus on learning a Markov model from a single trajectory of states. Suppose that the transition model has a small rank despite having a large state space, meaning that the system admits a low-dimensional latent structure. We show that one can estimate the full transition model accurately using a trajectory of length that is proportional to the total number of states. We propose two maximum-likelihood estimation methods: a convex approach with nuclear norm regularization and a nonconvex approach with rank constraint. We explicitly derive the statistical rates of both estimators in terms of the Kullback-Leiber divergence and the $ℓ_{2}$ error and also establish a minimax lower bound to assess the tightness of these rates. For computing the nonconvex estimator, we develop a novel DC (difference of convex function) programming algorithm that starts with the convex M-estimator and then successively refines the solution till convergence. Empirical experiments demonstrate consistent superiority of the nonconvex estimator over the convex one.

Volume 70, Issue 4

July-August 2022

Pages iii-vii, 1953-2596, C2-C3

Article Information

Supplemental Material

Metrics

Information

Received:June 18, 2019
Accepted:November 25, 2020
Published Online:November 23, 2021

Cite as

Ziwei Zhu, Xudong Li, Mengdi Wang, Anru Zhang (2021) Learning Markov Models Via Low-Rank Optimization. Operations Research 70(4):2384-2398.

https://doi.org/10.1287/opre.2021.2115

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Learning Markov Models Via Low-Rank Optimization

Abstract

Volume 70, Issue 4

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News