Jointly Modeling and Clustering Tensors in High Dimensions
Abstract
We consider the problem of jointly modeling and clustering populations of tensors by introducing a high-dimensional tensor mixture model with heterogeneous covariances. To effectively tackle the high dimensionality of tensor objects, we employ plausible dimension reduction assumptions that exploit the intrinsic structures of tensors, such as low rankness in the mean and separability in the covariance. In estimation, we develop an efficient high-dimensional expectation conditional maximization (HECM) algorithm that breaks the intractable optimization in the M step into a sequence of much simpler conditional optimization problems, each of which is convex, admits regularization, and has closed-form updating formulas. Our theoretical analysis is challenged by both the nonconvexity in the expectation maximization-type estimation and having access to only the solutions of conditional maximizations in the M step, leading to the notion of dual nonconvexity. We demonstrate that the proposed HECM algorithm, with an appropriate initialization, converges geometrically to a neighborhood that is within statistical precision of the true parameter. The efficacy of our proposed method is demonstrated through comparative numerical experiments and an application to a medical study, where our proposal achieves an improved clustering accuracy over existing benchmarking methods.
Funding: The research of J. Zhang was supported by the National Science Foundation [Grants DMS-2326893 and DMS-2329296]. The research of W. W. Sun was partially supported by the National Science Foundation [Grant SES-2217440].
Supplemental Material: All supplemental materials, including the code, data, and files required to reproduce the results, are available at https://doi.org/10.1287/opre.2021.0635.

