Transfer Learning, Cross Learning and Co-Learning with Operational Data Analytics (ODA)
Abstract
Making decisions with limited data and incomplete statistical characterization is challenging. The typical statistical-machine-learning approaches would call for migrating the experience of a related system with ample data through transfer learning or leveraging the similarity of multiple systems with limited data through data pooling. We, instead, develop new solution concepts to learn across related systems by adapting the parametric Operational Data Analytics (ODA) framework, which is known to produce uniformly optimal data-integrated decisions in the corresponding parametric settings, for nonparametric decision-making. We demonstrate, through the application of newsvendor systems, that transfer learning can, indeed, improve decision performance in the focal system by utilizing a model pretrained with ample data in a related system. However, through the lens of the ODA framework, the best transfer-learning decision falls in a subclass of operational statistics, limiting the ultimate optimality. In contrast, the ODA cross-learning approach utilizes the ample data from the related system to mimic the stochastic environment of the focal system. When the data from the old system are sufficiently large, the cross-learning solutions derived outperform any transfer-learning solution, and they are shown to asymptotically approach the parametric ODA solutions. When there are multiple related systems with limited data, we aggregate the data from different systems to create a generic stochastic environment for the decision-making problem, which facilitates the implementation of the parametric ODA solutions. We show that the derived co-learning solutions are asymptotically optimal for the aggregate system and for each subsystem. This approach outperforms the existing data-pooling techniques in the sense that the latter focuses only on the aggregated performance, and the chosen solution may be (asymptotically) suboptimal for individual subsystems. Our results underscore the roles of domain knowledge and the structural relationships between the data and the decision in designing efficient learning solutions with limited data. Though we demonstrate our development through the application of newsvendor systems, the solutions developed in this study apply to a much wider class of operational decision-making problems that exhibit certain homogeneous properties.
This paper was accepted by David Simchi-Levi, operations management.
Funding: L. Li’s research is partly supported by the Research Grants Council of Hong Kong [Grant 15515324] and The Hong Kong Polytechnic University under Grant 1-BEAT.
Supplemental Material: The online appendix is available at https://doi.org/10.1287/mnsc.2023.03688.

