Nonstationary and Sparsely-Correlated Multioutput Gaussian Process with Spike-and-Slab Prior

Published Online:https://doi.org/10.1287/ijds.2023.0022

References

  • AlBahar A, Kim I, Wang X, Yue X (2022) Physics-constrained Bayesian optimization for optimal actuators placement in composite structures assembly. IEEE Trans. Automation Sci. Engrg. 20(4):2772–2783.Google Scholar
  • Alvarez MA, Lawrence ND (2011) Computationally efficient convolved multiple output gaussian processes. J. Machine Learn. Res. 12:1459–1500.Google Scholar
  • Bai Y, Safikhani A, Michailidis G (2022) Hybrid modeling of regional Covid-19 transmission dynamics in the U.S. IEEE J. Selected Topics Signal Processing 16(2):261–275.Google Scholar
  • Boyle P, Frean M (2004) Dependent Gaussian processes. Saul L, Weiss Y, Bottou B, eds. Advances in Neural Information Processing Systems, vol. 17 (MIR Press, Cambridge, MA), 217–224.Google Scholar
  • Cheng S, Lu F, Peng P (2021) Short-term traffic forecasting by mining the non-stationarity of spatiotemporal patterns. IEEE Trans. Intelligent Transportation Systems 22(10):6365–6383.Google Scholar
  • Choong MK, Charbit M, Yan H (2009) Autoregressive-model-based missing value estimation for DNA microarray time series data. IEEE Trans. Inform. Tech. Biomedicine 13(1):131–137.Google Scholar
  • Christakos G (2012) Random Field Models in Earth Sciences (Courier Corporation, North Chelmsford, MA).Google Scholar
  • Damianou A, Lawrence ND (2013) Deep Gaussian processes. Carvalho CM, Ravikumar P, eds. Proc. 16th Internat. Conf. Artificial Intelligence Statist., Proc. Machine Learn. Res., vol. 31 (JMLR.org, Scottsdale, AZ), 207–215.Google Scholar
  • Dance H, Paige B (2022) Fast and scalable spike and slab variable selection in high-dimensional Gaussian processes. Camps-Valls G, Ruiz FJR, Valera I, eds. Proc. 25th Internat. Conf. Artificial Intelligence Statist., Proc. Machine Learn. Res., vol. 151 (JMLR.org), 7976–8002.Google Scholar
  • Fairchild KD, Lake DE, Kattwinkel J, Moorman JR, Bateman DA, Grieve PG, Isler JR, Sahni R (2017) Vital signs and their cross-correlation in sepsis and NEC: A study of 1,065 very-low-birth-weight infants in two NICUs. Pediatric Res. 81(2):315–321.Google Scholar
  • Fothergill S, Mentis H, Kohli P, Nowozin S (2012) Instructing people for training gestural interactive systems. Konstan JA, Chi H, Hook K, eds. Proc. SIGCHI Conf. Human Factors Comput. Systems (Association for Computing Machinery, New York), 1737–1746.Google Scholar
  • Frazier PI (2018) Bayesian optimization. Gel E, Ntaimo L, eds. Recent Advances in Optimization and Modeling of Contemporary Problems. Tutorials in Operations Research (INFORMS, Catonsville, MD), 255–278.LinkGoogle Scholar
  • Fricker TE, Oakley JE, Urban NM (2013) Multivariate Gaussian process emulators with nonseparable covariance structures. Technometrics 55(1):47–56.Google Scholar
  • Garg S, Singh A, Ramos F (2012) Learning non-stationary space-time models for environmental monitoring. Proc. 26th AAAI Conf. Artificial Intelligence (AAAI Press, Palo Alto, CA), 288–294.Google Scholar
  • Gelfand AE, Schmidt AM, Banerjee S, Sirmans C (2004) Nonstationary multivariate process modeling through spatially varying coregionalization. Test 13(2):263–312.Google Scholar
  • George EI, McCulloch RE (1993) Variable selection via Gibbs sampling. J. Amer. Statist. Assoc. 88(423):881–889.Google Scholar
  • Goulard M, Voltz M (1992) Linear coregionalization model: Tools for estimation and choice of cross-variogram matrix. Math. Geology 24(3):269–286.Google Scholar
  • Gramacy RB (2020) Surrogates: Gaussian Process Modeling, Design, and Optimization for the Applied Sciences (CRC Press, Boca Raton, FL).Google Scholar
  • Gramacy RB, Lee HKH (2008) Bayesian treed Gaussian process models with an application to computer modeling. J. Amer. Statist. Assoc. 103(483):1119–1130.Google Scholar
  • Heaton MJ, Christensen WF, Terres MA (2017) Nonstationary Gaussian process models using spatial hierarchical clustering from finite differences. Technometrics 59(1):93–101.Google Scholar
  • Heinonen M, Mannerström H, Rousu J, Kaski S, Lähdesmäki H (2016) Non-stationary Gaussian process regression with Hamiltonian Monte Carlo. Gretton A, Robert CC, eds. Proc. 19th Internat. Conf. Artificial Intelligence Statist., Proc. Machine Learn. Res., vol. 51 (JMLR.org, Cadiz, Spain), 732–740.Google Scholar
  • Hersbach H (2000) Decomposition of the continuous ranked probability score for ensemble prediction systems. Weather Forecasting 15(5):559–570.Google Scholar
  • Hu Z, Wang C (2021) Nonlinear online multioutput Gaussian process for multistream data informatics. IEEE Trans. Indust. Informatics 18(6):3885–3893.Google Scholar
  • Huber F, Koop G, Onorante L (2021) Inducing sparsity and shrinkage in time-varying parameter models. J. Bus. Econom. Statist. 39(3):669–683.Google Scholar
  • Ishwaran H, Rao JS (2005) Spike and slab variable selection: Frequentist and Bayesian strategies. Ann. Statist. 33(2):730–773.Google Scholar
  • Kalli M, Griffin JE (2014) Time-varying sparsity in dynamic regression models. J. Econometrics 178(2):779–793.Google Scholar
  • Kingma DP, Ba JL (2015) Adam: A method for stochastic optimization. Bengio Y, LeCun Y, eds. Proc. 3rd Internat. Conf. Learn. Representation (ICLR, San Diego).Google Scholar
  • Ko J, Kim H (2022) Deep Gaussian process models for integrating multifidelity experiments with nonstationary relationships. IISE Trans. 54(7):686–698.Google Scholar
  • Kontar R, Zhou S, Sankavaram C, Du X, Zhang Y (2018) Nonparametric modeling and prognosis of condition monitoring signals using multivariate gaussian convolution processes. Technometrics 60(4):484–496.Google Scholar
  • Kuss M, Rasmussen C (2003) Gaussian processes in reinforcement learning. Thrun S, Saul L, Schölkopf B, eds. Advances in Neural Information Processing Systems (NIPS 2003), vol. 16 (Whistler, British Columbia, Canada), 751–758.Google Scholar
  • Lee C, Wang K, Wu J, Cai W, Yue X (2023) Partitioned active learning for heterogeneous systems. J. Comput. Inform. Sci. Engrg. 23(4):041009-1–041009-11.Google Scholar
  • Liu Y, Gong C, Yang L, Chen Y (2020) DSTP-RNN: A dual-stage two-phase attention-based recurrent neural network for long-term and multivariate time series prediction. Expert Systems Appl. 143:113082-1–113082-12.Google Scholar
  • Liu Y, Wu H, Wang J, Long M (2022) Non-stationary transformers: Exploring the stationarity in time series forecasting. Koyejo S, Mohamed S, Agarwal A, Belgrave D, Cho K, Oh A, eds. Advances in Neural Information Processing Systems, vol. 35 (Curran Associates Inc., Red Hook, NY), 9881–9893.Google Scholar
  • Matthews AGDG, van der Wilk M, Nickson T, Fujii K, Boukouvalas A, León-Villagrá P, Ghahramani Z, Hensman J (2017) GPflow: A Gaussian process library using TensorFlow. J. Machine Learn. Res. 18(40):1–6.Google Scholar
  • Meng R, Soper B, Lee HK, Liu VX, Greene JD, Ray P (2021) Nonstationary multivariate Gaussian processes for electronic health records. J. Biomedical Informatics 117:103698-1–103698-11.Google Scholar
  • Moore AW (1990) Efficient memory-based learning for robot control. Technical Report Number 209, University of Cambridge, Cambridge, UK.Google Scholar
  • Paciorek C, Schervish M (2003) Nonstationary covariance functions for Gaussian process regression. Thrun S, Saul L, Schölkopf B, eds. Advances in Neural Information Processing Systems, vol. 16 (MIT Press, Cambridge, MA), 273–280.Google Scholar
  • Padakandla S, KJ P, Bhatnagar S (2020) Reinforcement learning algorithm for non-stationary environments. Appl. Intelligence 50(11):3590–3606.Google Scholar
  • Pan SJ, Yang Q (2009) A survey on transfer learning. IEEE Trans. Knowledge Data Engrg. 22(10):1345–1359.Google Scholar
  • Park C (2022) Jump Gaussian process model for estimating piecewise continuous regression functions. J. Machine Learn. Res. 23(278):1–37.Google Scholar
  • Paun I, Husmeier D, Torney CJ (2023) Stochastic variational inference for scalable non-stationary gaussian process regression. Statist. Comput. 33(2):44-1–44-21.Google Scholar
  • Rangapuram SS, Seeger MW, Gasthaus J, Stella L, Wang Y, Januschowski T (2018) Deep state space models for time series forecasting. Bengio S, Wallach H, Larochelle H, Grauman K, Cesa-Bianchi N, Garnett R, eds. Advances in Neural Information Processing Systems, vol. 31 (Curran Associates Inc., Red Hook, NY), 7796–7805.Google Scholar
  • Ročková V, George EI (2014) EMVS: The EM approach to Bayesian variable selection. J. Amer. Statist. Assoc. 109(506):828–846.Google Scholar
  • Rockova V, McAlinn K (2021) Dynamic variable selection with spike-and-slab process priors. Bayesian Anal. 16(1):233–269.Google Scholar
  • Rodrigues F, Henrickson K, Pereira FC (2019) Multi-output Gaussian processes for crowdsourced traffic data imputation. IEEE Trans. Intelligent Transportation Systems 20(2):594–603.Google Scholar
  • Scheipl F, Fahrmeir L, Kneib T (2012) Spike-and-slab priors for function selection in structured additive regression models. J. Amer. Statist. Assoc. 107(500):1518–1532.Google Scholar
  • Shand L, Li B (2017) Modeling nonstationarity in space and time. Biometrics 73(3):759–768.Google Scholar
  • Shen B, Gnanasambandam R, Wang R, Kong ZJ (2023) Multi-task Gaussian process upper confidence bound for hyperparameter tuning and its application for simulation studies of additive manufacturing. IISE Trans. 55(5):496–508.Google Scholar
  • Stathopoulos A, Karlaftis MG (2003) A multivariate state space approach for urban traffic flow modeling and prediction. Transportation Res. Part C Emerging Tech. 11(2):121–135.Google Scholar
  • Ton JF, Flaxman S, Sejdinovic D, Bhatt S (2018) Spatial mapping with Gaussian processes and nonstationary Fourier features. Spatial Statist. 28:59–78.Google Scholar
  • Verstraeten T, Libin PJ, Nowé A (2020) Fleet control using coregionalized Gaussian process policy iteration. De Giacomo G, Catala A, Dilkina B, Milano M, Barro S, Bugarin A, Lang J, eds. Proc. 24th Eur. Conf. Artificial Intelligence ECAI 2020 (IOS Press, Amsterdam), 1571–1578.Google Scholar
  • Wang S, Cao J, Philip SY (2020b) Deep learning for spatio-temporal data mining: A survey. IEEE Trans. Knowledge Data Engrg. 34(8):3681–3700.Google Scholar
  • Wang K, Hamelijnck O, Damoulas T, Steel M (2020a) Non-separable non-stationary random fields. Daume H, Singh A, eds. Proc. 37th Internat. Conf. Machine Learn., Proc. Machine Learn. Res., vol. 119 (JMLR.org), 9887–9897.Google Scholar
  • Wang X, Wang C, Song X, Kirby L, Wu J (2022) Regularized multi-output Gaussian convolution process with domain adaptation. IEEE Trans. Pattern Anal. Machine Intelligence 45(5):6142–6156.Google Scholar
  • Wang Y, Zhang J, Zhu H, Long M, Wang J, Yu PS (2019) Memory in memory: A predictive neural network for learning higher-order non-stationarity from spatiotemporal dynamics. Davis L, Torr P, Zhu SC, eds. Proc. IEEE/CVF Conf. Comput. Vision Pattern Recognition (IEEE Computer Society, Washington, DC), 9154–9162.Google Scholar
  • Wen Q, Zhou T, Zhang C, Chen W, Ma Z, Yan J, Sun L (2023) Transformers in time series: A survey. Proc. 32nd Internat. Joint Conf. Artificial Intelligence (IJCAI, Macao, SAR), 6778–6786.Google Scholar
  • Williams CK, Rasmussen CE (2006) Gaussian Processes for Machine Learning, vol. 2 (MIT Press, Cambridge, MA).Google Scholar
  • Xia D, Wang B, Li H, Li Y, Zhang Z (2016) A distributed spatial–temporal weighted model on MapReduce for short-term traffic flow forecasting. Neurocomputing 179:246–263.Google Scholar
  • Xu R, Wu J, Yue X, Li Y (2022) Online structural change-point detection of high-dimensional streaming data via dynamic sparse subspace learning. Technometrics 65(1):19–32.Google Scholar
  • Yang R, Xu H, Wu Y, Wang X (2020) Multi-task reinforcement learning with soft modularization. Larochelle H, Ranzato M, Hadsell R, Balcan MF, Lin H, eds. NIPS ‘20 Proc. 34th Internat. Conf. Neural Inform. Processing Systems (Curran Associates, Red Hook, NY), 4767–4777.Google Scholar
  • Ye R, Dai Q (2021) Implementing transfer learning across different datasets for time series forecasting. Pattern Recognition 109:106717-1–106717-12.Google Scholar
  • Yoon H, Li J (2018) A novel positive transfer learning approach for telemonitoring of Parkinson’s disease. IEEE Trans. Automation Sci. Engrg. 16(1):180–191.Google Scholar
  • Yun S, Zhang X, Li B (2022) Detection of local differences in spatial characteristics between two spatiotemporal random fields. J. Amer. Statist. Assoc. 117(537):291–306.Google Scholar
  • Zhang Y, Yang Q (2021) A survey on multi-task learning. IEEE Trans. Knowledge Data Engrg. 34(12):5586–5609.Google Scholar
  • Zhang L, Wang K, Chen N (2016) Monitoring wafers’ geometric quality using an additive gaussian process model. IIE Trans. 48(1):1–15.Google Scholar
  • Zhang C, Yan H, Lee S, Shi J (2021) Dynamic multivariate functional data modeling via sparse subspace learning. Technometrics 63(3):370–383.Google Scholar
  • Zhu C, Byrd RH, Lu P, Nocedal J (1997) Algorithm 778: L-bfgs-b: Fortran subroutines for large-scale bound-constrained optimization. ACM Trans. Math. Software 23(4):550–560.Google Scholar
  • Zhu Z, Lin K, Jain AK, Zhou J (2023) Transfer learning in deep reinforcement learning: A survey. IEEE Trans. Pattern Anal. Machine Intelligence 45(11):13344–13362.Google Scholar
  • Zou Z, Careem M, Dutta A, Thawdar N (2023) Joint spatio-temporal precoding for practical non-stationary wireless channels. IEEE Trans. Comm. 71(4):2396–2409.Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.