The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success

Published Online:https://doi.org/10.1287/mnsc.2020.3911

References

  • Ban G-Y, Karoui NE, Lim AEB (2018) Machine learning and portfolio optimization. Management Sci. 64(3):1136–1154.LinkGoogle Scholar
  • Bandari R, Asur S, Huberman B (2012) The pulse of news in social media: Forecasting popularity. Breslin J, ed. ICWSM 2012—Proc. 6th Internat. AAAI Conf. Weblogs Social Media (AAAI Press, Palo Alto, CA), 26–33.Google Scholar
  • Belloni A, Chernozhukov V (2013) Least squares after model selection in high-dimensional sparse models. Bernoulli 19(2):521–547.CrossrefGoogle Scholar
  • Bollen J, Mao H, Zheng X (2011) Twitter mood predicts the stock market. J. Comput. Sci. 2(1):1–8.CrossrefGoogle Scholar
  • Breiman L (1996) Bagging predictors. Machine Learn. 26:123–140.CrossrefGoogle Scholar
  • Breiman L (2001) Random forests. Machine Learn. 45:5–32.CrossrefGoogle Scholar
  • Breiman L, Friedman J, Stone CJ (1984) Classification and Regression Trees (Chapman and Hall/CRC, New York).Google Scholar
  • Brodley CE, Utgoff PE (1995) Multivariate decision trees. Machine Learn. 19(1):45–77.CrossrefGoogle Scholar
  • Campos J, Hendry DF, Krolzig H-M (2003) Consistent model selection by an automatic gets approach. Oxford Bull. Econom. Statist. 65(s1):803–819.CrossrefGoogle Scholar
  • Chaudhuri P, Huang M-C, Loh W-Y, Yao R (1994) Piecewise-polynomial regression trees. Statistica Sinica 4(1):143–167.Google Scholar
  • Chintagunta PK, Gopinath S, Venkataraman S (2010) The effects of online user reviews on movie box office performance: Accounting for sequential rollout and aggregation across local markets. Marketing Sci. 29(5):944–957.LinkGoogle Scholar
  • Chipman HA, George EI, McCulloch RE (2010) BART: Bayesian additive regression trees. Ann. Appl. Statist. 6(1):266–298.CrossrefGoogle Scholar
  • De Vany AS, Walls W (2004) Motion picture profit, the stable Paretian hypothesis, and the curse of the superstar. J. Econom. Dynamics Control 28(6):1035–1057.CrossrefGoogle Scholar
  • Dobra A, Gehrke J (2002) SECRET: A scalable linear regression tree algorithm. Proc. Eighth ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (ACM Press, New York), 481–487.Google Scholar
  • Drucker H, Burges CJC, Kaufman L, Smola A, Vapnik V (1996) Support vector regression machines. Mozer MC, Jordan MI, Petsche T, eds. Advances in Neural Information Processing Systems 9 (NIPS 1996) (MIT Press, Cambridge, MA), 155–161.Google Scholar
  • Fan G, Gray JB (2005) Regression tree analysis using TARGET. J. Computational Graphical Statist. 14(1):206–218.CrossrefGoogle Scholar
  • Genuer R, Poggi J-M, Tuleau-Malot C (2010) Variable selection using random forests. Pattern Recognition Lett. 31(14):2225–2236.CrossrefGoogle Scholar
  • Goh K-Y, Heng C-S, Lin Z (2013) Social media brand community and consumer behavior: Quantifying the relative impact of user- and marketer-generated content. Inform. Systems Res. 24(1):88–107.LinkGoogle Scholar
  • Gopinath S, Chintagunta PK, Venkataraman S (2013) Blogs, advertising, and local-market movie box office performance. Management Sci. 59(12):2635–2654.LinkGoogle Scholar
  • Gray JB, Fan G (2008) Classification tree analysis using TARGET. Computational Statist. Data Anal. 52(3):1362–1372.CrossrefGoogle Scholar
  • Hannak A, Anderson E, Barrett LF, Lehmann S, Mislove A, Riedewald M (2012) Tweetin’ in the rain: Exploring societal-scale effects of weather on mood. Breslin J, ed. Proc. Sixth Internat. AAAI Conf. Weblogs Social Media (AAAI Press, Palo Alto, CA), 479–482.Google Scholar
  • Hansen B (2014) Model averaging, asymptotic risk, and regressor groups. Quant. Econom. 5:495–530.CrossrefGoogle Scholar
  • Hansen BE, Racine JS (2012) Jackknife model averaging. J. Econometrics 167(1):38–46.CrossrefGoogle Scholar
  • Hansen PR (2005) A test for superior predictive ability. J. Bus. Econom. Statist. 23(4):365–380.CrossrefGoogle Scholar
  • Hastie T, Tibshirani R, Friedman J (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics (Springer New York, New York).CrossrefGoogle Scholar
  • Hendry DF, Nielsen B (2007) Econometric Modeling: A Likelihood Approach (Princeton University Press, Princeton, NJ), 286–301.Google Scholar
  • Hernández B, Raftery AE, Pennington SR, Parnell AC (2018) Bayesian additive regression trees using Bayesian model averaging. Statist. Comput. 28(4):869–890.CrossrefGoogle Scholar
  • Hothorn T, Hornik K, Zeileis A (2006) Unbiased recursive partitioning: A conditional inference framework. J. Computational Graphical Statist. 15(3):651–674.CrossrefGoogle Scholar
  • Ishwaran H (2007) Variable importance in binary regression trees and forests. Electronic J. Statist. 1:519–537.CrossrefGoogle Scholar
  • Kim H, Loh W-Y (2003) Classification trees with bivariate linear discriminant node models. J. Computational Graphical Statist. 12(3):512–530.CrossrefGoogle Scholar
  • Lehrer SF, Xie T (2017) Box office buzz: Does social media data steal the show from model uncertainty when forecasting for Hollywood? Rev. Econom. Statist. 99(5):749–755.CrossrefGoogle Scholar
  • Liu Q, Okui R (2013) Heteroskedasticity-robust Cp model averaging. Econom. J. 16:463–472.CrossrefGoogle Scholar
  • Liu Y (2006) Word of mouth for movies: Its dynamics and impact on box office revenue. J. Marketing 70(3):74–89.CrossrefGoogle Scholar
  • Loh W-Y, Shih Y-S (1997) Split selection methods for classification trees. Statistica Sinica 7(4):815–840.Google Scholar
  • Manski CF (2004) Statistical treatment rules for heterogeneous populations. Econometrica 72(4):1221–1246.CrossrefGoogle Scholar
  • Murthy SK, Kasif S, Salzberg S (1994) A system for induction of oblique decision trees. J. Artificial Intelligence Res. 2(1994):1–32.CrossrefGoogle Scholar
  • Pratola MT, Chipman HA, George EI, McCulloch RE (2020) Heteroscedastic BART via multiplicative regression trees. J. Computational Graphical Statist. 29(2):405–417.CrossrefGoogle Scholar
  • Quinlan JR (1992) Learning with Continuous Classes (World Scientific, Singapore), 343–348.Google Scholar
  • Silva JMCS, Tenreyro S (2006) The log of gravity. Rev. Econom. Statist. 88(4):641–658.CrossrefGoogle Scholar
  • Steel MF (2020) Model averaging and its use in economics. J. Econom. Lit. 58(3):644–719.Google Scholar
  • Strobl C, Boulesteix A-L, Kneib T, Augustin T, Zeileis A (2008) Conditional variable importance for random forests. BMC Bioinformatics 9(1):307.CrossrefGoogle Scholar
  • Suykens J, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Processing Lett. 9(1999):293–300.CrossrefGoogle Scholar
  • Ullah A, Wang H (2013) Parametric and nonparametric frequentist model selection and model averaging. Econometrics 1:157–179.CrossrefGoogle Scholar
  • Vasilios P, Theophilos P, Periklis G (2015) Forecasting daily and monthly exchange rates with machine learning techniques. J. Forecasting 34(7):560–573.CrossrefGoogle Scholar
  • Wager S, Athey S (2018) Estimation and inference of heterogeneous treatment effects using random forests. J. Amer. Statist. Assoc. 113(523):1228–1242.CrossrefGoogle Scholar
  • Wan AT, Zhang X, Zou G (2010) Least squares model averaging by Mallows criterion. J. Econometrics 156(2):277–283.CrossrefGoogle Scholar
  • Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans. Evolutionary Comput. 1(1):67–82.CrossrefGoogle Scholar
  • Xie T (2015) Prediction model averaging estimator. Econom. Lett. 131:5–8.CrossrefGoogle Scholar
  • Xie T (2017) Heteroscedasticity-robust model screening: A useful toolkit for model averaging in big data analytics. Econom. Lett. 151:119–122.CrossrefGoogle Scholar
  • Xiong G, Bharadwaj S (2014) Prerelease buzz evolution patterns and new product performance. Marketing Sci. 33(3):401–421.LinkGoogle Scholar
  • Yuan Z, Yang Y (2005) Combining linear regression models: When and how? J. Amer. Statist. Assoc. 100(472):1202–1214.CrossrefGoogle Scholar
  • Zhang X, Ullah A, Zhao S (2016a) On the dominance of Mallows model averaging estimator over ordinary least squares estimator. Econom. Lett. 142:69–73.CrossrefGoogle Scholar
  • Zhang X, Zou G, Carroll RJ (2015) Model averaging based on Kullback-Leibler distance. Statistica Sinica 25(4):1583–1598.Google Scholar
  • Zhang X, Yu D, Zou G, Liang H (2016b) Optimal model averaging estimation for generalized linear models and generalized linear mixed-effects models. J. Amer. Statist. Assoc. 111(516):1775–1790.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.