The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success

Steven F. Lehrer
Corresponding Author
Steven F. Lehrer
[email protected]
https://orcid.org/0000-0001-6715-4096
Department of Economics, Queen’s University, Kingston, Ontario K7L3N6, Canada;National Bureau of Economic Research, Cambridge, Massachusetts 02138;
Search for more papers by this author
,
Tian Xie
Tian Xie
[email protected]
College of Business, Shanghai University of Finance and Economics, Shanghai 200433, China
Search for more papers by this author

Corresponding Author

Steven F. Lehrer

Department of Economics, Queen’s University, Kingston, Ontario K7L3N6, Canada;National Bureau of Economic Research, Cambridge, Massachusetts 02138;

Search for more papers by this author

Tian Xie

[email protected]

College of Business, Shanghai University of Finance and Economics, Shanghai 200433, China

Search for more papers by this author

Published Online:12 Mar 2021https://doi.org/10.1287/mnsc.2020.3911

References

Ban G-Y, Karoui NE, Lim AEB (2018) Machine learning and portfolio optimization. Management Sci. 64(3):1136–1154.Link, Google Scholar
Bandari R, Asur S, Huberman B (2012) The pulse of news in social media: Forecasting popularity. Breslin J, ed. ICWSM 2012—Proc. 6th Internat. AAAI Conf. Weblogs Social Media (AAAI Press, Palo Alto, CA), 26–33.Google Scholar
Belloni A, Chernozhukov V (2013) Least squares after model selection in high-dimensional sparse models. Bernoulli 19(2):521–547.Crossref, Google Scholar
Bollen J, Mao H, Zheng X (2011) Twitter mood predicts the stock market. J. Comput. Sci. 2(1):1–8.Crossref, Google Scholar
Breiman L (1996) Bagging predictors. Machine Learn. 26:123–140.Crossref, Google Scholar
Breiman L (2001) Random forests. Machine Learn. 45:5–32.Crossref, Google Scholar
Breiman L, Friedman J, Stone CJ (1984) Classification and Regression Trees (Chapman and Hall/CRC, New York).Google Scholar
Brodley CE, Utgoff PE (1995) Multivariate decision trees. Machine Learn. 19(1):45–77.Crossref, Google Scholar
Campos J, Hendry DF, Krolzig H-M (2003) Consistent model selection by an automatic gets approach. Oxford Bull. Econom. Statist. 65(s1):803–819.Crossref, Google Scholar
Chaudhuri P, Huang M-C, Loh W-Y, Yao R (1994) Piecewise-polynomial regression trees. Statistica Sinica 4(1):143–167.Google Scholar
Chintagunta PK, Gopinath S, Venkataraman S (2010) The effects of online user reviews on movie box office performance: Accounting for sequential rollout and aggregation across local markets. Marketing Sci. 29(5):944–957.Link, Google Scholar
Chipman HA, George EI, McCulloch RE (2010) BART: Bayesian additive regression trees. Ann. Appl. Statist. 6(1):266–298.Crossref, Google Scholar
De Vany AS, Walls W (2004) Motion picture profit, the stable Paretian hypothesis, and the curse of the superstar. J. Econom. Dynamics Control 28(6):1035–1057.Crossref, Google Scholar
Dobra A, Gehrke J (2002) SECRET: A scalable linear regression tree algorithm. Proc. Eighth ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (ACM Press, New York), 481–487.Google Scholar
Drucker H, Burges CJC, Kaufman L, Smola A, Vapnik V (1996) Support vector regression machines. Mozer MC, Jordan MI, Petsche T, eds. Advances in Neural Information Processing Systems 9 (NIPS 1996) (MIT Press, Cambridge, MA), 155–161.Google Scholar
Fan G, Gray JB (2005) Regression tree analysis using TARGET. J. Computational Graphical Statist. 14(1):206–218.Crossref, Google Scholar
Genuer R, Poggi J-M, Tuleau-Malot C (2010) Variable selection using random forests. Pattern Recognition Lett. 31(14):2225–2236.Crossref, Google Scholar
Goh K-Y, Heng C-S, Lin Z (2013) Social media brand community and consumer behavior: Quantifying the relative impact of user- and marketer-generated content. Inform. Systems Res. 24(1):88–107.Link, Google Scholar
Gopinath S, Chintagunta PK, Venkataraman S (2013) Blogs, advertising, and local-market movie box office performance. Management Sci. 59(12):2635–2654.Link, Google Scholar
Gray JB, Fan G (2008) Classification tree analysis using TARGET. Computational Statist. Data Anal. 52(3):1362–1372.Crossref, Google Scholar
Hannak A, Anderson E, Barrett LF, Lehmann S, Mislove A, Riedewald M (2012) Tweetin’ in the rain: Exploring societal-scale effects of weather on mood. Breslin J, ed. Proc. Sixth Internat. AAAI Conf. Weblogs Social Media (AAAI Press, Palo Alto, CA), 479–482.Google Scholar
Hansen B (2014) Model averaging, asymptotic risk, and regressor groups. Quant. Econom. 5:495–530.Crossref, Google Scholar
Hansen BE, Racine JS (2012) Jackknife model averaging. J. Econometrics 167(1):38–46.Crossref, Google Scholar
Hansen PR (2005) A test for superior predictive ability. J. Bus. Econom. Statist. 23(4):365–380.Crossref, Google Scholar
Hastie T, Tibshirani R, Friedman J (2009) The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer Series in Statistics (Springer New York, New York).Crossref, Google Scholar
Hendry DF, Nielsen B (2007) Econometric Modeling: A Likelihood Approach (Princeton University Press, Princeton, NJ), 286–301.Google Scholar
Hernández B, Raftery AE, Pennington SR, Parnell AC (2018) Bayesian additive regression trees using Bayesian model averaging. Statist. Comput. 28(4):869–890.Crossref, Google Scholar
Hothorn T, Hornik K, Zeileis A (2006) Unbiased recursive partitioning: A conditional inference framework. J. Computational Graphical Statist. 15(3):651–674.Crossref, Google Scholar
Ishwaran H (2007) Variable importance in binary regression trees and forests. Electronic J. Statist. 1:519–537.Crossref, Google Scholar
Kim H, Loh W-Y (2003) Classification trees with bivariate linear discriminant node models. J. Computational Graphical Statist. 12(3):512–530.Crossref, Google Scholar
Lehrer SF, Xie T (2017) Box office buzz: Does social media data steal the show from model uncertainty when forecasting for Hollywood? Rev. Econom. Statist. 99(5):749–755.Crossref, Google Scholar
Liu Q, Okui R (2013) Heteroskedasticity-robust Cp model averaging. Econom. J. 16:463–472.Crossref, Google Scholar
Liu Y (2006) Word of mouth for movies: Its dynamics and impact on box office revenue. J. Marketing 70(3):74–89.Crossref, Google Scholar
Loh W-Y, Shih Y-S (1997) Split selection methods for classification trees. Statistica Sinica 7(4):815–840.Google Scholar
Manski CF (2004) Statistical treatment rules for heterogeneous populations. Econometrica 72(4):1221–1246.Crossref, Google Scholar
Murthy SK, Kasif S, Salzberg S (1994) A system for induction of oblique decision trees. J. Artificial Intelligence Res. 2(1994):1–32.Crossref, Google Scholar
Pratola MT, Chipman HA, George EI, McCulloch RE (2020) Heteroscedastic BART via multiplicative regression trees. J. Computational Graphical Statist. 29(2):405–417.Crossref, Google Scholar
Quinlan JR (1992) Learning with Continuous Classes (World Scientific, Singapore), 343–348.Google Scholar
Silva JMCS, Tenreyro S (2006) The log of gravity. Rev. Econom. Statist. 88(4):641–658.Crossref, Google Scholar
Steel MF (2020) Model averaging and its use in economics. J. Econom. Lit. 58(3):644–719.Google Scholar
Strobl C, Boulesteix A-L, Kneib T, Augustin T, Zeileis A (2008) Conditional variable importance for random forests. BMC Bioinformatics 9(1):307.Crossref, Google Scholar
Suykens J, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Processing Lett. 9(1999):293–300.Crossref, Google Scholar
Ullah A, Wang H (2013) Parametric and nonparametric frequentist model selection and model averaging. Econometrics 1:157–179.Crossref, Google Scholar
Vasilios P, Theophilos P, Periklis G (2015) Forecasting daily and monthly exchange rates with machine learning techniques. J. Forecasting 34(7):560–573.Crossref, Google Scholar
Wager S, Athey S (2018) Estimation and inference of heterogeneous treatment effects using random forests. J. Amer. Statist. Assoc. 113(523):1228–1242.Crossref, Google Scholar
Wan AT, Zhang X, Zou G (2010) Least squares model averaging by Mallows criterion. J. Econometrics 156(2):277–283.Crossref, Google Scholar
Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans. Evolutionary Comput. 1(1):67–82.Crossref, Google Scholar
Xie T (2015) Prediction model averaging estimator. Econom. Lett. 131:5–8.Crossref, Google Scholar
Xie T (2017) Heteroscedasticity-robust model screening: A useful toolkit for model averaging in big data analytics. Econom. Lett. 151:119–122.Crossref, Google Scholar
Xiong G, Bharadwaj S (2014) Prerelease buzz evolution patterns and new product performance. Marketing Sci. 33(3):401–421.Link, Google Scholar
Yuan Z, Yang Y (2005) Combining linear regression models: When and how? J. Amer. Statist. Assoc. 100(472):1202–1214.Crossref, Google Scholar
Zhang X, Ullah A, Zhao S (2016a) On the dominance of Mallows model averaging estimator over ordinary least squares estimator. Econom. Lett. 142:69–73.Crossref, Google Scholar
Zhang X, Zou G, Carroll RJ (2015) Model averaging based on Kullback-Leibler distance. Statistica Sinica 25(4):1583–1598.Google Scholar
Zhang X, Yu D, Zou G, Liang H (2016b) Optimal model averaging estimation for generalized linear models and generalized linear mixed-effects models. J. Amer. Statist. Assoc. 111(516):1775–1790.Crossref, Google Scholar

Volume 68, Issue 1

January 2022

Pages 1-808, iv-v

Article Information

Supplemental Material

Metrics

Information

Received:June 21, 2018
Accepted:November 07, 2020
Published Online:March 12, 2021

Cite as

Steven F. Lehrer, Tian Xie (2021) The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success. Management Science 68(1):189-210.

https://doi.org/10.1287/mnsc.2020.3911

Keywords

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

The Bigger Picture: Combining Econometrics with Analytics Improves Forecasts of Movie Success

References

Volume 68, Issue 1

Article Information

Supplemental Material

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News