Open Access

Algorithm Supported Induction for Building Theory: How Can We Use Prediction Models to Theorize?

Yash Raj Shrestha
Corresponding Author
Yash Raj Shrestha
[email protected]
https://orcid.org/0000-0002-2699-4723
Department of Management, Technology, and Economics, ETH Zürich, Zurich CH 8092, Switzerland;
Search for more papers by this author
,
Vivianna Fang He
Vivianna Fang He
[email protected]
https://orcid.org/0000-0003-2591-7838
Management Department, École Supérieure des Sciences Economiques et Commerciales (ESSEC) Business School, 95021 Cergy-Pontoise Cedex, France;
Search for more papers by this author
,
Phanish Puranam
Phanish Puranam
[email protected]
https://orcid.org/0000-0002-0032-8538
Strategy Department, INSEAD, Singapore, Singapore 138676
Search for more papers by this author
,
Georg von Krogh
Georg von Krogh
[email protected]
https://orcid.org/0000-0002-1203-3569
Department of Management, Technology, and Economics, ETH Zürich, Zurich CH 8092, Switzerland;
Search for more papers by this author

Yash Raj Shrestha

Corresponding Author

Yash Raj Shrestha

[email protected]

https://orcid.org/0000-0002-2699-4723

Department of Management, Technology, and Economics, ETH Zürich, Zurich CH 8092, Switzerland;

Search for more papers by this author

Vivianna Fang He

[email protected]

https://orcid.org/0000-0003-2591-7838

Management Department, École Supérieure des Sciences Economiques et Commerciales (ESSEC) Business School, 95021 Cergy-Pontoise Cedex, France;

Search for more papers by this author

Phanish Puranam

[email protected]

https://orcid.org/0000-0002-0032-8538

Strategy Department, INSEAD, Singapore, Singapore 138676

Search for more papers by this author

Georg von Krogh

[email protected]

https://orcid.org/0000-0002-1203-3569

Department of Management, Technology, and Economics, ETH Zürich, Zurich CH 8092, Switzerland;

Search for more papers by this author

Published Online:9 Dec 2020https://doi.org/10.1287/orsc.2020.1382

References

Abu-Mostafa YS , Magdon-Ismail M , Lin HT (2012) Learning from Data (AMLBook, New York).Google Scholar
Aguinis H , Solarino AM (2019) Transparency and replicability in qualitative research: The case of interviews with elite informants. Strategic Management J. 40(8):1291–1315.Google Scholar
Alpaydin E (2014) Introduction to Machine Learning (MIT Press, Cambridge, MA). Google Scholar
Athey S , Imbens G (2016) Recursive partitioning for heterogeneous causal effects. Proc. National Acad. Sci. USA 113(27):7353–7360.Crossref, Google Scholar
Athey S , Tibshirani J , Wager S (2019) Generalized random forests. Ann. Statist. 47(2):1148–1178.Crossref, Google Scholar
Bamberger PA (2018) AMD—Clarifying what we are about and where we are going. Acad. Management Discovery 4(1):1–10.Crossref, Google Scholar
Bao Y , Datta A (2014) Simultaneously discovering and quantifying risk types from textual risk disclosures. Management Sci. 60(6):1371–1391.Link, Google Scholar
Behfar K , Okhuysen GA (2018) Perspective—Discovery within validation logic: Deliberately surfacing, complementing, and substituting abductive reasoning in hypothetico-deductive inquiry. Organ. Sci. 29(2):323–340.Link, Google Scholar
Belloni A , Chernozhukov V , Hansen C (2013) Inference on treatment effects after selection among high-dimensional controls. Rev. Econom. Stud. 81(2):608–650.Crossref, Google Scholar
Bishop CM (2006) Pattern Recognition and Machine Learning (Springer, New York).Google Scholar
Blei DM (2012) Probabilistic topic models. Comm. ACM 55(4):77.Crossref, Google Scholar
Boughorbel S , Jarray F , El-Anbari M (2017) Optimal classifier for imbalanced data using Matthews correlation coefficient metric. PLoS One 12(6):e0177678.Crossref, Google Scholar
Breiman L (2001) Random forests. Machine Learning 45(1):5–32.Crossref, Google Scholar
Burton RM , Obel B (2011) Computational modeling for what-is, what-might-be, and what-should-be studies—And triangulation. Organ. Sci. 22(5):1195–1202.Link, Google Scholar
Bzdok D (2017) Classical statistics and statistical learning in imaging neuroscience. Frontiers Neurosci. 11:543.Crossref, Google Scholar
Chen LF , Liao HYM , Ko MT , Lin JC , Yu GJ (2000) A new LDA-based face recognition system which can solve the small sample size problem. Pattern Recognition 33(10):1713–1726.Crossref, Google Scholar
Christensen K , Nørskov S , Frederiksen L , Scholderer J (2017) In search of new product ideas: Identifying ideas in online communities by machine learning and text mining. Creative Innovative Management 26(1):17–30.Crossref, Google Scholar
Crowston K , Allen EE , Heckman R (2012) Using natural language processing technology for qualitative data analysis. Internat. J. Soc. Res. Methodology 15(6):523–543.Crossref, Google Scholar
Crowston K , Liu X , Allen EE (2010) Machine learning and rule-based automated coding of qualitative data. Marshall C, Toms E, Grove A, eds. Proc. ASIST Annual Meeting (John Wiley & Sons, Hoboken, NJ), 1–2.Google Scholar
Davis JMV , Heller SB (2017) Using causal forests to predict treatment heterogeneity: An application to summer jobs. Amer. Econom. Rev. 107:546–550.Crossref, Google Scholar
Deetz S (1996) Crossroads—Describing differences in approaches to organization science: Rethinking burrell and morgan and their legacy. Organ. Sci. 7(2):191–207.Link, Google Scholar
Dua D , Graff C (2017) Machine learning repository. Accessed June 1, 2016, http://archive.ics.uci.edu/ml.Google Scholar
Eastman W , Bailey JR (1998) Mediating the fact-value antinomy: Patterns in managerial and legal rhetoric, 1890–1990. Organ. Sci. 9(2):232–245.Link, Google Scholar
Eisenhardt KM (1989) Building theories from case study research. Acad. Management Rev. 14(4):532–550.Crossref, Google Scholar
Fischer T , Krauss C (2018) Deep learning with long short-term memory networks for financial market predictions. Eur. J. Oper. Res. 270(2):654–669.Crossref, Google Scholar
Fiss PC (2011) Building better causal theories: A fuzzy set approach to typologies in organization research. Acad. Management J. 54(2):393–420.Crossref, Google Scholar
Gelman A , Loken E (2014) The statistical crisis in science. Amer. Sci. 102(6):460–465.Crossref, Google Scholar
Glaser BG (2008) Doing Quantitative Grounded Theory (Sociology Press, Mill Valley, CA).Google Scholar
Glaser B , Strauss A (1967) The Discovery of Grounded Theory (Weidenfeld & Nicolson, London). Google Scholar
Goldfarb B , King A (2016) Scientific apophenia in strategic management research: Significance tests and mistaken inference. Strategic Management J. 37(1):167–176.Crossref, Google Scholar
Guidotti R , Monreale A , Ruggieri S , Turini F , Giannotti F , Pedreschi D (2018) A survey of methods for explaining black box models. ACM Comput. Survey 51(5):1–42.Crossref, Google Scholar
Haans RFJ , Pieters C , He Z (2016) Thinking about U: Theorizing and testing U‐and inverted U‐shaped relationships in strategy research. Strategic Management J. 37(7):1177–1195.Crossref, Google Scholar
Harrigan KR (1985) An application of clustering for strategic group analysis. Strategic Management J. 6(1):55–73.Crossref, Google Scholar
Hannigan TR , Seidel VP , Yakis-Douglas B (2018) Product innovation rumors as forms of open innovation. Res. Policy 47(5):953–964.Crossref, Google Scholar
Hannigan TR , Haans RFJ , Vakili K , Tchalian H , Glaser VL , Wang MS , Kaplan S , et al. . (2019) Topic modeling in management research: Rendering new theory from textual data. Acad. Management Ann. 13(2):586–632.Crossref, Google Scholar
He F , Puranam P , Shrestha YR , von Krogh G (2020) Resolving governance disputes in communities: A study of software license decisions. Strategic Management J. 41(10):1837–1868.Google Scholar
Helfat CE (2007) Stylized facts, empirical research and theory development in management. Strategic Organ. 5(2):185–192.Google Scholar
Huang AH , Lehavy R , Zang AY , Zheng R (2018) Analyst information discovery and interpretation roles: a topic modeling approach. Management Sci. 64(6):2833–2855.Link, Google Scholar
Hulland J (1999) Use of partial least squares (PLS) in strategic management research: A review of four recent studies. Strategic Management J. 20(2):195–204.Crossref, Google Scholar
Hunter JE , Schmidt FL , Jackson GB (1982) Meta-Analysis: Cumulating Research Findings across Studies (Sage Publications, New York).Google Scholar
Jiang Y , Li M , Zhou ZH (2009) Mining extremely small data sets with application to software reuse. Software Practice Experience 39(4):423–440.Crossref, Google Scholar
Kalnins A (2018) Multicollinearity: How common factors cause type 1 errors in multivariate regression. Strategic Management J. 39(8):2362–2385.Crossref, Google Scholar
Kamishima T , Akaho S , Sakuma J (2011) Fairness-aware learning through regularization approach.Google Scholar
Kleinberg J , Ludwig J , Mullainathan S , Obermeyer Z (2015) Prediction policy problems. Amer. Econom. Rev. Paper Proc. 105(5):491–495.Crossref, Google Scholar
Larson SC (1931) The shrinkage of the coefficient of multiple correlation. J. Edu. Psychol. 22(1):45.Crossref, Google Scholar
Lave CA , March JG (1993) An Introduction to Models in the Social Sciences (University Press of America, Lanham, MD).Google Scholar
LeCun Y , Bengio Y , Hinton G (2015) Deep learning. Nature 521(7553):436–444.Crossref, Google Scholar
Leonard-Barton D (1990) A dual methodology for case studies: Synergistic use of a longitudinal single site with replicated multiple sites. Organ. Sci. 1(3):248–266.Link, Google Scholar
Lewis MW , Grimes AJ (1999) Metatriangulation: Building theory from multiple paradigms. Acad. Management Rev. 24(4):672–690.Crossref, Google Scholar
Lipton ZC (2016) The mythos of model interpretability. Accessed December 11, 2015, http://www.kdnuggets.com/2015/04/model-interpretability-neural-networks-deep-learning.html.Google Scholar
Locke K (2015) Pragmatic reflections on a conversation about grounded theory in management and organization studies. Organ. Res. Methods 18(4):612–619.Google Scholar
Lu H, Eng HL, Guan C, Plataniotis KN, Venetsanopoulos AN (2010) Regularized common spatial pattern with aggregation for EEG classification in small-sample setting. IEEE Trans. Biomedical Engrg. 57(12):2936–2946.Google Scholar
March JG , Sproull LS , Tamuz M (1991) Learning from samples of one or fewer. Organ. Sci. 2(1):1–13.Link, Google Scholar
Marquardt DW , Snee RD (1975) Ridge regression in practice. Amer. Statist. 29(1):3–20.Crossref, Google Scholar
Medlock B , Briscoe T (2007) Weakly supervised learning for hedge classification in scientific literature. Carroll JA, van den Bosch A, Zaenen A, eds. Proc. 45th Annual Meeting Assoc. Comput. Linguistics (Association for Computational Linguistics, Prague, Czech Republic), 992–999.Google Scholar
Mintzberg H (1979) An emerging strategy of” direct” research. Admin. Sci. Quart. 24(4):582–589.Crossref, Google Scholar
Mitchell TM (1997) Machine Learning (McGraw-Hill, New York).Google Scholar
Mosteller F , Wallace DL (1963) Inference in an authorship problem: A comparative study of discrimination methods applied to the authorship of the disputed Federalist Papers. J. Amer. Statist. Assoc. 58(302):275–309.Google Scholar
Mullainathan S , Spiess J (2017) Machine learning: An applied econometric approach. J. Econ. Perspect. 31(2):87–106.Crossref, Google Scholar
Murphy AL , Pietro PG , Roman GC (2006) LIME: A coordination model and middleware supporting mobility of hosts and agents. ACM Trans. Software Engrg. Methodology 15(3):279–328.Crossref, Google Scholar
Pearl J (2000) Causal inference without counterfactuals. J. Amer. Statist. Assoc. 95(450):428–431.Google Scholar
Peirce CS (1878) Deduction, induction and hypothesis. Popular Sci. Monthly 13:470–482.Google Scholar
Popper KR (1959) The Logic of Scientific Discovery (Basic Books, Oxford, UK).Google Scholar
Pratt MG , Kaplan S , Whittington R (2019) Editorial essay: The tumult over transparency: decoupling transparency from replication in establishing trustworthy qualitative research. Admin. Sci. Quart. 65(1):1–19.Google Scholar
Puranam D , Narayan V , Kadiyali V (2017) The effect of calorie posting regulation on consumer opinion: A flexible latent dirichlet allocation model with informative priors. Marketing Sci. 36(5):726–746.Link, Google Scholar
Puranam P , Stieglitz N , Osman M , Pillutla MM (2015) Modelling bounded rationality in organizations: Progress and prospects. Acad. Management Ann. 9(1):337–392.Crossref, Google Scholar
Ragin CC (1987) The Comparative Method (University of California Press, Berkeley). Google Scholar
Ragin CC (2000) Fuzzy-Set Social Science . (University of Chicago Press). Google Scholar
Robert C (2014) Machine learning, a probabilistic perspective. Chance 27(2):62–63.Crossref, Google Scholar
Rudin C (2014) Algorithms for interpretable machine learning. Macskassy S, Perlich C, Leskovec J, Wang W, Ghani R, eds. Proc. 20th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York), 1519.Google Scholar
Samek W , Montavon G, Vedaldi A, Hansen LK, Müller KR, eds. (2017) Explainable AI: Interpreting, Explaining and Visualizing Deep Learning (Springer Nature, New York).Google Scholar
Sami Ul Haq Q , Tao L , Sun F , Yang S (2012) A fast and robust sparse approach for hyperspectral data classification using a few labeled samples. IEEE Trans. Geosci. Remote Sensing 50(6):2287–2302.Crossref, Google Scholar
Samuel AL (1959) Eight-move opening utilizing generalization learning. IBM J. 3(3):210–229.Crossref, Google Scholar
Shadish WR , Cook TD , Campbell DT (2002) Experimental and Quasi-Experimental Designs for Generalized Causal Inference (Houghton Mifflin, Boston). Google Scholar
Shah SK , Corley KG (2006) Building better theory by bridging the quantitative-qualitative divide. J. Management Stud. 43(8):1821–1835.Crossref, Google Scholar
Shaikhina T , Khovanova NA (2017) Handling limited datasets with neural networks in medical applications: A small-data approach. Artificial Intelligence Medicine 75:51–63.Crossref, Google Scholar
Shalev-Shwartz S , Ben-David S (2014) Understanding Machine Learning: From Theory to Algorithms (Cambridge University Press, Cambridge, UK). Crossref, Google Scholar
Shaver JM (2019) Interpreting interactions in linear fixed-effect regression models: When fixed-effect estimates are no longer within-effects. Strategy Sci. 4(1):25–40.Link, Google Scholar
Shrestha YR , Yang Y (2019) Fairness in algorithmic decision-making: Applications in multi-winner voting, machine learning, and recommender systems. Algorithms (Basel) 12(9):199.Crossref, Google Scholar
Shrestha YR , Ben-Menahem SM , von Krogh G (2019) Organizational decision-making structures in the age of artificial intelligence. California Management Rev. 61(4):66–83.Crossref, Google Scholar
Sutton RI (1997) Crossroads—The virtues of closet qualitative research. Organ. Sci. 8(1):97–106.Link, Google Scholar
Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J. Roy. Statist. Soc. B 58(1):267–288.Crossref, Google Scholar
Tonidandel S , King EB , Cortina JM (2018) Big data methods. Organ. Res. Methods 21(3):525–547.Crossref, Google Scholar
Varian HR (2014) Big data: New tricks for econometrics. J. Econom. Perspective 28(2):3–28.Crossref, Google Scholar
Varian HR (2016) How to build an economic model in your spare time. Amer. Econom. 61(1):81–90.Crossref, Google Scholar
von Krogh G (2018) Artificial intelligence in organizations: New opportunities for phenomenon-based theorizing. Acad. Managment Discovery 4(4):404–409.Crossref, Google Scholar
Walsh I , Holton JA , Bailyn L , Fernandez W , Levina N , Glaser B (2015) Rejoinder: Moving the management field forward. Organ. Res. Methods 18(4):620–628.Crossref, Google Scholar
Wolpert DH , Macready WG (1997) No free lunch theorems for optimization. IEEE Trans. Evolution Comput. 1(1):67–82.Crossref, Google Scholar
Yan JLS , McCracken N , Crowston K (2014) Semi-automatic content analysis of qualitative data. Accessed June 1, 2016, http://socqa.org/iConf2014.Google Scholar
Yang JB , Shen KQ , Ong CJ , Li XP (2009) Feature selection for MLP neural network: The use of random permutation of probabilistic outputs. IEEE Trans. Neural Networks 20(12):1911–1922.Crossref, Google Scholar
Yao S , Huang B (2017) Beyond parity: Fairness objectives for collaborative filtering. Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R eds. Advances in Neural Information Processing Systems, vol. 30 (Curran Associates, Red Hook, NY), 2921–2930.Google Scholar
Yin RK (2009) Case Study Research: Design and Methods (Sage Publications, Beverly Hills, CA). Google Scholar
Zelner BA (2009) Using simulation to interpret results from logit, probit, and other nonlinear models. Strategic Management J. 30(12):1335–1348.Crossref, Google Scholar
Zemel R , Wu Y , Swersky K , Pitassi T , Dwork C (2013) Learning fair representations. Dasgupta S, McAllester D, eds. Proc. 30th Internat. Conf. Machine Learning (PMLR, Atlanta), 325–333.Google Scholar
Zhou ZH , Jiang Y (2003) Medical diagnosis with C4.5 Rule preceded by artificial neural network ensemble. IEEE Trans. Inform. Tech. Biomedicine 7(1):37–42.Crossref, Google Scholar

Volume 32, Issue 3

May-June 2021

Pages 527-908, C2

Article Information

Metrics

Information

Received:April 30, 2019
Accepted:May 11, 2020
Published Online:December 09, 2020

Cite as

Yash Raj Shrestha, Vivianna Fang He, Phanish Puranam, Georg von Krogh (2020) Algorithm Supported Induction for Building Theory: How Can We Use Prediction Models to Theorize?. Organization Science 32(3):856-880.

https://doi.org/10.1287/orsc.2020.1382

Keywords

Acknowledgments

All authors contributed equally. The authors thank Gino Cattani for excellent editorial guidance; two reviewers for insightful comments; seminar participants at the Vienna Strategy Conference for comments received on earlier versions of this paper; and Binod Bhattarai, Shijing Cai, Nina Geilinger, Thomas Gersdorf, Bibek Paudel, Prothit Sen, Bart Vanneste, and Ce Zhang for feedback.

PDF download

Available Issues

Available Issues

Available Issues

Available Issues

Available Issues

Algorithm Supported Induction for Building Theory: How Can We Use Prediction Models to Theorize?

References

Volume 32, Issue 3

Article Information

Metrics

Information

Cite as

Keywords

Sign Up for INFORMS Publications Updates and News