Algorithm Supported Induction for Building Theory: How Can We Use Prediction Models to Theorize?

Published Online:https://doi.org/10.1287/orsc.2020.1382

References

  • Abu-Mostafa YS , Magdon-Ismail M , Lin HT (2012) Learning from Data (AMLBook, New York).Google Scholar
  • Aguinis H , Solarino AM (2019) Transparency and replicability in qualitative research: The case of interviews with elite informants. Strategic Management J. 40(8):1291–1315.Google Scholar
  • Alpaydin E (2014) Introduction to Machine Learning (MIT Press, Cambridge, MA). Google Scholar
  • Athey S , Imbens G (2016) Recursive partitioning for heterogeneous causal effects. Proc. National Acad. Sci. USA 113(27):7353–7360.CrossrefGoogle Scholar
  • Athey S , Tibshirani J , Wager S (2019) Generalized random forests. Ann. Statist. 47(2):1148–1178.CrossrefGoogle Scholar
  • Bamberger PA (2018) AMD—Clarifying what we are about and where we are going. Acad. Management Discovery 4(1):1–10.CrossrefGoogle Scholar
  • Bao Y , Datta A (2014) Simultaneously discovering and quantifying risk types from textual risk disclosures. Management Sci. 60(6):1371–1391.LinkGoogle Scholar
  • Behfar K , Okhuysen GA (2018) Perspective—Discovery within validation logic: Deliberately surfacing, complementing, and substituting abductive reasoning in hypothetico-deductive inquiry. Organ. Sci. 29(2):323–340.LinkGoogle Scholar
  • Belloni A , Chernozhukov V , Hansen C (2013) Inference on treatment effects after selection among high-dimensional controls. Rev. Econom. Stud. 81(2):608–650.CrossrefGoogle Scholar
  • Bishop CM (2006) Pattern Recognition and Machine Learning (Springer, New York).Google Scholar
  • Blei DM (2012) Probabilistic topic models. Comm. ACM 55(4):77.CrossrefGoogle Scholar
  • Boughorbel S , Jarray F , El-Anbari M (2017) Optimal classifier for imbalanced data using Matthews correlation coefficient metric. PLoS One 12(6):e0177678.CrossrefGoogle Scholar
  • Breiman L (2001) Random forests. Machine Learning 45(1):5–32.CrossrefGoogle Scholar
  • Burton RM , Obel B (2011) Computational modeling for what-is, what-might-be, and what-should-be studies—And triangulation. Organ. Sci. 22(5):1195–1202.LinkGoogle Scholar
  • Bzdok D (2017) Classical statistics and statistical learning in imaging neuroscience. Frontiers Neurosci. 11:543.CrossrefGoogle Scholar
  • Chen LF , Liao HYM , Ko MT , Lin JC , Yu GJ (2000) A new LDA-based face recognition system which can solve the small sample size problem. Pattern Recognition 33(10):1713–1726.CrossrefGoogle Scholar
  • Christensen K , Nørskov S , Frederiksen L , Scholderer J (2017) In search of new product ideas: Identifying ideas in online communities by machine learning and text mining. Creative Innovative Management 26(1):17–30.CrossrefGoogle Scholar
  • Crowston K , Allen EE , Heckman R (2012) Using natural language processing technology for qualitative data analysis. Internat. J. Soc. Res. Methodology 15(6):523–543.CrossrefGoogle Scholar
  • Crowston K , Liu X , Allen EE (2010) Machine learning and rule-based automated coding of qualitative data. Marshall C, Toms E, Grove A, eds. Proc. ASIST Annual Meeting (John Wiley & Sons, Hoboken, NJ), 1–2.Google Scholar
  • Davis JMV , Heller SB (2017) Using causal forests to predict treatment heterogeneity: An application to summer jobs. Amer. Econom. Rev. 107:546–550.CrossrefGoogle Scholar
  • Deetz S (1996) Crossroads—Describing differences in approaches to organization science: Rethinking burrell and morgan and their legacy. Organ. Sci. 7(2):191–207.LinkGoogle Scholar
  • Dua D , Graff C (2017) Machine learning repository. Accessed June 1, 2016, http://archive.ics.uci.edu/ml.Google Scholar
  • Eastman W , Bailey JR (1998) Mediating the fact-value antinomy: Patterns in managerial and legal rhetoric, 1890–1990. Organ. Sci. 9(2):232–245.LinkGoogle Scholar
  • Eisenhardt KM (1989) Building theories from case study research. Acad. Management Rev. 14(4):532–550.CrossrefGoogle Scholar
  • Fischer T , Krauss C (2018) Deep learning with long short-term memory networks for financial market predictions. Eur. J. Oper. Res. 270(2):654–669.CrossrefGoogle Scholar
  • Fiss PC (2011) Building better causal theories: A fuzzy set approach to typologies in organization research. Acad. Management J. 54(2):393–420.CrossrefGoogle Scholar
  • Gelman A , Loken E (2014) The statistical crisis in science. Amer. Sci. 102(6):460–465.CrossrefGoogle Scholar
  • Glaser BG (2008) Doing Quantitative Grounded Theory (Sociology Press, Mill Valley, CA).Google Scholar
  • Glaser B , Strauss A (1967) The Discovery of Grounded Theory (Weidenfeld & Nicolson, London). Google Scholar
  • Goldfarb B , King A (2016) Scientific apophenia in strategic management research: Significance tests and mistaken inference. Strategic Management J. 37(1):167–176.CrossrefGoogle Scholar
  • Guidotti R , Monreale A , Ruggieri S , Turini F , Giannotti F , Pedreschi D (2018) A survey of methods for explaining black box models. ACM Comput. Survey 51(5):1–42.CrossrefGoogle Scholar
  • Haans RFJ , Pieters C , He Z (2016) Thinking about U: Theorizing and testing U‐and inverted U‐shaped relationships in strategy research. Strategic Management J. 37(7):1177–1195.CrossrefGoogle Scholar
  • Harrigan KR (1985) An application of clustering for strategic group analysis. Strategic Management J. 6(1):55–73.CrossrefGoogle Scholar
  • Hannigan TR , Seidel VP , Yakis-Douglas B (2018) Product innovation rumors as forms of open innovation. Res. Policy 47(5):953–964.CrossrefGoogle Scholar
  • Hannigan TR , Haans RFJ , Vakili K , Tchalian H , Glaser VL , Wang MS , Kaplan S , et al. . (2019) Topic modeling in management research: Rendering new theory from textual data. Acad. Management Ann. 13(2):586–632.CrossrefGoogle Scholar
  • He F , Puranam P , Shrestha YR , von Krogh G (2020) Resolving governance disputes in communities: A study of software license decisions. Strategic Management J. 41(10):1837–1868.Google Scholar
  • Helfat CE (2007) Stylized facts, empirical research and theory development in management. Strategic Organ. 5(2):185–192.Google Scholar
  • Huang AH , Lehavy R , Zang AY , Zheng R (2018) Analyst information discovery and interpretation roles: a topic modeling approach. Management Sci. 64(6):2833–2855.LinkGoogle Scholar
  • Hulland J (1999) Use of partial least squares (PLS) in strategic management research: A review of four recent studies. Strategic Management J. 20(2):195–204.CrossrefGoogle Scholar
  • Hunter JE , Schmidt FL , Jackson GB (1982) Meta-Analysis: Cumulating Research Findings across Studies (Sage Publications, New York).Google Scholar
  • Jiang Y , Li M , Zhou ZH (2009) Mining extremely small data sets with application to software reuse. Software Practice Experience 39(4):423–440.CrossrefGoogle Scholar
  • Kalnins A (2018) Multicollinearity: How common factors cause type 1 errors in multivariate regression. Strategic Management J. 39(8):2362–2385.CrossrefGoogle Scholar
  • Kamishima T , Akaho S , Sakuma J (2011) Fairness-aware learning through regularization approach.Google Scholar
  • Kleinberg J , Ludwig J , Mullainathan S , Obermeyer Z (2015) Prediction policy problems. Amer. Econom. Rev. Paper Proc. 105(5):491–495.CrossrefGoogle Scholar
  • Larson SC (1931) The shrinkage of the coefficient of multiple correlation. J. Edu. Psychol. 22(1):45.CrossrefGoogle Scholar
  • Lave CA , March JG (1993) An Introduction to Models in the Social Sciences (University Press of America, Lanham, MD).Google Scholar
  • LeCun Y , Bengio Y , Hinton G (2015) Deep learning. Nature 521(7553):436–444.CrossrefGoogle Scholar
  • Leonard-Barton D (1990) A dual methodology for case studies: Synergistic use of a longitudinal single site with replicated multiple sites. Organ. Sci. 1(3):248–266.LinkGoogle Scholar
  • Lewis MW , Grimes AJ (1999) Metatriangulation: Building theory from multiple paradigms. Acad. Management Rev. 24(4):672–690.CrossrefGoogle Scholar
  • Lipton ZC (2016) The mythos of model interpretability. Accessed December 11, 2015, http://www.kdnuggets.com/2015/04/model-interpretability-neural-networks-deep-learning.html.Google Scholar
  • Locke K (2015) Pragmatic reflections on a conversation about grounded theory in management and organization studies. Organ. Res. Methods 18(4):612–619.Google Scholar
  • Lu H, Eng HL, Guan C, Plataniotis KN, Venetsanopoulos AN (2010) Regularized common spatial pattern with aggregation for EEG classification in small-sample setting. IEEE Trans. Biomedical Engrg. 57(12):2936–2946.Google Scholar
  • March JG , Sproull LS , Tamuz M (1991) Learning from samples of one or fewer. Organ. Sci. 2(1):1–13.LinkGoogle Scholar
  • Marquardt DW , Snee RD (1975) Ridge regression in practice. Amer. Statist. 29(1):3–20.CrossrefGoogle Scholar
  • Medlock B , Briscoe T (2007) Weakly supervised learning for hedge classification in scientific literature. Carroll JA, van den Bosch A, Zaenen A, eds. Proc. 45th Annual Meeting Assoc. Comput. Linguistics (Association for Computational Linguistics, Prague, Czech Republic), 992–999.Google Scholar
  • Mintzberg H (1979) An emerging strategy of” direct” research. Admin. Sci. Quart. 24(4):582–589.CrossrefGoogle Scholar
  • Mitchell TM (1997) Machine Learning (McGraw-Hill, New York).Google Scholar
  • Mosteller F , Wallace DL (1963) Inference in an authorship problem: A comparative study of discrimination methods applied to the authorship of the disputed Federalist Papers. J. Amer. Statist. Assoc. 58(302):275–309.Google Scholar
  • Mullainathan S , Spiess J (2017) Machine learning: An applied econometric approach. J. Econ. Perspect. 31(2):87–106.CrossrefGoogle Scholar
  • Murphy AL , Pietro PG , Roman GC (2006) LIME: A coordination model and middleware supporting mobility of hosts and agents. ACM Trans. Software Engrg. Methodology 15(3):279–328.CrossrefGoogle Scholar
  • Pearl J (2000) Causal inference without counterfactuals. J. Amer. Statist. Assoc. 95(450):428–431.Google Scholar
  • Peirce CS (1878) Deduction, induction and hypothesis. Popular Sci. Monthly 13:470–482.Google Scholar
  • Popper KR (1959) The Logic of Scientific Discovery (Basic Books, Oxford, UK).Google Scholar
  • Pratt MG , Kaplan S , Whittington R (2019) Editorial essay: The tumult over transparency: decoupling transparency from replication in establishing trustworthy qualitative research. Admin. Sci. Quart. 65(1):1–19.Google Scholar
  • Puranam D , Narayan V , Kadiyali V (2017) The effect of calorie posting regulation on consumer opinion: A flexible latent dirichlet allocation model with informative priors. Marketing Sci. 36(5):726–746.LinkGoogle Scholar
  • Puranam P , Stieglitz N , Osman M , Pillutla MM (2015) Modelling bounded rationality in organizations: Progress and prospects. Acad. Management Ann. 9(1):337–392.CrossrefGoogle Scholar
  • Ragin CC (1987) The Comparative Method (University of California Press, Berkeley). Google Scholar
  • Ragin CC (2000) Fuzzy-Set Social Science . (University of Chicago Press). Google Scholar
  • Robert C (2014) Machine learning, a probabilistic perspective. Chance 27(2):62–63.CrossrefGoogle Scholar
  • Rudin C (2014) Algorithms for interpretable machine learning. Macskassy S, Perlich C, Leskovec J, Wang W, Ghani R, eds. Proc. 20th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York), 1519.Google Scholar
  • Samek W , Montavon G, Vedaldi A, Hansen LK, Müller KR, eds. (2017) Explainable AI: Interpreting, Explaining and Visualizing Deep Learning (Springer Nature, New York).Google Scholar
  • Sami Ul Haq Q , Tao L , Sun F , Yang S (2012) A fast and robust sparse approach for hyperspectral data classification using a few labeled samples. IEEE Trans. Geosci. Remote Sensing 50(6):2287–2302.CrossrefGoogle Scholar
  • Samuel AL (1959) Eight-move opening utilizing generalization learning. IBM J. 3(3):210–229.CrossrefGoogle Scholar
  • Shadish WR , Cook TD , Campbell DT (2002) Experimental and Quasi-Experimental Designs for Generalized Causal Inference (Houghton Mifflin, Boston). Google Scholar
  • Shah SK , Corley KG (2006) Building better theory by bridging the quantitative-qualitative divide. J. Management Stud. 43(8):1821–1835.CrossrefGoogle Scholar
  • Shaikhina T , Khovanova NA (2017) Handling limited datasets with neural networks in medical applications: A small-data approach. Artificial Intelligence Medicine 75:51–63.CrossrefGoogle Scholar
  • Shalev-Shwartz S , Ben-David S (2014) Understanding Machine Learning: From Theory to Algorithms (Cambridge University Press, Cambridge, UK). CrossrefGoogle Scholar
  • Shaver JM (2019) Interpreting interactions in linear fixed-effect regression models: When fixed-effect estimates are no longer within-effects. Strategy Sci. 4(1):25–40.LinkGoogle Scholar
  • Shrestha YR , Yang Y (2019) Fairness in algorithmic decision-making: Applications in multi-winner voting, machine learning, and recommender systems. Algorithms (Basel) 12(9):199.CrossrefGoogle Scholar
  • Shrestha YR , Ben-Menahem SM , von Krogh G (2019) Organizational decision-making structures in the age of artificial intelligence. California Management Rev. 61(4):66–83.CrossrefGoogle Scholar
  • Sutton RI (1997) Crossroads—The virtues of closet qualitative research. Organ. Sci. 8(1):97–106.LinkGoogle Scholar
  • Tibshirani R (1996) Regression shrinkage and selection via the Lasso. J. Roy. Statist. Soc. B 58(1):267–288.CrossrefGoogle Scholar
  • Tonidandel S , King EB , Cortina JM (2018) Big data methods. Organ. Res. Methods 21(3):525–547.CrossrefGoogle Scholar
  • Varian HR (2014) Big data: New tricks for econometrics. J. Econom. Perspective 28(2):3–28.CrossrefGoogle Scholar
  • Varian HR (2016) How to build an economic model in your spare time. Amer. Econom. 61(1):81–90.CrossrefGoogle Scholar
  • von Krogh G (2018) Artificial intelligence in organizations: New opportunities for phenomenon-based theorizing. Acad. Managment Discovery 4(4):404–409.CrossrefGoogle Scholar
  • Walsh I , Holton JA , Bailyn L , Fernandez W , Levina N , Glaser B (2015) Rejoinder: Moving the management field forward. Organ. Res. Methods 18(4):620–628.CrossrefGoogle Scholar
  • Wolpert DH , Macready WG (1997) No free lunch theorems for optimization. IEEE Trans. Evolution Comput. 1(1):67–82.CrossrefGoogle Scholar
  • Yan JLS , McCracken N , Crowston K (2014) Semi-automatic content analysis of qualitative data. Accessed June 1, 2016, http://socqa.org/iConf2014.Google Scholar
  • Yang JB , Shen KQ , Ong CJ , Li XP (2009) Feature selection for MLP neural network: The use of random permutation of probabilistic outputs. IEEE Trans. Neural Networks 20(12):1911–1922.CrossrefGoogle Scholar
  • Yao S , Huang B (2017) Beyond parity: Fairness objectives for collaborative filtering. Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R eds. Advances in Neural Information Processing Systems, vol. 30 (Curran Associates, Red Hook, NY), 2921–2930.Google Scholar
  • Yin RK (2009) Case Study Research: Design and Methods (Sage Publications, Beverly Hills, CA). Google Scholar
  • Zelner BA (2009) Using simulation to interpret results from logit, probit, and other nonlinear models. Strategic Management J. 30(12):1335–1348.CrossrefGoogle Scholar
  • Zemel R , Wu Y , Swersky K , Pitassi T , Dwork C (2013) Learning fair representations. Dasgupta S, McAllester D, eds. Proc. 30th Internat. Conf. Machine Learning (PMLR, Atlanta), 325–333.Google Scholar
  • Zhou ZH , Jiang Y (2003) Medical diagnosis with C4.5 Rule preceded by artificial neural network ensemble. IEEE Trans. Inform. Tech. Biomedicine 7(1):37–42.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.