Chasing a Moving Target: Exploitation and Exploration in Dynamic Environments

Published Online: https://doi.org/10.1287/mnsc.1110.1420

References

  • Auer P., Cesa-Bianchi N., Fischer P. Finite-time analysis of the multiarmed bandit problem. Machine Learn. (2002) 47(2–3):235–256
  • Barr P., Stimpert J., Huff A. Cognitive change, strategic action, and organizational renewal. Strategic Management J. (1992) 13(S1):15–36
  • Baum J., Wally S. Strategic decision speed and firm performance. Strategic Management J. (2003) 24(11):1107–1129
  • Benner M. J. Dynamic or static capabilities? Process management practices and response to technological change. J. Product Innovation Management (2009) 26(5):473–486
  • Benson D., Ziedonis R. Corporate venture capital as a window on new technologies: Implications for the performance of corporate investors when acquiring startups. Organ. Sci. (2009) 20(2):329–351
  • Bergemann D., Välimäki J. Learning and strategic pricing. Econometrica (1996) 64(5):1125–1149
  • Berry D. A., Fristedt B. Bandit Problems: Sequential Allocation of Experiments (1985) (Chapman and Hall, London)
  • Blanco S. Consumer Reports: Chevy Volt “is going to be a tough sell to the average consumer”. AutoblogGreen (2011) (March 1). http://green.autoblog.com/2011/03/01/consumer-reports-chevy-volt-is-going-to-be-a-tough-sell-to-the/
  • Brezzi M., Lai T. L. Optimal learning and experimentation in bandit problems. J. Econom. Dynam. Control (2002) 27(1):87–108
  • Brown S., Eisenhardt K. The art of continuous change: Linking complexity theory and time-paced evolution in relentlessly shifting organizations. Admin. Sci. Quart. (1997) 42(1):1–34
  • Bush R., Mosteller F. Stochastic Models for Learning (1955) (John Wiley & Sons, New York)
  • Camerer C., Ho T. H. Experience-weighted attraction learning in normal form games. Econometrica (1999) 67(4):827–874
  • Cyert R., March J. A Behavioral Theory of the Firm (1963) (Prentice-Hall, Englewood Cliffs, NJ)
  • D'Aveni R. A., Gunther R. E. Hypercompetition: Managing the Dynamics of Strategic Maneuvering (1994) (Simon and Schuster, New York)
  • Davis J., Eisenhardt K., Bingham C. Optimal structure, market dynamism, and the strategy of simple rules. Admin. Sci. Quart. (2009) 54(3):413–452
  • Denrell J., March J. Adaptation as information restriction: The hot stove effect. Organ. Sci. (2001) 12(5):523–538
  • Dess G., Beard D. Dimensions of organizational task environments. Admin. Sci. Quart. (1984) 29(1):52–73
  • Gans N., Knox G., Croson R. Simple models of discrete choice and their performance in bandit experiments. Manufacturing Service Oper. Management (2007) 9(4):383–408
  • Gavetti G., Levinthal D. A. Looking forward and looking backward: Cognitive and experiential search. Admin. Sci. Quart. (2000) 45(1):113–137
  • Gittins J. C. Bandit processes and dynamic allocation indices. J. Roy. Statist. Soc. Ser. B (Methodological) (1979) 41(2):148–177
  • Gittins J. C., Jones D. M. A dynamic allocation index for the sequential design of experiments. Gani J., ed. Progress in Statistics (1974) (North-Holland, Amsterdam) 241–266
  • Gittins J. C., Wang Y. The learning component of dynamic allocation indices. Ann. Statist. (1992) 20(3):1625–1636
  • Gupta A., Smith K., Shalley C. The interplay between exploration and exploitation. Acad. Management J. (2006) 49(4):693–706
  • Hannan M., Freeman J. Structural inertia and organizational change. Amer. Sociol. Rev. (1984) 49(2):149–164
  • Hardwick J. P., Stout Q. F. Bandit strategies for ethical sequential allocation. Proc. 23rd Sympos. Interface (1992) (American Statistical Association, Alexandria, VA) 421–424
  • Hatch N. W., Dyer J. H. Human capital and learning as a source of sustainable competitive advantage. Strategic Management J. (2004) 25(12):1155–1178
  • Hedberg B., Nystrom C., Starbuck W. Camping on seesaws: Prescriptions for a self-designing organization. Admin. Sci. Quart. (1976) 21(1):41–65
  • Henderson R. Underinvestment and incompetence as responses to radical innovation: Evidence from the photolithographic alignment equipment industry. RAND J. Econom. (1993) 24(2):248–270
  • Holland J. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control & Artificial Intelligence (1975) (University of Michigan Press, Ann Arbor)
  • Keller G., Rady S. Optimal experimentation in a changing environment. Rev. Econom. Stud. (1999) 66(3):475–507
  • Lant T., Mezias S. An organizational learning model of convergence and reorientation. Organ. Sci. (1992) 3(1):47–72
  • Le Mens G., Denrell J. Rational learning and information sampling: On the naivety assumption in sampling explanations of judgment biases. Psychol. Rev. (2011) 118(2):379–392
  • Luce R. Individual Choice Behavior: A Theoretical Analysis (1959) (Wiley, New York)
  • March J. G. Exploration and exploitation in organizational learning. Organ. Sci. (1991) 2(1):71–87
  • March J. G. Learning to be risk averse. Psychol. Rev. (1996) 103(2):309–319
  • March J. G. Understanding organizational adaptation. Soc. Econom. (2003) 25(1):1–10
  • March J. G. The Ambiguities of Experience (2010) (Cornell University Press, Ithaca, NY)
  • McNamara P., Baden-Fuller C. Lessons from the Celltech case: Balancing knowledge exploration and exploitation in organizational renewal. British J. Management (1999) 10(4):291–307
  • Miller D. The structural and environmental correlates of business strategy. Strategic Management J. (1987) 8(1):55–76
  • Nelson R. Bounded rationality, cognitive maps, and trial and error learning. J. Econom. Behav. Organ. (2008) 67(1):78–89
  • Nelson R., Winter S. G. An Evolutionary Theory of Economic Change (1982) (Belknap Press, Cambridge, MA)
  • Peters T. J., Waterman R. H. In Search of Excellence: Lessons from America's Best-Run Companies (1982) (Harper & Row, New York)
  • Radner R., Rothschild M. On the allocation of effort. J. Econom. Theory (1975) 10(3):358–376
  • Robbins H. Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc. (1952) 58(5):527–535
  • Schmalensee R. Alternative models of bandit selection. J. Econom. Theory (1975) 10(3):333–342
  • Sorenson J., Stuart T. E. Aging, obsolescence, and organizational innovation. Admin. Sci. Quart. (2000) 45(1):81–112
  • Sutton R. S., Barto A. G. Reinforcement Learning: An Introduction (1998) (MIT Press, Cambridge, MA)
  • Teece D. J., Pisano G., Shuen A. Dynamic capabilities and strategic management. Strategic Management J. (1997) 18(7):509–533
  • Tushman M., Romanelli E. Organizational evolution: A metamorphosis model of convergence and reorientation. Cummings L. L., Staw B. M., eds. Research in Organizational Behavior (1985) (JAI Press, Greenwich, CT) 171–222
  • Valdes-Dapena P. Ford: No Volt for us. CNNMoney (2009) (January 13). http://money.cnn.com/2009/01/12/autos/ford_electric_plans/index.htm
  • Vermorel J., Mohri M. Multi-armed bandit algorithms and empirical evaluation. Gama J., Camacho R., Brazdil P., Jorge A., Torgo L., eds. Machine Learn.: ECML 2005, LNAI 3720 (2005) (Springer-Verlag, Berlin) 437–448
  • Weber E., Shafir S., Blais A.-R. Predicting risk sensitivity in humans and lower animals: Risk as variance or coefficient of variation. Psychol. Rev. (2004) 111(2):430–445
  • Weick K. The Social Psychology of Organizing (1969) (Addison-Wesley, Reading, MA)
  • Welch D. GM: Live green or die. Bloomberg Businessweek (2008) (May 15). http://www.businessweek.com/print/magazine/content/08_21/b4085036665789.htm
  • Wholey D., Brittain J. Characterizing environmental variation. Acad. Management J. (1989) 32(4):867–882
  • Whittle P. Restless bandits: Activity allocation in a changing world. J. Appl. Probability (1988) 25:287–298