A for Effort? Using the Crowd to Identify Moral Hazard in New York City Restaurant Hygiene Inspections
Published Online:5 Dec 2019https://doi.org/10.1287/isre.2019.0866
References
- (2008) CyberGate: a design framework and system for text analysis of computer-mediated communication. MIS Quart. 32(4):811–837.Crossref, Google Scholar
- (1970) The market for “lemons”: Quality uncertainty and the market mechanism. Quart. J. Econom. 84(3):488–500.Crossref, Google Scholar
- (2009) Mostly Harmless Econometrics: An Empiricist’s Companion (Princeton University Press, Princeton, NJ).Google Scholar
- (2006) Eating in restaurants: A risk factor for foodborne disease? Clinical Infectious Diseases 43(10):1324–1328.Crossref, Google Scholar
- (2013) Introduction to the special issue—Social media and business transformation: A framework for research. Inform. Systems Res. 24(1):3–13.Link, Google Scholar
- (2011) Deriving the pricing power of product features by mining consumer reviews. Management Sci. 57(8):1485–1509.Link, Google Scholar
- (2015) Machine learning and causal inference for policy evaluation. Proc. 21st ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York), 5–6.Crossref, Google Scholar
- (2017) Beyond prediction: Using big data for policy problems. Science 355(6324):483–485.Crossref, Google Scholar
- (2006) Identification and inference in nonlinear difference‐in‐differences models. Econometrica 74(2):431–497.Crossref, Google Scholar
- (2015) Machine learning methods for demand estimation. Amer. Econom. Rev. 105(5):481–485.Crossref, Google Scholar
- (2018) Machine learning and natural language processing on the patent corpus: Data, tools, and new measures. J. Econom. Management Strategy 27(3):535–553.Crossref, Google Scholar
- (2015) Restaurants follow consultants’ advice to the letter for an A grade. New York Times (October 4), https://www.nytimes.com/2015/10/05/nyregion/health-exam-help-for-restaurants-to-avoid-rodents-or-worse-a-c.html.Google Scholar
- (2010) Positive effects of negative publicity: When negative reviews increase sales. Marketing Sci. 29(5):815–827.Link, Google Scholar
- (2002) How much should we trust differences-in-differences estimates? Working paper, National Bureau of Economic Research, Cambridge, MA.Google Scholar
- (2010) Research commentary—Designing smart markets. Inform. Systems Res. 21(4):688–699.Link, Google Scholar
- (2009) Digital disease detection—Harnessing the web for public health surveillance. New England J. Medicine 360(21):2153–2157.Crossref, Google Scholar
- (2017) What can machine learning do? Workforce implications. Science 358(6370):1530–1534.Crossref, Google Scholar
- (2002) A risk-based restaurant inspection system in Los Angeles County. J. Food Protection 65(2):367–372.Crossref, Google Scholar
- (2011) Amazon’s Mechanical Turk a new source of inexpensive, yet high-quality, data? Perspect. Psych. Sci. 6(1):3–5.Crossref, Google Scholar
- (2014) City restaurant health inspection grades a sham: Expert. New York Post (April 13), http://nypost.com/2014/04/13/city-restaurant-health-inspection-grades-a-shame-expert/.Google Scholar
- (2013) New avenues in opinion mining and sentiment analysis. IEEE Intelligent Systems 28(2):15–21.Crossref, Google Scholar
- (2011) Exploring determinants of voting for the “helpfulness” of online user reviews: A text mining approach. Decision Support Systems 50(2):511–521.Crossref, Google Scholar
- Centers for Disease Control and Prevention (2016a) Foodborne germs and illnesses. Retrieved April 10, 2016, http://www.cdc.gov/foodsafety/foodborne-germs.html.Google Scholar
- Centers for Disease Control and Prevention (2016b) Trends in foodborne illness. Retrieved April 10, 2016, http://www.cdc.gov/foodborneburden/trends-in-foodborne-illness.html.Google Scholar
- (2011) Using web search query data to monitor dengue epidemics: A new model for neglected tropical disease surveillance. PLoS Neglected Tropical Diseases 5(5):e1206.Crossref, Google Scholar
- (2012) The impact of electronic word-of-mouth communication: A literature analysis and integrative model. Decision Support Systems 54(1):461–470.Crossref, Google Scholar
- City of New York (2016) NYC Open Data: Restaurant inspections. Retrieved April 10, 2016, https://nycopendata.socrata.com/.Google Scholar
- (2001) How should food safety certification be financed? Amer. J. Agricultural Econom. 83(4):852–861.Crossref, Google Scholar
- (2010) Estimating aggregate consumer preferences from online product reviews. Internat. J. Res. Marketing 27(4):293–307.Crossref, Google Scholar
- (2003) The digitization of word of mouth: Promise and challenges of online feedback mechanisms. Management Sci. 49(10):1407–1424.Link, Google Scholar
- (2006) A statistical measure of a population’s propensity to engage in post-purchase online word-of-mouth. Statist. Sci. 21(2):277–285.Crossref, Google Scholar
- , (2007) Exploring the value of online product reviews in forecasting sales: The case of motion pictures. J. Interactive Marketing 21(4):23–45.Crossref, Google Scholar
- DrivenData (2016) Keeping it fresh: Predict restaurant inspections. Retrieved April 10, 2016, https://www.drivendata.org/competitions/5/.Google Scholar
- (2008) Do online reviews matter?—An empirical investigation of panel data. Decision Support Systems 45(4):1007–1016.Crossref, Google Scholar
- (2007) A review of food safety and food hygiene training studies in the commercial sector. Food Control 18(10):1180–1190.Crossref, Google Scholar
- (2014) Economics in the age of big data. Science 346(6210):1243089.Crossref, Google Scholar
- (2011) Restaurant letter grading: The first year. Report, New York City Department of Health and Mental Hygiene, New York.Google Scholar
- (2012) Restaurant grading in New York City at 18 months. Report, New York City Department of Health and Mental Hygiene, New York.Google Scholar
- (2016) Enforcement guidelines for common sanitary violations. Report, New York City Department of Health and Mental Hygiene, New York.Google Scholar
- (2013) Techniques and applications for sentiment analysis. Comm. ACM 56(4):82–89.Crossref, Google Scholar
- (2013) Regression anatomy, revealed. Stata J. 13(1):92–106.Crossref, Google Scholar
- (1933) Partial time regressions as compared with individual trends. Econometrica 1(4):387–401.Crossref, Google Scholar
- (2012) A changing landscape of physician quality reporting: Analysis of patients’ online ratings of their physicians over a 5-year period. J. Medical Internet Res. 14(1):e38.Crossref, Google Scholar
- (2000) Are invisible hands good hands? Moral hazard, competition, and the second‐best in healthcare markets. J. Political Econom. 108(5):992–1005.Crossref, Google Scholar
- (2017) Text as data. Working paper, National Bureau of Economic Research, Cambridge, MA.Google Scholar
- (2011) Estimating the helpfulness and economic impact of product reviews: Mining text and reviewer characteristics. IEEE Trans. Knowledge Data Engrg. 23(10):1498–1512.Crossref, Google Scholar
- (2012) Designing ranking systems for hotels on travel search engines by mining user-generated and crowdsourced content. Marketing Sci. 31(3):493–520.Link, Google Scholar
- (2009) Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford University, Stanford, CA.Google Scholar
- (2004) Using online conversations to study word-of-mouth communication. Marketing Sci. 23(4):545–560.Link, Google Scholar
- (2013) Data collection in a flat world: The strengths and weaknesses of Mechanical Turk samples. J. Behav. Decision Making 26(3):213–224.Crossref, Google Scholar
- (2013) Contributing factors in restaurant-associated foodborne disease outbreaks, FoodNet sites, 2006 and 2007. J. Food Protection 76(11):1824–1828.Crossref, Google Scholar
- (2001) Principles of Data Mining (MIT Press, Cambridge, MA).Google Scholar
- (2016) The risk of James–Stein and Lasso Shrinkage. Econometric Rev. 35(8–10):1456–1470.Crossref, Google Scholar
- (2014) Using online reviews by restaurant patrons to identify unreported cases of foodborne illness—New York City, 2012–2013. Morbidity Mortality Weekly Report 63(20):441–445.Google Scholar
- HealthMap (2017) About. Retrieved March 24, 2017, http://www.healthmap.org.Google Scholar
- (2006) Systematic environmental evaluations to identify food safety differences between outbreak and nonoutbreak restaurants. J. Food Protection 69(11):2697–2702.Crossref, Google Scholar
- (2012) Fudging the nudge: Information disclosure and restaurant grading. Yale Law J. 122(3):574–688.Google Scholar
- (1979) Moral hazard and observability. Bell J. Econom. 10(1):74–91.Crossref, Google Scholar
- (2016) Package “textcat.” Retrieved April 10, 2016, https://cran.r-project.org/web/packages/textcat/index.html.Google Scholar
- (2004) Mining and summarizing customer reviews. Proc. 10th ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (Association for Computing Machinery, New York), 168–177.Google Scholar
- (1989) Results of routine restaurant inspections can predict outbreaks of foodborne illness: The Seattle-King County experience. Amer. J. Public Health 79(5):586–590.Crossref, Google Scholar
- (2016) Combining satellite imagery and machine learning to predict poverty. Science 353(6301):790–794.Crossref, Google Scholar
- (2014a) Inspection technology, detection, and compliance: Evidence from Florida restaurant inspections. RAND J. Econom. 45(4):885–917.Crossref, Google Scholar
- (2014b) A tale of repetition: Lessons from Florida restaurant inspections. Working paper, National Bureau of Economic Research, Cambridge, MA.Google Scholar
- (2003) The effect of information on product quality: Evidence from restaurant hygiene grade cards. Quart. J. Econom. 118(2):409–451.Crossref, Google Scholar
- (2009) Reputational incentives for restaurant hygiene. Amer. Econom. J. Microeconom. 1(1):237–267.Crossref, Google Scholar
- (2013) Where not to eat? Improving public policy by predicting hygiene inspections using online reviews. Ng HT, ed. Proc. 2013 Conf. Empirical Methods Natl. Language Processing (Association for Computational Linguistics, Stroudsburg, PA), 1443–1448.Google Scholar
- (2017) Open innovation in the public sector: A research agenda. Government Inform. Quart. 34(1):84–89.Crossref, Google Scholar
- (2016) Market transparency, adverse selection, and moral hazard. J. Political. Econom. 124(6):1677–1713.Crossref, Google Scholar
- (2015) Prediction policy problems. Amer. Econom. Rev. 105(5):491–495.Crossref, Google Scholar
- (2017) Human decisions and machine predictions. Quart. J. Econom. 133(1):237–293.Crossref, Google Scholar
- (1992) Agency theory and franchising: Some empirical results. RAND J. Econom. 23(2):263–283.Crossref, Google Scholar
- (2011) Automated marketing research using online customer reviews. J. Marketing Res. 48(5):881–894.Crossref, Google Scholar
- (2015) Do I follow my friends or the crowd? Information cascades in online movie ratings. Management Sci. 61(9):2241–2258.Link, Google Scholar
- (2010) The information content of forward-looking statements in corporate filings—A naïve Bayesian machine learning approach. J. Accounting Res. 48(5):1049–1102.Crossref, Google Scholar
- (2008) Self-selection and information role of online product reviews. Inform. Systems Res. 19(4):456–474.Link, Google Scholar
- (1986) Longitudinal data analysis using generalized linear models. Biometrika 73(1):13–22.Crossref, Google Scholar
- (2015) Sentiment Analysis: Mining Opinions, Sentiments, and Emotions (Cambridge University Press, Cambridge, UK).Crossref, Google Scholar
- (2002) Text classification using string kernels. J. Machine Learn. Res. 2(February):419–444.Google Scholar
- (2011) When is a liability not a liability? Textual analysis, dictionaries, and 10-Ks. J. Finance 66(1):35–65.Crossref, Google Scholar
- (1963) Seasonal adjustment of economic time series and multiple regression analysis. J. Amer. Statist. Assoc. 58(304):993–1010.Crossref, Google Scholar
- (2013a) The emergence of opinion leaders in a networked online community: A dyadic model with time dynamics and a heuristic for fast estimation. Management Sci. 59(8):1783–1799.Link, Google Scholar
- (2013b) Promotional marketing or word-of-mouth? Evidence from online restaurant reviews. Inform. Systems Res. 24(3):596–612.Link, Google Scholar
- (2016) Fake it till you make it: Reputation, competition, and Yelp review fraud. Management Sci. 62(12): 3412–3427.Link, Google Scholar
- (2013) More than words: The influence of affective content and linguistic style matches in online reviews on conversion rates. J. Marketing 77(1):87–103.Crossref, Google Scholar
- , (1998) A comparison of event models for naive Bayes text classification. Cohn A, ed. Learning for Text Categorization: Papers from the 1998 AAAI Workshop (Association for the Advancement of Artificial Intelligence, Menlo Park, CA), 41–48.Google Scholar
- (2015) More than just words: Latent semantic analysis, online reviews and restaurant closure. Acad. Management Proc. 2015(1), https://doi.org/10.5465/ambpp.2015.13912abstract.Google Scholar
- (1995) WordNet: a lexical database for English. Comm. ACM 38(11):39–41.Crossref, Google Scholar
- (2014) Doing business with strangers: Reputation in online service marketplaces. Inform. Systems Res. 25(4):865–886.Link, Google Scholar
- (2017) Machine learning: an applied econometric approach. J. Econom. Perspect. 31(2):87–106.Crossref, Google Scholar
- (2012) Mine your own business: Market-structure surveillance through text mining. Marketing Sci. 31(3):521–543.Link, Google Scholar
- New York City Department of Health and Mental Hygiene (2016a) How we score and grade. Retrieved April 1, 2016, https://www1.nyc.gov/assets/doh/downloads/pdf/rii/how-we-score-grade.pdf.Google Scholar
- New York City Department of Health and Mental Hygiene (2016b) Letter grading workshop materials. Retrieved April 1, 2016, https://www1.nyc.gov/assets/doh/downloads/ppt/rii/letter_grading_workshop.pps.Google Scholar
- New York City Department of Health and Mental Hygiene (2016c) What to expect when you’re inspected: A guide for food service operators. Retrieved April 1, 2016, https://www1.nyc.gov/assets/doh/downloads/pdf/rii/blue-book.pdf.Google Scholar
- (2010) An exploration of social media in extreme events: Rumor theory and Twitter during the Haiti earthquake 2010. Proc. 31st Internat. Conf. Inform. Systems (Association for Information Systems, Atlanta), 7332–7336.Google Scholar
- (2018) Board independence and firm performance in the IT industry: The moderating role of new entry threats. MIS Quart. 42(3):979–1000.Crossref, Google Scholar
- (2015) New York City restaurants are cleaner than ever. Time (June 29), http://time.com/3940482/new-york-city-restaurants-clean/.Google Scholar
- (2013) Responding to online reviews problem solving and engagement in hotels. Cornell Hospitality Quart. 54(1):64–73.Crossref, Google Scholar
- (1996) Stochastic Processes (John Wiley & Sons, New York).Google Scholar
- (1983) Repeated insurance contracts and moral hazard. J. Econom. Theory 30(1):74–97.Crossref, Google Scholar
- (2016) Supplementing public health inspection via social media. PLoS One 11(3):e0152117.Crossref, Google Scholar
- (2002) Machine learning in automated text categorization. ACM Comput. Surveys 34(1):1–47.Crossref, Google Scholar
- (1986) Investment, moral hazard, and occupational licensing. Rev. Econom. Stud. 53(5):843–862.Crossref, Google Scholar
- (1979) On moral hazard and insurance. Quart. J. Econom. 93(4):541–562.Crossref, Google Scholar
- (2005) Moral hazard, inspection policy, and food safety. Amer. J. Agricultural Econom. 87(1):15–27.Crossref, Google Scholar
- (2007) Contract selectivity, food safety, and traceability. J. Agricultural Food Indust. Organ. 5(1):Article 2.Google Scholar
- (2002) Information and the change in the paradigm in economics. Amer. Econom. Rev. 92(3):460–501.Crossref, Google Scholar
- (2010) Regulation and failure. Rev. Econom. Inst. 12(23):13–28.Google Scholar
- (2009) A survey on sentiment detection of reviews. Expert Systems Appl. 36(7):10760–10773.Crossref, Google Scholar
- (2008) More than words: Quantifying language to measure firms’ fundamentals. J. Finance 63(3):1437–1467.Crossref, Google Scholar
- , Hsu JY (2013) Building a concept-level sentiment dictionary based on commonsense knowledge. IEEE Intelligent Systems 28(2):22–30.Crossref, Google Scholar
- (2004) Developing affective lexical resources. PsychNology J. 2(1):61–83.Google Scholar
- (2014) Big data: New tricks for econometrics. J. Econom. Perspect. 28(2):3–27.Crossref, Google Scholar
- (2009) Rethinking LDA: Why priors matter. Bengio Y, Schuurmans D, Lafferty JD, Williams CKI, Culotta A, eds. Advances in Neural Information Processing Systems, vol. 22 (Curran Associates, Inc., Red Hook, NY), 1973–1981.Google Scholar
- (2018) Tracking depression dynamics in college students using mobile phone and wearable sensing. Proc. ACM Interactive Mobile Wearable Ubiquitous Tech. 2(1):Article 43.Google Scholar
- (2010) Web 2.0 and politics: The 2008 U.S. presidential election and an e-politics research agenda. MIS Quart. 34(4):669–688.Crossref, Google Scholar
- (2012) Affective news: The automated coding of sentiment in political texts. Political Comm. 29(2):205–231.Crossref, Google Scholar

