Reidentification Risk in Panel Data: Protecting for k-Anonymity

Published Online: https://doi.org/10.1287/isre.2022.1169

References

  • Aggarwal CC (2005) On k-anonymity and the curse of dimensionality. Proc. 31st Internat. Conf. Very Large Data Bases, 901–909.
  • Aggarwal G, Feder T, Kenthapadi K, Motwani R, Panigrahy R, Thomas D, Zhu A (2005) Approximation algorithms for k-anonymity. Proc. Internat. Conf. Database Theory.
  • Allenby GM, Rossi PE (1998) Marketing models of consumer heterogeneity. J. Econometrics 89(1–2):57–78.
  • Anand P, Lee C (2022) Using deep learning to overcome privacy and scalability issues in customer data transfer. Marketing Sci. Forthcoming.
  • Bayardo RJ, Agrawal R (2005) Data privacy through optimal k-anonymization. 21st Internat. Conf. Data Engrg., 217–228.
  • Benitez K, Malin B (2010) Evaluating re-identification risks with respect to the HIPAA privacy rule. J. Amer. Medical Informatics Assoc. 17(2):169–177.
  • Berry S, Levinsohn J, Pakes A (1995) Automobile prices in market equilibrium. Econometrica 63(4):841–890.
  • Besanko D, Dube JP, Gupta S (2003) Competitive price discrimination in a vertical channel using aggregate retail data. Management Sci. 49(9):1121–1138.
  • Besanko D, Gupta S, Jain D (1998) Logit demand estimation under competitive pricing behavior: An equilibrium framework. Management Sci. 44(11):1533–1547.
  • Bodapati AV, Gupta S (2004) The recoverability of segmentation structure from store-level aggregate data. J. Marketing Res. 41(3):351–364.
  • Bowman D, Narayandas D (2001) Managing customer-initiated contacts with manufacturers: The impact on share of category requirements and word-of-mouth behavior. J. Marketing Res. 38(3):281–297.
  • Bronnenberg BJ, Kruger MW, Mela CF (2008) Database paper—The IRI marketing data set. Marketing Sci. 27(4):745–748.
  • Bruno HA, Cebollada J, Chintagunta PK (2018) Targeting Mr. or Mrs. Smith: Modeling and leveraging intrahousehold heterogeneity in brand choice behavior. Marketing Sci. 37(4):631–648.
  • Bucklin RE, Gupta S (1999) Commercial use of UPC scanner data: Industry and academic perspectives. Marketing Sci. 18(3):247–273.
  • Chen Y, Yang S (2007) Estimating disaggregate models using aggregate data through augmentation of individual choice. J. Marketing Res. 44(4):613–621.
  • Cox LH (1980) Suppression methodology and statistical disclosure control. J. Amer. Statist. Assoc. 75(370):377–385.
  • De Montjoye YA, Hidalgo CA, Verleysen M, Blondel VD (2013) Unique in the crowd: The privacy bounds of human mobility. Sci. Rep. 3:1376.
  • De Montjoye YA, Radaelli L, Singh VK, Pentland AS (2015) Unique in the shopping mall: On the reidentifiability of credit card metadata. Sci. 347(6221):536–539.
  • Dey D (2003) Record matching in data warehouses: A decision model for data consolidation. Oper. Res. 51(2):240–254.
  • Dey D, Sarkar S, De P (1998) A probabilistic decision model for entity matching in heterogeneous databases. Management Sci. 44(10):1379–1395.
  • Domingo-Ferrer J, Torra V (2005) Ordinal, continuous and heterogeneous k-anonymity through microaggregation. Data Mining Knowledge Discovery 11(2):195–212.
  • Duncan G, Lambert D (1986) Disclosure-limited data dissemination. J. Amer. Statist. Assoc. 81(393):10–18.
  • Duncan G, Lambert D (1989) The risk of disclosure for microdata. J. Bus. Econom. Statist. 7(2):207–217.
  • Duncan GT, Stokes SL (2004) Disclosure risk vs. data utility: The RU confidentiality map as applied to topcoding. Chance 17(3):16–20.
  • Dwork C (2006) Differential privacy. Bugliesi M, Preneel B, Sassone V, Wegener I, eds. 33rd Internat. Colloquium Automata Languages Programming (Springer, Berlin/Heidelberg), 1–12.
  • El Emam K, Dankar FK (2008) Protecting privacy using k-anonymity. J. Amer. Medical Informatics Assoc. 15(5):627–637.
  • Fellegi IP, Sunter AB (1969) A theory for record linkage. J. Amer. Statist. Assoc. 64(328):1183–1210.
  • Ferrell OC (2017) Broadening marketing’s contribution to data privacy. J. Acad. Marketing Sci. 45(2):160–163.
  • Finck M, Pallas K (2020) They who must not be identified—Distinguishing personal from non-personal data under the GDPR. Internat. Data Privacy Law 10(1):11–36.
  • Fung BCM, Wang K, Chen R, Yu PS (2010) Privacy-preserving data publishing: A survey of recent developments. ACM Comput. Surveys 42(4):1–53.
  • Gelman A, Park DK (2009) Splitting a predictor at the upper quarter or third and the lower quarter or third. Amer. Statist. 63(1):1–8.
  • Goldfarb A, Tucker C (2012) Shifts in privacy concerns. Amer. Econom. Rev. 102(3):349–353.
  • Hern A (2014) New York taxi details can be extracted from anonymised data, researchers say. The Guardian Online (June 27), https://www.theguardian.com/technology/2014/jun/27/new-york-taxi-details-anonymised-data-researchers-warn.
  • Herzog TN, Scheuren FJ, Winkler WE (2007) Data Quality and Record Linkage Techniques, vol. 1 (Springer, New York).
  • Kappe E, Stremersch S (2016) Drug detailing and doctors’ prescription decisions: The role of information content in the face of competitive entry. Marketing Sci. 35(6):915–933.
  • Kartal HB, Li XB (2020) Protecting privacy when sharing and releasing data with multiple records per person. J. Assoc. Inform. Systems 21(6):1461–1485.
  • Kenig B, Tassa T (2012) A practical approximation algorithm for optimal k-anonymity. Data Mining Knowledge Discovery 25(1):134–168.
  • Lambert D (1993) Measures of disclosure risk and harm. J. Official Statist. 9(2):313–331.
  • LeFevre K, DeWitt DJ, Ramakrishnan R (2005) Incognito: Efficient full-domain k-anonymity. Proc. 2005 ACM SIGMOD Internat. Conf. Management Data, 49–60.
  • LeFevre K, DeWitt DJ, Ramakrishnan R (2006) Mondrian multidimensional k-anonymity. 22nd Internat. Conf. Data Engrg. (IEEE), 25.
  • Li N, Li T, Venkatasubramanian S (2007) t-closeness: Privacy beyond k-anonymity and l-diversity. 2007 IEEE 23rd Internat. Conf. Data Engrg., 106–115.
  • Li XB, Qin J (2017) Anonymizing and sharing medical text records. Inform. Systems Res. 28(2):332–352.
  • Li XB, Sarkar S (2006) Privacy protection in data mining: A perturbation approach for categorical data. Inform. Systems Res. 17(3):254–270.
  • Li XB, Sarkar S (2011) Protecting privacy against record linkage disclosure: A bounded swapping approach for numeric data. Inform. Systems Res. 22(4):774–789.
  • Li XB, Sarkar S (2013) Class-restricted clustering and microperturbation for data privacy. Management Sci. 59(4):796–812.
  • Liu Q, Gupta S, Venkataraman S, Liu H (2016) An empirical model of drug detailing: Dynamic competition and policy implications. Management Sci. 62(8):2321–2340.
  • Lipsman A, Mudd G, Rich M, Bruich S (2012) The power of “like”: How brands reach (and influence) fans through social-media marketing. J. Advertising Res. 52(1):40–52.
  • Machanavajjhala A, Gehrke J, Kifer D, Venkitasubramaniam M (2006) l-diversity: Privacy beyond k-anonymity. 22nd Internat. Conf. Data Engrg. (IEEE), 24.
  • Machanavajjhala A, Kifer D, Abowd J, Gehrke J, Vilhuber L (2008) Privacy: Theory meets practice on the map. 2008 IEEE 24th Internat. Conf. Data Engrg. (IEEE), 277–286.
  • Malhotra NK, Kim SS, Agarwal J (2004) Internet users’ information privacy concerns (IUIPC): The construct, the scale, and a causal model. Inform. Systems Res. 15(4):336–355.
  • Manchanda P, Rossi PE, Chintagunta PK (2004) Response modeling with nonrandom marketing-mix variables. J. Marketing Res. 41(4):467–478.
  • Martin KD, Murphy PE (2017) The role of data privacy in marketing. J. Acad. Marketing Sci. 45(2):135–155.
  • Matthews GJ, Harel O (2011) Data confidentiality: A review of methods for statistical disclosure limitation and methods for assessing privacy. Statist. Surveys 5:1–29.
  • Meyerson A, Williams R (2004) On the complexity of optimal k-anonymity. Proc. 23rd ACM SIGMOD-SIGACT-SIGART Sympos. Principles Database Systems (ACM), 223–228.
  • Musalem A, Bradlow ET, Raju JS (2009) Bayesian estimation of random-coefficients choice models using aggregate data. J. Appl. Econometrics 24(3):490–516.
  • Narayanan A, Shmatikov V (2008) Robust de-anonymization of large sparse datasets. 2008 IEEE Sympos. Security Privacy, 111–125.
  • Nergiz ME, Clifton C (2007) Thoughts on k-anonymization. Data and Knowledge Engrg. 63(3):622–645.
  • Reiter JP (2005) Estimating risks of identification disclosure in microdata. J. Amer. Statist. Assoc. 100(472):1103–1112.
  • Rogers B (2021) Sensor tower builds the ‘Nielsen’ of the app world. Forbes Online (April 9), https://www.forbes.com/sites/brucerogers/2021/04/09/sensor-tower-builds-the-nielsen-of-the-app-world/?sh=4c92f2472272.
  • Rubin DB (1993) Statistical disclosure limitation. J. Official Statist. 9(2):461–468.
  • Samarati P, Sweeney L (1998) Protecting privacy when disclosing information: k-anonymity and its enforcement through generalization and suppression. Technical report, SRI-CSL-98-04, Computer Science Laboratory, SRI International, Menlo Park, CA.
  • Schneider MJ, Jagpal S, Gupta S, Li S, Yu Y (2017) Protecting customer privacy when marketing with second-party data. Internat. J. Res. Marketing 34(3):593–603.
  • Schneider MJ, Jagpal S, Gupta S, Li S, Yu Y (2018) A flexible method for protecting marketing data: An application to point-of-sale data. Marketing Sci. 37(1):153–171.
  • Smith HJ, Dinev T, Xu H (2011) Information privacy research: An interdisciplinary review. Management Inform. Systems Quart. 35(4):989–1015.
  • Smith HJ, Milberg SJ, Burke SJ (1996) Information privacy: Measuring individuals’ concerns about organizational practices. Management Inform. Systems Quart. 20(2):167–196.
  • Sweeney L (2000) Uniqueness of simple demographics in the US population. Technical report, Carnegie Mellon University, Pittsburgh, PA.
  • Sweeney L (2002a) Achieving k-anonymity privacy protection using generalization and suppression. Internat. J. Uncertainty Fuzziness Knowledge-Based Systems 10(5):571–588.
  • Sweeney L (2002b) k-anonymity: A model for protecting privacy. Internat. J. Uncertainty Fuzziness Knowledge-Based Systems 10(5):557–570.
  • Tucker CE (2014) Social networks, personalized advertising, and privacy controls. J. Marketing Res. 51(5):546–562.
  • Verizon (2019) 2019 Data breach investigations report. Accessed August 15, 2022, https://enterprise.verizon.com/resources/executivebriefs/2019-dbir-executive-brief.pdf.
  • Wedel M, Kannan PK (2016) Marketing analytics for data-rich environments. J. Marketing 80(6):97–121.
  • Wieringa J, Kannan PK, Ma X, Reutterer T, Risselada H, Skiera B (2021) Data analytics in a privacy-concerned world. J. Bus. Res. 122:915–925.
  • Zhu D, Li XB, Wu S (2009) Identity disclosure protection: A data reconstruction approach for privacy-preserving data mining. Decision Support Systems 48(1):133–140.
  • Zhu Y, Matsuyama Y, Ohashi Y, Setoguchi S (2015) When to conduct probabilistic linkage vs. deterministic linkage? A simulation study. J. Biomedical Informatics 56:80–86.