Privacy Protection in Data Mining: A Perturbation Approach for Categorical Data
Published Online: 1 Sep 2006. https://doi.org/10.1287/isre.1060.0095

