On the Existence and Significance of Data Preprocessing Biases in Web-Usage Mining
Published Online: 1 May 2003
https://doi.org/10.1287/ijoc.15.2.148.14449

