Predictive Analytics for Navigation Data Using Sequence-Based Clustering and Absorbing Markov Chains

Published Online:https://doi.org/10.1287/ijds.2023.0011

References

  • Aartsen B, El-Gayar OF, Noteboom C (2020) A systematic review of web usage mining techniques and future research options. MWAIS 2020 Proc. 25 (AIS eLibrary).Google Scholar
  • Abraham S, Nair S (2015) A predictive framework for cyber security analytics using attack graphs. Preprint, submitted February 4, https://arxiv.org/abs/1502.01240.Google Scholar
  • Adeniyi DA, Wei Z, Yongquan Y (2016) Automated web usage data mining and recommendation system using k-nearest neighbor (KNN) classification method. Appl. Comput. Inform. 12(1):90–108.Google Scholar
  • Ansari Z, Ahmed W, Azeem M, Babu AV (2015) Discovery of web usage profiles using various clustering techniques. Preprint, submitted September 1, https://arxiv.org/abs/1509.00692.Google Scholar
  • Borges J, Levene M (2000) Data mining of user navigation patterns. Masand B, Spiliopoulou M, eds. Web Usage Analysis and User Profiling (Springer, Berlin), 92–112.Google Scholar
  • Borges J, Levene M (2005) Generating dynamic higher-order Markov models in web usage mining. Jorge AM, Torgo L, Brazdil P, Camacho R, Gama J, eds. Knowledge Discovery in Databases (Springer, Berlin), 34–45.Google Scholar
  • Cadez I, Heckerman D, Meek C, Smyth P, White S (2003) Model-based clustering and visualization of navigation patterns on a web site. Data Mining Knowledge Discovery 7:399–424.Google Scholar
  • Chierichetti F, Kumar R, Raghavan P, Sarlos T (2012) Are web users really Markovian? Proc. 21st Internat. Conf. World Wide Web (Association for Computing Machinery, New York), 609–618.Google Scholar
  • Deshpande M, Karypis G (2004) Selective Markov models for predicting web page accesses. ACM Trans. Internet Tech. 4(2):163–184.Google Scholar
  • Gao H, Wang S, Zhang H, Liu B, Zhao D, Liu Z (2022) Network security situation assessment method based on absorbing Markov chain. 2022 Internat. Conf. Networking Network Appl. (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 556–561.Google Scholar
  • Granot IY, Wu CSM, Or-Bach Z (2021) Improving the relevance of a web navigation recommender system using categorization of users’ experience. 2021 IEEE World AI IoT Congress (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 0486–0490.Google Scholar
  • Grinstead CM, Snell JL (2012) Introduction to Probability (American Mathematical Society, Providence, RI).Google Scholar
  • Gu A, Dao T (2024) Mamba: Linear-time sequence modeling with selective state spaces. First Conf. Language Model (COLM) (OpenReview).Google Scholar
  • Hand DJ, Till RJ (2001) A simple generalisation of the area under the ROC curve for multiple class classification problems. Machine Learn. 45(2):171–186.Google Scholar
  • Heyman DP (1995) Accurate computation of the fundamental matrix of a Markov chain. SIAM J. Matrix Anal. Appl. 16(3):954–963.Google Scholar
  • Hu EJ, Shen Y, Wallis P, Allen-Zhu Z, Li Y, Wang S, Wang L, Chen W (2022) LoRA: Low-rank adaptation of large language models. Proc. Tenth Internat. Conf. Learn. Representations (ICLR).Google Scholar
  • Jespersen S, Pedersen TB, Thorhauge J (2003) Evaluating the Markov assumption for web usage mining. Proc. Fifth ACM Internat. Workshop Web Inform. Data Management (Association for Computing Machinery, New York), 82–89.Google Scholar
  • Koehn D, Lessmann S, Schaal M (2020) Predicting online shopping behaviour from clickstream data using deep learning. Expert Systems Appl. 150:113342.Google Scholar
  • Koopmann T, Dallmann A, Hettinger L, Niebler T, Hotho A (2019) On the right track! Analysing and predicting navigation success in Wikipedia. Proc. 30th ACM Conf. Hypertext Social Media (Association for Computing Machinery, New York), 143–152.Google Scholar
  • Kumar H, Anuradha T, Solanki A, Singh KK, et al. (2020) Progressive machine learning approach with WebAstro for web usage mining. Procedia Comput. Sci. 167:1400–1410. Google Scholar
  • Le R, Ku H, Jun D (2021) Sequence-based clustering applied to long-term credit risk assessment. Expert Systems Appl. 165:113940.Google Scholar
  • Levene M, Borges J, Loizou G (2001) Zipf’s law for web surfers. Knowledge. Inform. Systems 3:120–129.Google Scholar
  • Li S, Wu J, Long C, Lin YB (2021) A full-process optimization-based background subtraction for moving object detection on general-purpose embedded devices. IEEE Trans. Consumer Electronics 67(2):129–140.Google Scholar
  • Madan BB, Goševa-Popstojanova K, Vaidyanathan K, Trivedi KS (2004) A method for modeling and quantifying the security attributes of intrusion tolerant systems. Performance Evaluation 56(1–4):167–186.Google Scholar
  • Massa S, Puliafito P (2002) Web usage mining: Knowledge discovery using Markov chains. Data Mining III, WIT Transactions on Information and Communication Technologies, vol. 28 (WIT Press, Southampton, UK).Google Scholar
  • Mlika F, Karoui W (2020) Proposed model to intelligent recommendation system based on Markov chains and grouping of genres. Procedia Comput. Sci. 176:868–877.Google Scholar
  • Pallis G, Vakali A, Pokorny J (2008) A clustering-based prefetching scheme on a web cache environment. Comput. Electr. Engrg. 34(4):309–323.Google Scholar
  • Park JS, Gho G (1997) An algorithm for computing the fundamental matrix of a Markov chain. J. Korean Oper. Res. Management Sci. Soc. 22(1):75–85.Google Scholar
  • Park S, Vasudev V (2017) Predicting web user’s behavior: An absorbing Markov chain approach. Fan M, Heikkilä J, Li H, Shaw M, Zhang H, eds. Internetworked World (Springer, Cham, Switzerland), 170–176.Google Scholar
  • Park S, Suresh NC, Jeong BK (2008) Sequence-based clustering for web usage mining: A new experimental framework and ANN-enhanced k-means algorithm. Data Knowledge Engrg. 65(3):512–543.Google Scholar
  • Piccardi T, Gerlach M, Arora A, West R (2023) A large-scale characterization of how readers browse Wikipedia. ACM Trans. Web. 17(2):11.Google Scholar
  • Plachouras V, Ounis I, Amati G (2005) The static absorbing model for the web. J. Web Engrg. 4(2):165–186.Google Scholar
  • Ranjan C, Ebrahimi S, Paynabar K (2022) Sequence graph transform (SGT): A feature embedding function for sequence data mining. Data Mining Knowledge Discovery 36(2):668–708.Google Scholar
  • Rawat M, Goyal N, Singh S (2017) Advancement of recommender system based on clickstream data using gradient boosting and random forest classifiers. 2017 Eighth Internat. Conf. Comput., Comm. Networking Tech. (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 1–6.Google Scholar
  • Rudin C (2019) Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nature Machine Intelligence 1(5):206–215.Google Scholar
  • Singer P, Helic D, Taraghi B, Strohmaier M (2014) Detecting memory and structure in human navigation patterns using Markov chain models of varying order. PLoS One 9(7):e102070.Google Scholar
  • Sokolova M, Lapalme G (2009) A systematic analysis of performance measures for classification tasks. Inform. Processing Management 45(4):427–437.Google Scholar
  • Taylor HM, Karlin S (1998) An Introduction to Stochastic Modeling, 3rd ed. (Academic Press, San Diego).Google Scholar
  • Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Kaiser Ł, Polosukhin I (2017) Attention is all you need. Guyon I, Von Luxburg U, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R, eds. Advances in Neural Information Processing Systems, vol. 30 (Neural Information Processing Systems, La Jolla, CA), 5999–6009.Google Scholar
  • Vermeer S, Trilling D (2020) Toward a better understanding of news user journeys: A Markov chain approach. Journalism Stud. 21(7):879–894.Google Scholar
  • Vermeer S, Trilling D, Kruikemeier S, de Vreese C (2020) Online news user journeys: The role of social media, news websites, and topics. Digital Journalism 8(9):1114–1141.Google Scholar
  • Wang JL (2004) Markov-chain based reliability analysis for distributed systems. Comput. Electrical Engrg. 30(3):183–205.Google Scholar
  • Wang W, Zaïane OR (2002) Clustering web sessions by sequence alignment. Proc. 13th Internat. Workshop Database Expert Systems Appl. (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 394–398.Google Scholar
  • Wei R, Wang J (2018) multiROC: Calculating and visualizing ROC and PR curves across multi-class classifications. r Package V. 1.1.1. https://CRAN.R-project.org/package=multiROC.Google Scholar
  • Wibowo W, Sari NP, Wilantari RN, Abdul-Rahman S (2021) Association rule mining method for the identification of internet use. Journal of Physics: Conference Series, vol. 1874 (IOP Publishing, Bristol, UK), 012009.Google Scholar
  • Xing D, Shen J (2004) Efficient data mining for web navigation patterns. Inform. Software Tech. 46(1):55–63.Google Scholar
  • Yang Y, Liu X (1999) A re-examination of text categorization methods. Proc 22nd Annual Internat. ACM SIGIR Conf. Res. Development Inform. Retrieval (Association for Computing Machinery, New York), 42–49.Google Scholar
  • Zheng L (2020) Research on e-commerce potential client mining applied to apriori association rule algorithm. 2020 Internat. Conf. Intelligent Transportation, Big Data Smart City (Institute of Electrical and Electronics Engineers, Piscataway, NJ), 667–670.Google Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.