pyJedAI: A Library with Resolution-Related Structures and Procedures for Products

Published Online:https://doi.org/10.1287/ijoc.2023.0410

References

  • Adikari S, Dutta K (2019) A new approach to real-time bidding in online advertisements: Auto pricing strategy. INFORMS J. Comput. 31(1):66–82.LinkGoogle Scholar
  • Adikari S, Dutta K (2021) Adaptive ad network selection for publisher-return optimization in mobile-app advertising. Decision Sci. 52(4):986–1017.CrossrefGoogle Scholar
  • Balsmeier B, Assaf M, Chesebro T, Fierro G, Johnson K, Johnson S, Li G, et al. (2018) Machine learning and natural language processing on the patent corpus: Data, tools, and new measures. J. Econom. Management Strategy 27(3):535–553.CrossrefGoogle Scholar
  • Besbes O, Maglaras C (2012) Dynamic pricing with financial milestones: Feedback-form policies. Management Sci. 58(9):1715–1731.LinkGoogle Scholar
  • Binette O, Steorts RC (2022) (Almost) all of entity resolution. Sci. Adv. 8(12):eabi8021.CrossrefGoogle Scholar
  • Bisong E (2019) Google Colaboratory (Apress, Berkeley, CA), 59–64.CrossrefGoogle Scholar
  • Boegershausen J, Datta H, Borah A, Stephen A (2022) Fields of gold: Scraping web data for marketing insights. J. Marketing 86(5):1–20.CrossrefGoogle Scholar
  • Brunner U, Stockinger K (2020) Entity matching with transformer architectures - A step forward in data integration. Bonifati A, Zhou Y, Salles MAV, Bohm A, Olteanu D, Fletcher GHL, Khan A, eds. Proc. 23rd Internat. Conf. Extending Database Tech. (EDBT) (OpenProceedings.org), 463–473.Google Scholar
  • Cheng X, Zhang J, Yan LL (2020) Understanding the impact of individual users’ rating characteristics on the predictive accuracy of recommender systems. INFORMS J. Comput. 32(2):303–320.AbstractGoogle Scholar
  • Chiang IR, Nunez MA (2007) Improving web-catalog design for easy product search. INFORMS J. Comput. 19(4):510–519.LinkGoogle Scholar
  • Christen P (2012) A survey of indexing techniques for scalable record linkage and deduplication. IEEE Trans. Knowledge Data Engrg. 24(9):1537–1555.CrossrefGoogle Scholar
  • Christophides V, Efthymiou V, Palpanas T, Papadakis G, Stefanidis K (2021) An overview of end-to-end entity resolution for big data. ACM Comput. Surveys 53(6):1–42.CrossrefGoogle Scholar
  • Dalvi NN, Kumar R, Pang B, Ramakrishnan R, Tomkins A, Bohannon P, Keerthi SS, Merugu S (2009) A web of concepts. Paredaens J, Su J, eds. Proc. Twenty-Eighth ACM SIGMOD-SIGACT-SIGART Sympos. Principles Database Systems (PODS) (ACM, New York), 1–12.Google Scholar
  • Edelman B (2012) Using internet data for economic research. J. Econom. Perspect. 26(2):189–206.CrossrefGoogle Scholar
  • Feng J, Bhargava HK, Pennock DM (2007) Implementing sponsored search in web search engines: Computational evaluation of alternative mechanisms. INFORMS J. Comput. 19(1):137–148.LinkGoogle Scholar
  • Franklin MJ, Halevy AY, Maier D (2005) From databases to dataspaces: A new abstraction for information management. SIGMOD Rec. 34(4):27–33.CrossrefGoogle Scholar
  • Ghoshal A, Sarkar S (2014) Association rules for recommendations with multiple items. INFORMS J. Comput. 26(3):433–448.LinkGoogle Scholar
  • Guo R, Sun P, Lindgren E, Geng Q, Simcha D, Chern F, Kumar S (2020) Accelerating large-scale inference with anisotropic vector quantization. Proc. 37th Internat. Conf. Machine Learn. (ICML), Virtual Event, Proceedings of Machine Learning Research, vol. 119 (PMLR, New York), 3887–3896.Google Scholar
  • Hassanzadeh O, Chiang F, Miller RJ, Lee HC (2009) Framework for evaluating clustering algorithms in duplicate detection. Proc. VLDB Endowment 2(1):1282–1293.Google Scholar
  • Heath T, Bizer C (2011) Linked Data: Evolving the Web into a Global Data Space, 1st ed, Synthesis Lectures on the Semantic Web: Theory and Technology (Morgan & Claypool Publishers, San Rafael, CA), 1–136.Google Scholar
  • Hunold M, Kesler R, Laitenberger U (2020) Rankings of online travel agents, channel pricing, and consumer protection. Marketing Sci. 39(1):92–116.LinkGoogle Scholar
  • Hunold M, Laitenberger U, Thébaudin G (2022) Bye-box: An analysis of non-promotion on the Amazon Marketplace. Working paper, Centre de Recherche en Economie et Droit (CRED), Paris.Google Scholar
  • Ioannou E, Nikoletos K, Papadakis G (2024) pyJedAI: A library with resolution-related structures and procedures for products. http://dx.doi.org/10.1287/ijoc.2023.0410.cd, https://github.com/INFORMSJoC/2023.0410.Google Scholar
  • Johnson J, Douze M, Jégou H (2021) Billion-scale similarity search with GPUs. IEEE Trans. Big Data 7(3):535–547.CrossrefGoogle Scholar
  • Köpcke H, Thor A, Rahm E (2010) Evaluation of entity resolution approaches on real-world match problems. Proc. VLDB Endowment 3(1):484–493.Google Scholar
  • Li Y, Li J, Suhara Y, Doan A, Tan W (2020) Deep entity matching with pre-trained language models. Proc. VLDB Endowment 14(1):50–60.Google Scholar
  • Mandilaras GM, Papadakis G, Gagliardelli L, Simonini G, Thanos E, Giannakopoulos G, Bergamaschi S, et al. (2021) Reproducible experiments on three-dimensional entity resolution with JedAI. Inform. Systems 102:101830.CrossrefGoogle Scholar
  • Min B, Ross H, Sulem E, Veyseh APB, Nguyen TH, Sainz O, Agirre E, Heintz I, Roth D (2023) Recent advances in natural language processing via large pre-trained language models: A survey. ACM Comput. Surveys 56(2):1–40.CrossrefGoogle Scholar
  • Mudgal S, Li H, Rekatsinas T, Doan A, Park Y, Krishnan G, Deep R, Arcaute E, Raghavendra V (2018) Deep learning for entity matching: A design space exploration. Das G, Jermaine CM, Bernstein PA, eds. Proc. 2018 Internat. Conf. Management Data (SIGMOD) Conf. (ACM, New York), 19–34.Google Scholar
  • Narayan A, Chami I, Orr LJ, Ré C (2022) Can foundation models wrangle your data? Proc. VLDB Endowment 16(4):738–746.Google Scholar
  • Nikoletos K, Papadakis G, Koubarakis M (2022) pyJedAI: A lightsaber for link discovery. Dimou A, Haller A, Gentile AL, Ristoski P, eds. Proc. ISWC 2022 Posters, Demos Indust. Tracks: From Novel Ideas Industrial Practice Co-located with 21st Internat. Semantic Web Conf. (ISWC), Virtual Conf., CEUR Workshop Proc., vol. 3254 (CEUR-WS.org).Google Scholar
  • Papadakis G, Efthymiou V, Thanos E, Hassanzadeh O (2022) Bipartite graph matching algorithms for clean-clean entity resolution: An empirical evaluation. Stoyanovich J, Teubner J, Guagliardo P, Nikolic M, Pieris A, Muhlig J, Ozcan F, eds. Proc. 25th Internat. Conf. Extending Database Tech. (EDBT) (OpenProceedings.org), vol. 2, 462–474.Google Scholar
  • Papadakis G, Ioannou E, Thanos E, Palpanas T (2021a) The Four Generations of Entity Resolution, Synthesis Lectures on Data Management (Morgan & Claypool Publishers, Williston, VT), 1–170.Google Scholar
  • Papadakis G, Kirielle N, Christen P, Palpanas T (2024) A critical re-evaluation of record linkage bench-marks for learning-based matching algorithms. 40th IEEE Internat. Conf. Data Engineering (ICDE) (IEEE, Piscataway, NJ), 3435–3448.Google Scholar
  • Papadakis G, Skoutas D, Thanos E, Palpanas T (2021b) Blocking and filtering techniques for entity resolution: A survey. ACM Comput. Surveys 53(2):1–42.CrossrefGoogle Scholar
  • Papadakis G, Svirsky J, Gal A, Palpanas T (2016) Comparative analysis of approximate blocking techniques for entity resolution. Proc. VLDB Endowment 9(9):684–695.CrossrefGoogle Scholar
  • Papadakis G, Fisichella M, Schoger F, Mandilaras G, Augsten N, Nejdl W (2023) Benchmarking filtering techniques for entity resolution. 39th IEEE Internat. Conf. Data Engrg. (ICDE) (IEEE, Piscataway, NJ), 653–666.Google Scholar
  • Raffo J, Lhuillery S (2009) How to play the “names game”: Patent retrieval comparing different heuristics. Res. Policy 38(10):1617–1627.CrossrefGoogle Scholar
  • Tang L, Walsh JP (2010) Bibliometric fingerprints: Name disambiguation based on approximate structure equivalence of cognitive maps. Scientometrics 84(3):763–784.CrossrefGoogle Scholar
  • Thirumuruganathan S, Li H, Tang N, Ouzzani M, Govind Y, Paulsen D, Fung G, Doan A (2021) Deep learning for blocking in entity matching: A design space exploration. Proc. VLDB Endowment 14(11):2459–2472.CrossrefGoogle Scholar
  • Wang H, Li J, Wu H, Hovy E, Sun Y (2023) Pre-trained language models and their applications. Engrg. 25:51–65.Google Scholar
  • Yang YC, Liu H, Cai Y (2013) Discovery of online shopping patterns across websites. INFORMS J. Comput. 25(1):161–176.LinkGoogle Scholar
  • Zeakis A, Papadakis G, Skoutas D, Koubarakis M (2023) Pre-trained embeddings for entity resolution: An experimental analysis. Proc. VLDB Endowment 16(9):2225–2238.CrossrefGoogle Scholar
  • Zhang D, Lu Z (2013) Assessing the value of dynamic pricing in network revenue management. INFORMS J. Comput. 25(1):102–115.LinkGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.