Background Music Recommendation on Short Video Sharing Platforms

Published Online:https://doi.org/10.1287/isre.2022.0093

References

  • Abu-El-Haija S, Kothari N, Lee J, Natsev P, Toderici G, Varadarajan B, Vijayanarasimhan S (2016) Youtube-8m: A large-scale video classification benchmark. Preprint, submitted September 27, https://arxiv.org/abs/1609.08675.Google Scholar
  • Bahdanau D, Cho KH, Bengio Y (2015) Neural machine translation by jointly learning to align and translate. Proc. Third Internat. Conf. Learn. Representation (ICLR, Appleton, WI), 1–5.Google Scholar
  • Bi X, Qu A, Shen X (2018) Multilayer tensor factorization with applications to recommender systems. Ann. Statist. 46(6B):3308–3333.CrossrefGoogle Scholar
  • Blacker A (2023) Worldwide and US download leaders 2022. Accessed January 17, 2024, https://blog.apptopia.com/worldwide-and-us-download-leaders-2022.Google Scholar
  • Braunhofer M, Kaminskas M, Ricci F (2013) Location-aware music recommendation. Internat. J. Multimedia Inform. Retrieval 2(1):31–44.CrossrefGoogle Scholar
  • Brent W (2009) Cepstral analysis tools for percussive timbre identification. Proc. Third Internat. Pure Data Convention, 1–7.Google Scholar
  • Cano P, Koppenberger M, Wack N (2005) Content-based music audio recommendation. Proc. 13th Annual ACM Internat. Conf. Multimedia (ACM, New York), 211–212.Google Scholar
  • Devlin J, Chang MW, Lee K, Toutanova K (2018) BERT: Pre-training of deep bidirectional transformers for language understanding. Preprint, submitted October 11, https://arxiv.org/abs/1810.04805.Google Scholar
  • Galli M, Gurini DF, Gasparetti F, Micarelli A, Sansonetti G (2015) Analysis of user-generated content for improving YouTube video. Proc. 9th RecSys Posters (ACM, New York), 1–2.Google Scholar
  • Gómez-Cañón JS, Cano E, Eerola T, Herrera P, Hu X, Yang YH, Gómez E (2021) Music emotion recognition: Toward new, robust standards in personalized and context-sensitive applications. IEEE Signal Processing Magazine 38(6):106–114.CrossrefGoogle Scholar
  • Gomez-Uribe CA, Hunt N (2015) The netflix recommender system: Algorithms, business value, and innovation. ACM Trans. Management Inform. Systems 6(4):1–19.CrossrefGoogle Scholar
  • Grosche P, Müller M, Kurth F (2010) Cyclic tempogram—A mid-level tempo representation for music signals. Proc. 2010 IEEE Internat. Conf. Acoustics Speech Signal Processing (IEEE, Piscataway, NJ), 5522–5525.Google Scholar
  • Hansen C, Hansen C, Maystre L, Mehrotra R, Brost B, Tomasi F, Lalmas M (2020) Contextual and sequential user embeddings for large-scale music recommendation. Proc. 14th ACM Conf. Recommender Systems (ACM, New York), 53–62.Google Scholar
  • Harper FM, Konstan JA (2015) The MovieLens datasets: History and context. ACM Trans. Interactive Intelligent Systems 5(4):1–19.CrossrefGoogle Scholar
  • He R, Fang C, Wang Z, McAuley J (2016) Vista: A visually, socially, and temporally-aware model for artistic recommendation. Proc. 10th ACM Conf. Recommender Systems (ACM, New York), 309–316.Google Scholar
  • He X, Liao L, Zhang H, Nie L, Hu X, Chua TS (2017) Neural collaborative filtering. Proc. 26th Internat. Conf. World Wide Web (ACM, New York), 173–182.Google Scholar
  • Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput. 9(8):1735–1780.CrossrefGoogle Scholar
  • HuggingFace (2020) BERT-base-Chinese. Accessed December 28, 2023, https://huggingface.co/bert-base-chinese.Google Scholar
  • Johnson CC (2014) Logistic matrix factorization for implicit feedback data. Adv. Neural Inform. Processing Systems 27(78):1–9.Google Scholar
  • Kaminskas M, Ricci F, Schedl M (2013) Location-aware music recommendation using auto-tagging and hybrid matching. Proc. 7th ACM Conf. Recommender Systems (ACM, New York), 17–24.Google Scholar
  • Kang WC, McAuley J (2018) Learning consumer and producer embeddings for user-generated content recommendation. Proc. 12th ACM Conf. Recommender Systems (ACM, New York), 407–411.Google Scholar
  • Krizhevsky A, Sutskever I, Hinton GE (2017) ImageNet classification with deep convolutional neural networks. Comm. ACM 60(6):84–90.Google Scholar
  • Kumar V, Minz S (2013) Mood classification of lyrics using SentiWordNet. Proc. 2013 Internat. Conf. Computer Comm. Inform. (IEEE, Piscataway, NJ), 1–5.Google Scholar
  • Kuo FF, Shan MK, Lee SY (2013) Background music recommendation for video based on multimodal latent semantic analysis. Proc. 2013 IEEE Internat. Conf. Multimedia Expo (IEEE, Piscataway, NJ), 1–6.Google Scholar
  • Li W, Zhang Y, Sun Y, Wang W, Li M, Zhang W, Lin X (2019) Approximate nearest neighbor search on high dimensional data—Experiments, analyses, and improvement. IEEE Trans. Knowledge Data Engrg. 32(8):1475–1488.CrossrefGoogle Scholar
  • Liang D, Zhan M, Ellis DP (2015) Content-aware collaborative music recommendation using pre-trained neural networks. Proc. 16th Internat. Soc. Music Inform. Retrieval Conf. (ISMIR, Canada), 295–301.Google Scholar
  • Liao C, Wang PP, Zhang Y (2009) Mining association patterns between music and video clips in professional MTV. Proc. 15th Internat. Conf. Multimedia Model. (Springer, New York), 401–412.Google Scholar
  • Lin TW, Shan MK (2017) Correlation-based background music recommendation by incorporating temporal sequence of local features. Proc. Third IEEE Internat. Conf. Multimedia Big Data (IEEE, Piscataway, NJ), 158–164.Google Scholar
  • Lin JC, Wei WL, Wang HM (2016) DEMV-matchmaker: Emotional temporal course representation and deep similarity matching for automatic music video generation. Proc. 2016 IEEE Internat. Conf. Acoustics Speech Signal Processing (IEEE, Piscataway, NJ), 2772–2776.Google Scholar
  • Lin YT, Tsai TH, Hu MC, Cheng WH, Wu JL (2014) Semantic based background music recommendation for home videos. Internat. Conf. Multimedia Model. (Springer International Publishing, Cham, Switzerland), 283–290.Google Scholar
  • Lin JC, Wei WL, Yang J, Wang HM, Liao HYM (2017) Automatic music video generation based on simultaneous soundtrack recommendation and video editing. Proc. 25th ACM Internat. Conf. Multimedia (ACM, New York), 519–527.Google Scholar
  • Linden G, Smith B, York J (2003) Amazon.com recommendations: Item-to-item collaborative filtering. IEEE Internet Comput. 7(1):76–80.CrossrefGoogle Scholar
  • Liu CL, Chen YC (2018) Background music recommendation based on latent factors and moods. Knowledge Based Systems 159:158–170.CrossrefGoogle Scholar
  • Liu T, Moore A, Yang K, Gray A (2004) An investigation of practical approximate nearest neighbor algorithms. Adv. Neural Inform. Processing Systems 17 (MIT Press, Cambridge, MA).Google Scholar
  • Logan B (2004) Music recommendation from song sets. Proc. Fifth Internat. Soc. Music Inform. Retrieval Conf. (ISMIR, Canada), 425–428.Google Scholar
  • Magron P, Févotte C (2021) Leveraging the structure of musical preference in content-aware music recommendation. Proc. 2021 IEEE Internat. Conf. Acoustics Speech Signal Processing (IEEE, Piscataway, NJ), 581–585.Google Scholar
  • Mayer R, Neumayer R, Rauber A (2008) Combination of audio and lyrics features for genre classification in digital audio collections. Proc. 16th ACM Internat. Conf. Multimedia (ACM, New York), 159–168.Google Scholar
  • McFee B, Raffel C, Liang D, Ellis DP, McVicar M, Battenberg E, Nieto O (2015) Librosa: Audio and music signal analysis in Python. Proc. 14th Python Sci. Conf., vol. 8, 18–25.Google Scholar
  • Nagawade MS, Ratnaparkhe VR (2017) Musical instrument identification using MFCC. Proc. Second IEEE Internat. Conf. Recent Trends Electronics Inform. Comm. Tech. (IEEE, Piscataway, NJ), 2198–2202.Google Scholar
  • Nalini N, Palanivel S (2016) Music emotion recognition: The combined evidence of MFCC and residual phase. Egyptian Inform. J. 17(1):1–10.CrossrefGoogle Scholar
  • Nanopoulos A, Rafailidis D, Symeonidis P, Manolopoulos Y (2009) Musicbox: Personalized music recommendation based on cubic analysis of social tags. IEEE Trans. Audio Speech Language Processing 18(2):407–412.CrossrefGoogle Scholar
  • Prétet L, Richard G, Peeters G (2021) Cross-modal music-video recommendation: A study of design choices. Proc. 2021 Internat. Joint Conf. Neural Networks (IEEE, New York), 1–9.Google Scholar
  • Reimers N, Gurevych I (2019) Sentence-BERT: Sentence embeddings using Siamese BERT-networks. Proc. 2019 Conf. Empirical Methods Natl. Language Processing Ninth Internat. Joint Conf. Natl. Language Processing (Association for Computational Linguistics, Stroudsburg, PA), 3982–3992.Google Scholar
  • Reimers N, Gurevych I (2020) Making monolingual sentence embeddings multilingual using knowledge distillation. Proc. 2020 Conf. Empirical Methods Natl. Language Processing (Association for Computational Linguistics, Stroudsburg, PA), 4512–4525.Google Scholar
  • Rendle S (2010) Factorization machines. Proc. 2010 IEEE Internat. Conf. Data Mining (IEEE, Piscataway, NJ), 995–1000.Google Scholar
  • Rubens N, Elahi M, Sugiyama M, Kaplan D (2015) Active learning in recommender systems. Recommender Systems Handbook (Springer, Boston), 809–846.Google Scholar
  • Schein AI, Popescul A, Ungar LH, Pennock DM (2002) Methods and metrics for cold-start recommendations. Proc. 25th Annual Internat. ACM SIGIR Conf. Res. Development Inform. Retrieval (ACM, New York), 253–260.Google Scholar
  • Shen M, Wang J, Liu O, Wang H (2020) Expert detection and recommendation model with user-generated tags in collaborative tagging systems. J. Database Management 31(4):24–45.CrossrefGoogle Scholar
  • Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. Preprint, submitted September 4, https://arxiv.org/abs/1409.1556.Google Scholar
  • Sturm BL (2013) The GTZAN data set: Its contents, its faults, their effects on evaluation, and its future use. Preprint, submitted June 6, https://arxiv.org/abs/1306.1461.Google Scholar
  • Tsukuda K, Fukayama S, Goto M (2019) ABCPRec: Adaptively bridging consumer and producer roles for user-generated content recommendation. Proc. 42nd Internat. ACM SIGIR Conf. Res. Development Inform. Retrieval (ACM, New York), 1197–1200.Google Scholar
  • Tucker LR (1966) Some mathematical notes on three-mode factor analysis. Psychometrika 31(3):279–311.CrossrefGoogle Scholar
  • Volkovs M, Yu G, Poutanen T (2017) Dropoutnet: Addressing cold start in recommender systems. Adv. Neural Inform. Processing Systems 30 (The MIT Press, Cambridge, MA).Google Scholar
  • Wang X, Wang Y (2014) Improving content-based and hybrid music recommendation using deep learning. Proc. 22nd ACM Internat. Conf. Multimedia (ACM, New York), 627–636.Google Scholar
  • Wang D, Deng S, Xu G (2018) Sequence-based context-aware music recommendation. Inform. Retrieval J. 21(2):230–252.CrossrefGoogle Scholar
  • Wang X, Rosenblum D, Wang Y (2012) Context-aware mobile music recommendation for daily activities. Proc. 20th ACM Internat. Conf. Multimedia (ACM, New York), 99–108.Google Scholar
  • Wang X, Yu L, Ren K, Tao G, Zhang W, Yu Y, Wang J (2017) Dynamic attention deep model for article recommendation by learning human editors’ demonstration. Proc. 23rd ACM SIGKDD Internat. Conf. Knowledge Discovery Data Mining (ACM, New York), 2051–2059.Google Scholar
  • Yang C, Miao L, Jiang B, Li D, Cao D (2020) Gated and attentive neural collaborative filtering for user generated list recommendation. Knowledge Based Systems 187:104839.CrossrefGoogle Scholar
  • Yi J, Zhu Y, Xie J, Chen Z (2021) Cross-modal variational auto-encoder for content-based micro-video background music recommendation. IEEE Trans. Multimedia 25:515–528.CrossrefGoogle Scholar
  • Zhang S, Yao L, Sun A, Tay Y (2019) Deep learning based recommender system: A survey and new perspectives. ACM Comput. Surveys 52(1):1–38.CrossrefGoogle Scholar
  • Zheng F, Zhang G, Song Z (2001) Comparison of different implementations of MFCC. J. Comput. Sci. Tech. 16(6):582–589.CrossrefGoogle Scholar
INFORMS site uses cookies to store information on your computer. Some are essential to make our site work; Others help us improve the user experience. By using this site, you consent to the placement of these cookies. Please read our Privacy Statement to learn more.