LYRICCOVERS 2.0: AN ENHANCED DATASET FOR COVER SONG ANALYSIS
Journal: IADIS INTERNATIONAL JOURNAL ON WWW/INTERNET (Vol.22, No. 2)Publication Date: 2024-12-17
Authors : Maximilian Balluff Maximilian Auch Peter Mandl; Christian Wolff;
Page : 75-92
Keywords : ;
Abstract
This research offers a detailed examination of a novel dataset that collates original musical compositions alongside their derivative cover versions. Unique in its inclusion of both links to YouTube as well as and lyrical content, the dataset enlists more than 78,000 tracks, encompassing more than 24,000 cover song groupings. It stands as the most diverse compendium of cover songs currently available for study. The characteristics of the LyricCovers dataset are thoroughly analyzed through its metadata, and empirical evaluations in the subsequent experimental lyrics analysis section suggest that lyrical analysis is a fundamental component in the identification and study of cover songs. This work presents a baseline approach to cover song detection, with an emphasis on lyrical content processing. It describes the extraction of lyrics from the audio files and the application of the Jina Embeddings 2 Model, fine-tuned with a hard triplet-loss objective, which successfully exploits lyric similarity to accurately identify cover songs.
Other Latest Articles
- “OF COURSE AI DISCRIMINATES”: IDENTIFYING COMMUNICATION GAPS AND CROSS-DISCIPLINARY TRANSLATION CHALLENGES
- A STRATEGY FOR PREDICTING STUDENT PERFORMANCE ON AN ONLINE PLATFORM: PROPOSAL, DEVELOPMENT AND VALIDATION
- PREDICTING REVIEW HELPFULNESS IN VIDEO GAMES: A COMPARATIVE ANALYSIS OF MACHINE LEARNING MODELS AND NLP INTEGRATION
- Critical Analysis of Manju Kapur's “Custody” and “Home”
- Seismic Retrofit of Reinforced Concrete Frames with Filled Masonry Materials Using Fiber Reinforced Polymers
Last modified: 2025-01-03 01:37:14