Cover Song Identification Using Compression-based Distance Measures

M.Sc. Teppo E. Ahonen will defend his doctoral thesis Cover Song Identification Using Compression-based Distance Measures on Friday the 1st of April 2016 at 12 o'clock in the University of Helsinki Exactum Building, Auditorium CK112 (Gustaf Hällströminkatu 2b) His opponent is Academy Professor Petri Toiviainen (University of Jyväskylä) and custos Professor Esko Ukkonen (University of Helsinki). The defence will be held in Finnish.

Measuring similarity in music data is a problem with various potential applications. In recent years, the task known as cover song identification has gained widespread attention. In cover song identification, the purpose is to determine whether a piece of music is a different rendition of a previous version of the composition. The task is quite trivial for a human listener, but highly challenging for a computer.

 

This research approaches the problem from an information theoretic starting point. Assuming that cover versions share musical information with the original performance, we strive to measure the degree of this common information as the amount of computational resources needed to turn one version into another. Using a similarity measure known as normalized compression distance, we approximate the non-computable Kolmogorov complexity as the length of an object when compressed using a real-world data compression algorithm. If two pieces of music share musical information, we should be able to compress one using a model learned from the other.

In order to use compression-based similarity measuring, the meaningful musical information needs to be extracted from the raw audio signal data. The most commonly used representation for this task is known as chromagram: a sequence of real-valued vectors describing the temporal tonal content of the piece of music. Measuring the similarity between two chromagrams effectively with a data compression algorithm requires further processing to extract relevant features and find a more suitable discrete representation for them. Here, the challenge is to process the data without losing the distinguishing characteristics of the music.

In this research, we study the difficult nature of cover song identification and search for an effective compression-based system for the task. Harmonic and melodic features, different representations for them, commonly used data compression algorithms, and several other variables of the problem are addressed thoroughly. The research seeks to shed light on how different choices in the scheme attribute to the performance of the system. Additional attention is paid to combining different features, with several combination strategies studied. Extensive empirical evaluation of the identification system has been performed, using large sets of real-world music data.

Evaluations show that the compression-based similarity measuring performs relatively well but fails to achieve the accuracy of the existing solution that measures similarity by using common subsequences. The best compression-based results are obtained by a combination of distances based on two harmonic representations obtained from chromagrams using hidden Markov model chord estimation, and an octave-folded version of the extracted salient melody representation. The most distinct reason for the shortcoming of the compression performance is the scarce amount of data available for a single piece of music. This was partially overcome by internal data duplication. As a whole, the process is solid and provides a practical foundation for an information theoretic approach for cover song identification.

Availability of the dissertation

An electronic version of the doctoral dissertation is available on the e-thesis site of the University of Helsinki at http://urn.fi/URN:ISBN:978-951-51-2026-7.

Printed copies will be available on request from Teppo E. Ahonen: tel. 02941 51276 or teppo.ahonen@cs.helsinki.fi.

Created date

30.03.2016 - 14:53

The university’s team Game of Nolife won Western European programming contest for students

In the finals in Thailand in spring 2016, the students from the University of Helsinki will face the best teams in the world.

The University of Helsinki has won the inter-university NWERC 2015 programming contest that was held in Linköping recently. It was attended by 95 teams from Western Europe. The Game of Nolife team from the University of Helsinki consisted of computer-science and maths students Tuukka Korhonen, Olli Hirviniemi and Otte Heinävaara.

The Carat research team has published a dataset focusing on collaborative energy diagnostics of mobile devices and applications

 

 

The Carat research team from University of Helsinki publishes a dataset from the Carat project (http://carat.cs.helsinki.fi/) focusing on collaborative energy diagnostics of mobile devices and applications. The dataset was presented at the IEEE PerCom’15 conference last spring in the publication "Energy Modeling of System Settings: A Crowdsourced Approach" that won the Marc Weiser Best Paper Award given at the conference.

Eemil Lagerspetz was awarded a grant by the Jorma Ollila fund of Nokia Foundation on November 24, 2015

 

 
 
Eemil Lagerspetz was awarded a grant by the Jorma Ollila fund of Nokia Foundation on November 24, 2015. Congratulations!
 
The fund was launched in year 2014 to support post doctoral research career development. 
The title of Eemil’s post doctoral research is “Mind The Gap: Combining Trajectory Datasets for a Holistic Picture of Human Mobility” and the research will be carried out at the Hong Kong University of Science and Technology (HKUST) in 2016.
 

Collaborative Networking (CoNe) group researchers got the best paper award at 2nd ACM Conference on Information-Centric Networking (ICN 2015)

 

Collaborative Networking (CoNe) group researchers got the best paper award at 2nd ACM Conference on Information-Centric Networking (ICN 2015), one of the most prestigious venues for ICN research. The article entitled Pro-Diluvian: Understanding Scoped-Flooding for Content Discovery in ICN is lead by Liang Wang - a recent PhD graduate from CoNe research group, and is the outcome of collaboration with Suzan Bayhan and Jussi Kangasharju from UH, Jörg Ott from Aalto University, Arjuna Sathiaseelan and Jon Crowcroft from Cambridge University.