Quantcast


CURATOR

Google India PhD Student, INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR

PINBOARD SUMMARY

Abstract Semantic Representation of Music for Music Information Retrival Tasks

Melody extraction is the task of automatically extracting the dominant melodic line in a polyphonic music. Here, polyphony refers to the music in which more than one instrument may sound concurrently (e.g. piano, violin, drums, human singing voice etc.,) or it can be a single instrument which is capable of producing multiple notes at a given time (e.g. violin). The word melody is the musicological term which is purely subjective in nature. Hence, we can find many definitions of melody in various contexts. The melody representation adopted in my work is the one proposed by Mastaka Goto, such as melody is the sequence of F0 (fundamental frequency or pitch) values correspond to the dominant instrument's perceived pitch. The dominant instrument can be either the human singing voice or any lead instrument in the polyphonic music signal. The accurate extraction of the melody remained as a challenging and unsolved task in the research community because of its two-fold complexity. Firstly, the polyphonic music signal is the superposition of many instruments which play simultaneously. Hence, it is hard to attribute specific frequency bands and energy levels to a specific instrument. Secondly, the task of determining the sequence of pitch values that constitutes the main melody. This in turn poses mainly three challenges: (i) determining the melody regions in the music signal, (ii) ensuring the estimated F0 is in the correct octave range and (iii) selecting the right melody pitch when there is more than one note present at the same time. The accurately extracted melody can be used in many potential applications such as automatic music transcription, query by humming, music de-soloing, singer identification, and in many other music information retrieval tasks.

4 ITEMS PINNED