A Quantitative Comparison of Different Approaches for Melody Extraction from Polyphonic Audio Recording
Authors
Emilia Gómez
Sebastian Streich
Beesuan Ong
Rui Pedro Paiva
Sven Tappert
Jan-Mark Batke
Graham Poliner
Dan Ellis
Juan-Pablo Bello
Sebastian Streich
Beesuan Ong
Rui Pedro Paiva
Sven Tappert
Jan-Mark Batke
Graham Poliner
Dan Ellis
Juan-Pablo Bello
Abstract
This paper provides an overview of current state-of-the-artapproaches for melody extraction from polyphonic audio recordings, and it
proposes a methodology for the quantitative evaluation of melody
extraction algorithms. We first define a general architecture for melody
extraction systems and discuss the difficulties of the problem in hand; then,
we review different approaches for melody extraction which represent the
current state-of-the-art in this area. We propose and discuss a methodology
for evaluating the different approaches, and we finally present some results
and conclusions of the comparison.
TechReport Number
RPP-002PDF File
Cited by
Year 2011 : 3 citations
Fonseca N. (2011). “Singing voice resynthesis using concatenative-based techniques”, PhD Thesis, University of Porto, Portugal
Serrà J. J. (2011). “Identification of versions of the same musical composition by processing audio descriptions”. PhD Thesis, Universitat Pompeu Fabra, Barcelona, Spain.
???. "?? ?? ??? ?? ??? ?? ??? ?? ??." ?????? 16.4 (2011): 84-92.
Year 2010 : 1 citations
1. JL Durrieu (2010). “Transcription et séparation automatique de la mélodie principale dans les signaux de musique polyphoniques”, PhD Thesis, Telecom Paris Tech
Year 2008 : 4 citations
Misra (2008). “Technical report on audio and speech processing”. Technical Report FP6-027026, K-Space D3.7
Oudtshoorn B. (2008). “Investigating the Feasibility of Near Real-Time Music Transcription on Mobile Devices”. Internal Report, University of Western Australia.
Rao V. and Rao P. (2008). “Vocal Melody Detection in the Presence of Pitched Accompaniment using Harmonic Matching Methods”. Proceedings of the International Conference on Digital Audio Effects – DAFx’08, Espoo, Finland
Salamon J. (2008). “Chroma-based Predominant Melody and Bass Line Extraction from Music Audio Signals”. MSc Thesis, University Pompeu Fabra, Barcelona.
Year 2007 : 3 citations
Demopoulos R. J. and Katchabaw M. J. (2007). “Investigating the Feasibility of Near Real-Time Music Transcription on Mobile Devices”. Technical Report #677, University of Western Ontario, Canada.
Poliner G., Ellis D., Ehmann A., Gomez E., Streich S. and Ong B. (2007). “Melody Transcription from Music Audio: Approaches and Evaluation”. IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, No. 4, pp. 1247 - 1256.
Reis G. and Fernandez Veja F. (2007). “Electronic synthesis using genetic algorithms for automatic music transcription”. Proceedings of the 9th Annual Conference on Genetic and Evolutionary Computation, London, England.
Year 2006 : 2 citations
de Cheveigné A. (2006). ““Multiple F0 Estimation”, in Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, edited by DeLiang Wang and Guy J. Brown, John Wiley and sons.
Lemvigh M. B. (2006). “Automatisk transskribering af musik”. Technical Report, University of Copenhagen, Denmark.