CISUC

On the Detection of Melody Notes in Polyphonic Audio

Authors

Abstract

This paper describes a method for melody detection in polyphonic musical signals. Our approach starts by ob-taining a set of pitch candidates for each time frame, with recourse to an auditory model. Trajectories of the most salient pitches are then constructed. Next, note candi-dates are obtained by trajectory segmentation (in terms of frequency and pitch salience variations). Too short, low-salience and harmonically related notes are then elimi-nated. Finally, the notes comprising the melody are ex-tracted. This is the main topic of this paper.
We select the melody notes by making use of note sa-liences and melodic smoothness. First, we select the notes with highest pitch salience at each moment. Then, by the melodic smoothness principle, we exploit the fact that tonal melodies are usually smooth. Thus, long music intervals indicate the presence of possibly erroneous notes, which are substituted by notes that smooth out the melodic contour.
Finally, false positives in the extracted melody should be eliminated. To this end, we remove spurious notes that correspond to abrupt drops in note saliences or du-rations. Additionally, note clustering is conducted to further discriminate between true melody notes and false positives.

Keywords

Melody detection, melodic smoothness, feature extraction, note clustering

Subject

Music Information Retrieval

Conference

ISMIR'2005, September 2005

PDF File


Cited by

Year 2012 : 1 citations

 1. YR Chien, HM Wang, SK Jeng – 2012. “SIMULATED FORMANT MODELING OF ACCOMPANIED SINGING SIGNALS FOR VOCAL MELODY EXTRACTION”, Proceedings of the 9th Sound and Music Computing Conference, Copenhagen, Denmark, p.33-40

Year 2011 : 2 citations

 Chien, Y., Wang, H., Jeng, S. (2011). “An Acoustic-Phonetic Approach to Vocal Melody Extraction”. In ISMIR(2011), pp.25-30.

 Klapuri A. (2011). “Pattern Induction and Matching in Music Signals”. EXPLORING MUSIC CONTENTS, Lecture Notes in Computer Science, 2010, Volume 6684/2011, pp. 188-204.

Year 2010 : 5 citations

 Allali J., Ferraro P. et. al (2010). “Polyphonic Alignment Algorithms for Symbolic Music Retrieval”. AUDITORY DISPLAY, Lecture Notes in Computer Science, 2010, Volume 5954/2010, pp. 466-482.

 JL Durrieu (2010). “Transcription et séparation automatique de la mélodie principale dans les signaux de musique polyphoniques”, PhD Thesis, Telecom Paris Tech

 Fernández C. (2010). “Detector de melodía”. http://quieroseringenieroinformatico.blogspot.com/2010/01/detector-de-melodia.html

 Kuder M. (2010). Extraction of Predominant Melody from Audio Recordings. EngD thesis. University of Ljubljana, Slovenia.

 López D. H. (2010). “ANÁLISIS DE LAS CARACTERÍSTICAS ACÚSTICAS DE LA PERCUSIÓN HUMANA”. MSc Thesis, Universidad Autónoma de Madrid.

Year 2009 : 3 citations

 Allali J., Ferraro P. et. al (2009). “Polyphonic Alignment Algorithms for Symbolic Music Retrieval”. Auditory Display, Lecture Notes in Computer Science, 2010, Volume 5954/2010, 466-482.

 Allali J., Ferraro P. et. al (2009). “Toward a General Framework for Polyphonic Comparison”. Fundamenta Informaticae, Vol. 97 (3), pp.331-346.

 Chien Y.-R. and Wang H.-M. (2009). "Vocality-Sensitive Melody Extraction from Popular Songs". APSIPA ASC 2009

Year 2008 : 5 citations

 Every M. R. (2008). “Discriminating Between Pitched Sources in Music Audio”, IEEE Transactions on Audio, Speech and Language Processing, Vol. 16, No. 2, pp. 267-277.

 Klapuri, A. (2008). "Multipitch Analysis of Polyphonic Music and Speech Signals Using an Auditory Model," Audio, Speech, and Language Processing, IEEE Transactions on , vol.16, no.2, pp.255-266, Feb. 2008.

 Ryynänen M. and Klapuri A. (2008). “Automatic Transcription of Melody, Bass Line, and Chords in Polyphonic Music”. Computer Music Journal, Vol. 32, No. 3, pp. 72-86.

 Suyoto I. (2008). CROSS-DOMAIN CONTENT-BASED RETRIEVAL OF AUDIO MUSIC THROUGH TRANSCRIPTION. PhD Thesis, School of Computer Science and Information Technology, College of Science, Engineering, and Technology, RMIT University, Melbourne, Victoria, Australia

 Suyoto I., Uitdenbogerd A. and Scholer F. (2008). “Searching Musical Audio Using Symbolic Queries”. IEEE Transactions on audio, Speech and Language Processing, Vol. 16, No. 2, pp. 372-381.

Year 2007 : 7 citations

 Cao C., Li M., Liu J. and Yan Y. (2007). “Singing Melody Extraction in Polyphonic Music by Harmonic Tracking”. Proceedings of the International Conference on Music Information Retrieval – ISMIR’2007, Vienna, Austria, September 2007.

 Chien Y.-R. and Wang H.-M. (2007). “Vocality-Sensitive Melody Extraction from Popular Songs”. 2nd Beijing-Hong Kong International Doctoral Forum.

 Dittmar C., Dressler K. and Rosenbauer K. (2007). “A Toolbox for Automatic Transcription of Polyphonic Music”, Audio Mostly 2007 - 2nd Conference on Interaction with Sound, Röntgenbau, Ilmenau, Germany.

 Duda A., Nurnberger A., Stober S. (2007). “Towards Query By Singing/Humming on Audio Databases”. Proceedings of the International Conference on Music Information Retrieval – ISMIR’2007, Vienna, Austria, September 2007.

 Hanna P. and Ferraro P. (2007). “Polyphonic Music Retrieval by Local Edition of Quotiented Sequences”. International Workshop on Content-Based Multimedia Indexing – CBMI’07.

 Wright D. P. (2007). “Analysis and Interpretation of Music for Dance”, BSc Thesis, University of Teesside, Schol of Computing.

 Suyoto I., Uitdenbogerd A. and Scholer F. (2007). “Effective Retrieval of Polyphonic Audio with Polyphonic Symbolic Queries”. Proceedings of the International Workshop on Multimedia Information, Augsburg, Germany.

Year 2006 : 6 citations

 de Cheveigné A. (2006). “Multiple F0 Estimation”, in Computational Auditory Scene Analysis: Principles, Algorithms, and Applications, edited by DeLiang Wang and Guy J. Brown, John Wiley and sons.

 Ellis D. and Poliner G. (2006). “Classification-Based Melody Transcription”. Machine Learning Journal, Vol. 65, No. 2-3. Pp. 439-456.

 López, Daniel Hernández, and D. Daniel Hernández López. "Análisis de las características acústicas de la percusión humana." Information Retrieval 1.1 (2006): 1-90.

 Orio N. (2006). “Music Retrieval: A Tutorial and Review”. Foundations and Trends in Information Retrieval, Vol. 1, No. 1, pp. 1-90, November 2006.

 Ryynänen M. and Klapuri A. (2006). “Transcription of the Singing Melody in Polyphonic Music”. Proceedings of the International Conference on Music Information Retrieval – ISMIR’2006, Victoria, Canada, October 2006.

 Ryynänen M. and Klapuri A. (2006). “Transcription of the Singing Melody in Polyphonic Music (MIREX 2006)”. Proceedings of the Music Information Retrieval Exchange – MIREX’2006.