DocumentCode
178656
Title
Transcribing vocal expression from polyphonic music
Author
Ikemiya, Yukara ; Itoyama, Katsutoshi ; Okuno, Hiroshi G.
Author_Institution
Grad. Sch. of Inf., Kyoto Univ., Kyoto, Japan
fYear
2014
fDate
4-9 May 2014
Firstpage
3127
Lastpage
3131
Abstract
A method for transcribing vocal expressions such as vibrato, glissando, and kobushi separately from polyphonic music is described. The expressions appear as fluctuation in the fundamental frequency contour of the singing voice. They can be used for search and retrieval of music and for expressive singing voice synthesis based on singing style since they strongly reflect the individuality of the singer. The fundamental frequency contour of the singing voice is estimated using the Viterbi algorithm with limitation from a corresponding note sequence. Next, the notes are aligned with the fundamental frequency sequence temporally. Finally, each expression is identified and parameterized in accordance with designed rules. Experiments demonstrated that this method can transcribe expressions in the singing voice from commercial recordings.
Keywords
information retrieval; music; speech synthesis; Viterbi algorithm; expressive singing voice synthesis; fundamental frequency contour; music information retrieval; note sequence; polyphonic music; vocal expressions; Accuracy; Cost function; Estimation; Frequency estimation; Hidden Markov models; Speech; Time-frequency analysis; F0 estimation; Singing voice analysis; Vocal expression identification / transcription;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854176
Filename
6854176
Link To Document