Title :
LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics
Author :
Fujihara, Hiromasa ; Goto, Masataka ; Ogata, J. ; Okuno, Hiroshi G.
Author_Institution :
Nat. Inst. of Adv. Ind. Sci. & Technol. (AIST), Tsukuba, Japan
Abstract :
This paper describes a system that can automatically synchronize polyphonic musical audio signals with their corresponding lyrics. Although methods for synchronizing monophonic speech signals and corresponding text transcriptions by using Viterbi alignment techniques have been proposed, these methods cannot be applied to vocals in CD recordings because vocals are often overlapped by accompaniment sounds. In addition to a conventional method for reducing the influence of the accompaniment sounds, we therefore developed four methods to overcome this problem: a method for detecting vocal sections, a method for constructing robust phoneme networks, a method for detecting fricative sounds, and a method for adapting a speech-recognizer phone model to segregated vocal signals. We then report experimental results for each of these methods and also describe our music playback interface that utilizes our system for synchronizing music and lyrics.
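The Viterbi alignment mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes per-frame log-likelihoods for each phoneme of the lyrics are already available (the hypothetical input `log_likes`), and it omits accompaniment reduction, vocal-section detection, fricative detection, and phone-model adaptation. The function name `forced_align` is likewise an assumption made for illustration.

```python
import math


def forced_align(log_likes):
    """Align frames to a left-to-right phoneme sequence with Viterbi search.

    log_likes[t][s] is an assumed, precomputed log-likelihood of audio
    frame t under the s-th phoneme of the lyrics. The returned list gives,
    for every frame, the index of the phoneme it is aligned to.
    """
    num_frames = len(log_likes)
    num_states = len(log_likes[0])
    if num_frames < num_states:
        raise ValueError("need at least one frame per phoneme")

    neg_inf = -math.inf
    score = [[neg_inf] * num_states for _ in range(num_frames)]
    back = [[0] * num_states for _ in range(num_frames)]

    # The alignment must start in the first phoneme.
    score[0][0] = log_likes[0][0]

    for t in range(1, num_frames):
        for s in range(num_states):
            stay = score[t - 1][s]
            move = score[t - 1][s - 1] if s > 0 else neg_inf
            # Either remain in the same phoneme or advance by exactly one.
            if move > stay:
                best, prev = move, s - 1
            else:
                best, prev = stay, s
            if best > neg_inf:
                score[t][s] = best + log_likes[t][s]
            back[t][s] = prev

    # The alignment must end in the last phoneme; trace the path backwards.
    path = [num_states - 1]
    for t in range(num_frames - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


# Toy input: two phonemes, five frames; the first two frames favor phoneme 0.
toy = [[0, -5], [0, -5], [-5, 0], [-5, 0], [-5, 0]]
alignment = forced_align(toy)
```

Given a frame rate, the frame index at which the path first enters each phoneme could be converted to a start time for the corresponding lyric segment.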
Keywords :
Viterbi detection; audio signal processing; music; synchronisation; LyricSynchronizer; Viterbi alignment techniques; automatic synchronization system; fricative sound detection; lyrics; monophonic speech signals; music playback interface; phoneme networks; polyphonic musical audio signals; speech-recognizer phone model; vocal section detection; Feature extraction; Harmonic analysis; Hidden Markov models; Periodic structures; Power system harmonics; Speech recognition; Viterbi algorithm; Alignment; singing voice; vocal
Journal_Title :
Selected Topics in Signal Processing, IEEE Journal of
DOI :
10.1109/JSTSP.2011.2159577