DocumentCode
178418
Title
Very fast unit selection using Viterbi search with zero-concatenation-cost chains
Author
Kala, Jiri ; Matousek, Jindrich
Author_Institution
Dept. of Cybern., Univ. of West Bohemia, Pilsen, Czech Republic
fYear
2014
fDate
4-9 May 2014
Firstpage
2569
Lastpage
2573
Abstract
This paper introduces a very fast heuristic search algorithm for unit-selection speech synthesis. The algorithm modifies commonly used Viterbi search framework by introducing zero-concatenation-cost (ZCC) chains of unit candidates that immediately neighbored in a source speech corpus. ZCC chains are preferred as they represent perfect speech segment concatenations (so there is no need to compute concatenation costs inside the chains) unless a so-called target specification is violated. The number of ZCC chains is reduced based on statistics calculated upon the synthesis of a large number of utterances. ZCC chains are then combined with single unit candidates to fill possible gaps in the sequence of candidates. The proposed method reduces the computational load of a unit selection system up to hundreds of times. According to listening tests, the quality of synthetic speech was not deteriorated.
Keywords
search problems; speech synthesis; Viterbi search framework; ZCC chains; concatenation costs; perfect speech segment concatenations; source speech corpus; synthetic speech; target specification; unit selection system; unit-selection speech synthesis; very fast heuristic search algorithm; very fast unit selection; zero-concatenation-cost chains; Conferences; Context; Cost function; Speech; Speech synthesis; Viterbi algorithm; Viterbi algorithm; non-uniform units; speech synthesis; unit selection; zero cost concatenation;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on
Conference_Location
Florence
Type
conf
DOI
10.1109/ICASSP.2014.6854064
Filename
6854064
Link To Document