Title :
Modeling inter-cluster and intra-cluster discrimination among triphones
Author :
Ko, Tae Kuk ; Mak, Brian ; Dongpeng Chen
Author_Institution :
Dept. of Comput. Sci. & Eng., Hong Kong Univ. of Sci. & Technol., Hong Kong, China
Abstract :
Discriminative training is a major contribution to the success of automatic speech recognition (ASR) in the last decade. However, since most ASR systems employ state tying which ties `similar´ states in a cluster, discriminative training may only improve inter-cluster discrimination, but states belonging to the same cluster obviously cannot be distinguished. Recently, the concept of distinct acoustic modeling was investigated by a new acoustic modeling method called eigentriphone modeling. In the new method, states are grouped, but not tied, into separate clusters, and the difference vectors between mean vectors of the member states and their cluster center vector are modeled by a basis approach using a set of eigenvectors which are also called eigentriphones. This paper investigates whether the intercluster discrimination achieved by discriminative training and intra-cluster discrimination obtained by eigentriphone modeling are additive. In a simple procedure that is applied to each state cluster, the discriminatively trained cluster center vector is integrated with the difference vectors trained by eigentriphone modeling to construct the final mean vectors of the distinct states in the cluster. Experimental evaluation on the WSJ0 5K task shows that the two techniques are indeed additive.
Keywords :
eigenvalues and eigenfunctions; speech recognition; ASR systems; WSJ0 5K task; automatic speech recognition; difference vectors; discriminative training; discriminatively trained cluster center vector; distinct acoustic modeling; eigentriphone modeling; eigenvectors; intercluster discrimination; intracluster discrimination; mean vectors; Acoustics; Hidden Markov models; Speech; Speech processing; Speech recognition; Training; Vectors; Eigentriphone; adaptation; discriminative training; regularization;
Conference_Titel :
Chinese Spoken Language Processing (ISCSLP), 2014 9th International Symposium on
Conference_Location :
Singapore
DOI :
10.1109/ISCSLP.2014.6936683