DocumentCode :
244901
Title :
Dual-Domain Hierarchical Classification of Phonetic Time Series
Author :
Hamooni, Hossein ; Mueen, Abdullah
Author_Institution :
Dept. of Comput. Sci., Univ. of New Mexico, Albuquerque, NM, USA
fYear :
2014
fDate :
14-17 Dec. 2014
Firstpage :
160
Lastpage :
169
Abstract :
Phonemes are the smallest units of sound produced by a human being. Automatic classification of phonemes is a well-researched topic in linguistics due to its potential for robust speech recognition. With the recent advancement of phonetic segmentation algorithms, it is now possible to generate datasets of millions of phonemes automatically. Phoneme classification on such datasets is a challenging data mining task because of the large number of classes (over a hundred) and complexities of the existing methods. In this paper, we introduce the phoneme classification problem as a data mining task. We propose a dual-domain (time and frequency) hierarchical classification algorithm. Our method uses a Dynamic Time Warping (DTW) based classifier in the top layers and time-frequency features in the lower layer. We cross-validate our method on phonemes from three online dictionaries and achieved up to 35% improvement in classification compared to existing techniques. We provide case studies on classifying accented phonemes and speaker invariant phoneme classification.
Keywords :
data mining; linguistics; signal classification; speech recognition; time series; DTW based classifier; automatic classification; data mining; datasets; dual-domain hierarchical classification; dynamic time warping; frequency hierarchical classification algorithm; linguistics; online dictionaries; phonetic segmentation algorithms; phonetic time series; robust speech recognition; sound units; speaker invariant phoneme classification; time hierarchical classification algorithm; time-frequency features; Accuracy; Dictionaries; Robustness; Speech; Speech recognition; Standards; Time series analysis; Big data; Phoneme classification; time series mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining (ICDM), 2014 IEEE International Conference on
Conference_Location :
Shenzhen
ISSN :
1550-4786
Print_ISBN :
978-1-4799-4303-6
Type :
conf
DOI :
10.1109/ICDM.2014.92
Filename :
7023333
Link To Document :
بازگشت