DocumentCode
1262326
Title
Bayesian classification for data from the same unknown class
Author
Huang, Hung-Ju ; Chun-Nan Hsu
Author_Institution
Dept. of Commun. Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume
32
Issue
2
fYear
2002
fDate
4/1/2002 12:00:00 AM
Firstpage
137
Lastpage
145
Abstract
In this paper, we address the problem of how to classify a set of query vectors that belong to the same unknown class. Sets of data known to be sampled from the same class are naturally available in many application domains, such as speaker recognition. We refer to these sets as homologous sets. We show how to take advantage of homologous sets in classification to obtain improved accuracy over classifying each query vector individually. Our method, called homologous naive Bayes (HNB), is based on the naive Bayes classifier, a simple algorithm shown to be effective in many application domains. RNB uses a modified classification procedure that classifies multiple instances as a single unit. Compared with a voting method and several other variants of naive Bayes classification, HNB significantly outperforms these methods in a variety of test data sets, even when the number of query vectors in the homologous sets is small. We also report a successful application of HNB to speaker recognition. Experimental results show that HNB can achieve classification accuracy comparable to the Gaussian mixture model (GMM), the most widely used speaker recognition approach, while using less time for both training and classification
Keywords
Bayes methods; set theory; speaker recognition; Gaussian mixture model; classification accuracy; homologous naive Bayes algorithm; homologous sets; machine learning; naive Bayes classifier; query vectors; speaker recognition; Bayesian methods; Helium; Information science; Laboratories; Machine learning; Pattern recognition; Speaker recognition; Speech processing; Testing; Voting;
fLanguage
English
Journal_Title
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
Publisher
ieee
ISSN
1083-4419
Type
jour
DOI
10.1109/3477.990870
Filename
990870
Link To Document