Title :
Text-independent speaker identification using binary-pair partitioned neural networks
Author :
Rudasi, Laszlo ; Zahorian, Stephen A.
Author_Institution :
Dept. of Electr. & Comput. Eng., Old Dominion Univ., Norfolk, VA, USA
Abstract :
The N-way speaker identification task is partitioned into N*(N-1)/2 binary-pair classifications. The binary-pair classifications are performed with small neural nets, each trained to make independent binary decisions on small fragments of speech data. Three issues were investigated concerning optimally combining a large number of fragmentary binary decisions into a single N-way decision: (1) incorporating speech energy and phonetic content information to compute an improved probability measure at the individual speech frame level; (2) combining binary frame-level decisions into a binary segment-level decision; and (3) combining the binary segment-level decisions into a single N-way segment level decision. It was shown that the two-way classifiers can be combined to achieve 100% speaker identification performance for large speaker populations
Keywords :
neural nets; speech recognition; binary-pair classifications; binary-pair partitioned neural networks; frame-level decisions; phonetic content information; segment-level decisions; speech data; speech energy; text-independent speaker identification; Automatic speech recognition; Energy measurement; Humans; Neural networks; Performance evaluation; Power system reliability; Resonance; Signal processing; Speaker recognition; Speech recognition;
Conference_Titel :
Neural Networks, 1992. IJCNN., International Joint Conference on
Conference_Location :
Baltimore, MD
Print_ISBN :
0-7803-0559-0
DOI :
10.1109/IJCNN.1992.227240