DocumentCode
1086232
Title
An objective parallel evaluator of segmentation/Classification performance for multiple systems
Author
Silverman, Harvey F. ; Dixon, N. Rex
Author_Institution
IBM Thomas J. Watson Research Center, Yorktown Heights, N.Y.
Volume
23
Issue
1
fYear
1975
fDate
2/1/1975 12:00:00 AM
Firstpage
92
Lastpage
99
Abstract
A major difficulty in the development of methodologies for segmentation and classification in automatic recognition of continuous speech is the determination of objective, reliable performance statistics. Compounding this difficulty is the large amount of data necessary to make reasonably accurate performance estimates. The system to be described provides for concurrent objective evaluation of up to five independent segmentation/classification methods against a single, carefully transcribed referent. A basic assumption of the evaluator is that the systems to be compared, as well as the referent, can each use the same digital data as input. Violation of this assumption would lead to time-shift errors, and objective comparison among systems would be exceedingly difficult. For segmentation, the evaluator provides first-order statistics, at the phonetic, class and summary levels, in the form of highly concise tables for the following four types of errors: 1) Missed events; 2) Adventitious events; 3) Misplaced events; and 4) Adventitious and misplaced events. For classification, first-order statistics are derived in the form of confusion matrices at the phonetic, class and summary levels. While the system is still in the developmental process, it is operational and currently used. Examples of output will be presented.
Keywords
Automatic speech recognition; Error analysis; Fatigue; Speech analysis; Statistics; System performance; Variable speed drives;
fLanguage
English
Journal_Title
Acoustics, Speech and Signal Processing, IEEE Transactions on
Publisher
ieee
ISSN
0096-3518
Type
jour
DOI
10.1109/TASSP.1975.1162643
Filename
1162643
Link To Document