DocumentCode
3063704
Title
A speech spectrogram expert
Author
Johannsen ; MacAllister, J. ; Michalek, T. ; Ross, S.
Author_Institution
Verbex, Bedford, Ma
Volume
8
fYear
1983
fDate
30407
Firstpage
746
Lastpage
749
Abstract
Various authors have pointed out that humans can become quite adept at deriving phonetic transcriptions from speech spectrograms (as good as 90% accuracy at the phoneme level). In this paper, we describe an expert system which attempts to simulate this performance. The speech spectrogram expert (SPEX) is actually a society made up of three experts: a 2-dimensional vision expert, an acoustic-phonetic expert, and a phonetics expert. The visual reasoning expert finds important visual features of the spectrogram. The acoustic-phonetic expert reasons about how visual features relate to phonemes, and about how phonemes change visually in different contexts. The phonetics expert reasons about allowable phoneme sequences and transformations, and deduces an English spelling for phoneme strings. The speech spectrogram expert is highly interactive, allowing users to investigate hypotheses and edit rules.
Keywords
Auditory system; Engines; Expert systems; Eyes; Humans; Performance analysis; Spectrogram; Speech processing; Speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP '83.
Type
conf
DOI
10.1109/ICASSP.1983.1172057
Filename
1172057
Link To Document