DocumentCode :
2788778
Title :
Unsupervised knowledge acquisition for Extracting Named Entities from speech
Author :
Bechet, Frederic ; Charton, Eric
Author_Institution :
Aix-Marseille Univ., Marseille, France
fYear :
2010
fDate :
14-19 March 2010
Firstpage :
5338
Lastpage :
5341
Abstract :
This paper presents a Named Entity Recognition (NER) method dedicated to process speech transcriptions. The main principle behind this method is to collect in an unsupervised way lexical knowledge for all entries in the ASR lexicon. This knowledge is gathered with two methods: by automatically extracting NEs on a very large set of textual corpora and by exploiting directly the structure contained in the Wikipedia resource. This lexical knowledge is used to update the statistical models of our NER module based on a mixed approach with generative models (Hidden Markov Models - HMM) and discriminative models (Conditional Random Field - CRF). This approach has been evaluated within the French ESTER 2 evaluation program and obtained the best results at the NER task on ASR transcripts.
Keywords :
hidden Markov models; knowledge acquisition; speech recognition; ASR lexicon; French ESTER 2 evaluation program; HMM; Wikipedia resource; conditional random field; hidden Markov models; lexical knowledge; named entity extraction; named entity recognition; speech transcriptions; statistical models; textual corpora; unsupervised knowledge acquisition; Automatic speech recognition; Data mining; Hidden Markov models; Knowledge acquisition; Ontologies; Radio broadcasting; Speech processing; Speech recognition; TV broadcasting; Wikipedia; Information retrieval; Named Entity; Speech recognition; Statistical Tagging Models;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
Conference_Location :
Dallas, TX
ISSN :
1520-6149
Print_ISBN :
978-1-4244-4295-9
Electronic_ISBN :
1520-6149
Type :
conf
DOI :
10.1109/ICASSP.2010.5494962
Filename :
5494962
Link To Document :
بازگشت