Title :
Transformation-based named entity extraction from spoken content for personal memory aid
Author_Institution :
Dept. of Comput. Sci. & Eng., Sogang Univ., Seoul, South Korea
fDate :
11/1/2010 12:00:00 AM
Abstract :
This paper proposes an automatic transformation-based rule-based Named Entity (NE) extraction from spoken content for personal memory aid. The proposed automatic rule inference based on transformation has shown itself to be a viable alternative to the stochastic approach in NE extraction, while retaining the advantages of a rule-based approach: lightweight memory requirements and computation, and extensible to the inclusion of personal information. The performance of the proposed system is compared with one of the successful stochastic systems. When only the sequences of words are available, both systems show almost equal performance as is also the case with additional information such as punctuation, capitalisation and name lists. The best results of the proposed system were measured at 0.9134 in terms of F-measure. In cases where input texts are corrupted by speech recognition errors, the performance of both systems is degraded by almost the same level (0.0062 of F-measure loss per 1% of additional speech recognition error). However, the proposed system requires only 94KB and simple computation on the transition diagram of a finite automata.
Keywords :
inference mechanisms; speech recognition; F-measure; automatic rule inference; capitalisation; finite automata; memory requirements; name lists; personal information; personal memory aid; punctuation; speech recognition errors; spoken content; transformation-based rule-based named entity extraction; transition diagram; word sequences; Data mining; Feature extraction; Hidden Markov models; Speech; Speech recognition; Stochastic systems; Training data; Named entity extraction, Transformation-based, Spoken content, Personal memory aid.;
Journal_Title :
Consumer Electronics, IEEE Transactions on
DOI :
10.1109/TCE.2010.5681147