DocumentCode :
153306
Title :
Flexible Noisy Text Correction
Author :
Sariev, Andrey ; Nenchev, Vladislav ; Gerdjikov, Stefan ; Mitankin, Petar ; Ganchev, Hristo ; Mihov, Stoyan ; Tinchev, Tinko
fYear :
2014
fDate :
7-10 April 2014
Firstpage :
31
Lastpage :
35
Abstract :
We present a new general and language independent approach to the noisy text correction problem developed and implemented in the framework of the CULTURA project. We briefly describe the core candidate generator, REBELS, the complete system concept, its efficient implementation based on functional automata and its immediate applications. The quality of the whole system is empirically established in different experimental settings where language and noise sources are varied.
Keywords :
automata theory; error correction; language translation; learning (artificial intelligence); text analysis; text editing; CULTURA project; REBELS; complete system concept; core candidate generator; flexible noisy text correction; functional automata; language independent approach; Automata; Computational modeling; Nickel; Noise; Noise measurement; Optical character recognition software; Standards; OCR correction; finite state automata; historical texts normalisation; noisy-text correction; statistical methods;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location :
Tours
Print_ISBN :
978-1-4799-3243-6
Type :
conf
DOI :
10.1109/DAS.2014.12
Filename :
6830964
Link To Document :
بازگشت