Title :
Flexible Noisy Text Correction
Author :
Sariev, Andrey ; Nenchev, Vladislav ; Gerdjikov, Stefan ; Mitankin, Petar ; Ganchev, Hristo ; Mihov, Stoyan ; Tinchev, Tinko
Abstract :
We present a new general and language independent approach to the noisy text correction problem developed and implemented in the framework of the CULTURA project. We briefly describe the core candidate generator, REBELS, the complete system concept, its efficient implementation based on functional automata and its immediate applications. The quality of the whole system is empirically established in different experimental settings where language and noise sources are varied.
Keywords :
automata theory; error correction; language translation; learning (artificial intelligence); text analysis; text editing; CULTURA project; REBELS; complete system concept; core candidate generator; flexible noisy text correction; functional automata; language independent approach; Automata; Computational modeling; Nickel; Noise; Noise measurement; Optical character recognition software; Standards; OCR correction; finite state automata; historical texts normalisation; noisy-text correction; statistical methods;
Conference_Titel :
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location :
Tours
Print_ISBN :
978-1-4799-3243-6
DOI :
10.1109/DAS.2014.12