Title :
Morphologic non-word error detection
Author :
Bressan, Stéphane ; Irawan, Riky
Author_Institution :
Nat. Univ. of Singapore, Singapore
fDate :
30 Aug.-3 Sept. 2004
Abstract :
Writing and sending e-mails and short messages (SMS) has become one of the most pervasive activities in our daily life. Whether emails from our computers, short messages from our portable phones or both from our portable digital assistants, there is no occasion that does not deserve a text message: "Dear Colleagues, please find attached to this e-mail...", "C U at 9pm?", "Forget to turn return DVD", etc. Many have warned that the typos, misspellings, grammatical errors and other linguistic indelicacies, which are commonly accepted in these messages, are announcing the decadence of human languages and communication. One answer to these reservations and critics is to provide the tools for the automatic detection and correction of such errors. We are interested in the problem of the detection of nonwords. We propose and evaluate two families of new methods based on extended n-grams and morpheme, respectively. We show that most methods we propose yield a better performance than the state of the art technique.
Keywords :
electronic mail; electronic messaging; error correction; error detection; linguistics; natural languages; word processing; automatic error detection; e-mail sending; error correction; human languages; linguistic indelicacy; morphologic nonword error detection; portable digital assistants; portable phones; short messages; Computer errors; Cultural differences; Dictionaries; Electronic mail; Error correction; Humans; Information systems; Natural languages; Portable computers; Writing;
Conference_Titel :
Database and Expert Systems Applications, 2004. Proceedings. 15th International Workshop on
Print_ISBN :
0-7695-2195-9
DOI :
10.1109/DEXA.2004.1333445