DocumentCode
259221
Title
Classification and Clustering English Writing Errors Based on Native Language
Author
Flanagan, Brendan ; Chengjiu Yin ; Suzuki, Takumi ; Hirokawa, Sachio
Author_Institution
Grad. Sch. of Inf. Sci. & Electr. Eng., Kyushu Univ., Fukuoka, Japan
fYear
2014
fDate
Aug. 31 2014-Sept. 4 2014
Firstpage
318
Lastpage
323
Abstract
It is important for language learners to determine and reflect on their writing errors in order to overcome weaknesses. Each language learner has their own unique writing error characteristics and therefore has different learning needs. In this paper, we analyze the writing errors of foreign language learners on the language learning SNS website Lang-8 to investigate the characteristics of errors by native language. 142,465 sentences were collected from Lang-8 for analysis. For each native language, the predicted scores of 15 error categories from SVM machine learning models are used as a vector representation of each sentence. These score vectors are then clustered to determine error co-occurrence within the same sentence. The results were then analyzed to determine the error characteristics of different native languages.
Keywords
computer aided instruction; learning (artificial intelligence); natural language processing; pattern classification; pattern clustering; social networking (online); support vector machines; vectors; English writing errors; Lang-8; SNS Web site; SVM; classification; clustering; error characteristics; machine learning; native language; vector representation; Data models; Educational institutions; Principal component analysis; Support vector machines; Vectors; Writing; Language learning; machine learning; native language characteristics; writing error categories;
fLanguage
English
Publisher
ieee
Conference_Titel
Advanced Applied Informatics (IIAIAAI), 2014 IIAI 3rd International Conference on
Conference_Location
Kitakyushu
Print_ISBN
978-1-4799-4174-2
Type
conf
DOI
10.1109/IIAI-AAI.2014.72
Filename
6913316
Link To Document