Title :
Archaisms and neologisms identification in texts
Author :
Costin-Gabriel, Chiru ; Rebedea, Traian Eugen
Author_Institution :
Comput. Sci. Dept., Politeh. Univ. of Bucharest, Bucharest, Romania
Abstract :
In this paper we present an application for identifying archaisms and neologisms in texts. The application also provides the ability to view graphically the evolution trends of these words for a better interpretation of the results. The presented solution consists of two phases: the learning phase in which we identify the general evolution trends of three categories of words (archaisms, neologisms and common words) and the classification phase in which we label new words with their corresponding category. For both phases, the application requires Internet access because it is using the Google Books N-gram Viewer to generate the images that back up the decisions.
Keywords :
Internet; natural language processing; pattern classification; text analysis; Google books n-gram viewer; Internet access; archaisms identification; classification phase; learning phase; natural language processing; neologisms identification; text mining; Dictionaries; Google; Market research; Natural language processing; Principal component analysis; Standards; Transforms; NLP; PCA; archaisms; neologisms; text mining;
Conference_Titel :
RoEduNet Conference 13th Edition: Networking in Education and Research Joint Event RENAM 8th Conference, 2014
Conference_Location :
Chisinau
Print_ISBN :
978-1-4799-6860-2
DOI :
10.1109/RoEduNet-RENAM.2014.6955312