Title :
A large synchronous corpus as monitoring corpus: Some comparative content analysis of Chinese and Japanese language developments
Author :
Tsou, Benjamin K. ; Chin, Andy C.
Author_Institution :
Res. Centre on Linguistics & Language Inf. Sci., Hong Kong Inst. of Educ., Hong Kong, China
Abstract :
Large, appropriate corpora are uncommon, but they can provide important resources for wide-ranging efforts in natural language processing, from contextualized or localized speech and text input to automatic patent translation. They also provide lesser-known but rich resources for human and automatic content analysis, such as sentiment analysis of texts and product reviews. Furthermore, they can function as a monitoring corpus and enhance the human-centered communication environment by allowing more substantive introspection and comparison of content, rather than of linguistic form, in communication.
Keywords :
natural language processing; speech processing; Chinese language development; Japanese language developments; automatic content analysis; automatic patent translation; comparative content analysis; contextualized speech; large synchronous corpus; localized speech; monitoring corpus; sentiment analysis; text input; Chinese; Japanese; homothematic corpus; linguistic and social variation; synchronous corpus
Conference_Title :
Universal Communication Symposium (IUCS), 2010 4th International
Conference_Location :
Beijing
Print_ISBN :
978-1-4244-7821-7
DOI :
10.1109/IUCS.2010.5666763