Title :
Preserving text categorization through translation
Author :
Bleyberg, Maria Zamfir
Author_Institution :
Dept. of Comput. & Inf. Sci., Kansas State Univ., Manhattan, KS, USA
Abstract :
We treat natural language documents as we treat strongly-typed functional programming languages by introducing semantic categories as types. We use axioms to define primitive semantic categories and inference rules to capture the meaningful relationships among the primitive semantic categories. Primitive categories are nodes of a sense type decision tree. Axioms and inference rules are used to construct compound categories, validate category hypotheses, and eliminate ambiguities. The same categorization is obtained when this approach is applied to a text in a given natural language or to its translation into another language if a one-to-one mapping can be defined between the axioms and inference rules associated to the initial language and the axioms and inference rules associated to the other language
Keywords :
computational linguistics; data mining; decision trees; inference mechanisms; language translation; natural languages; text analysis; ambiguities; axioms; category hypotheses; compound categories; inference rules; lambda calculus; meaningful relationships; natural language documents; natural language text; one-to-one mapping; primitive semantic categories; semantic categories; sense type decision tree; strongly-typed functional programming languages; text categorization; text mining; translation; Application software; Decision trees; Electronic mail; Functional programming; Law; Logic; Marine vehicles; Natural languages; Text categorization; Text mining;
Conference_Titel :
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location :
Tokyo
Print_ISBN :
0-7803-5731-0
DOI :
10.1109/ICSMC.1999.815675