DocumentCode
350031
Title
Preserving text categorization through translation
Author
Bleyberg, Maria Zamfir
Author_Institution
Dept. of Comput. & Inf. Sci., Kansas State Univ., Manhattan, KS, USA
Volume
5
fYear
1999
fDate
1999
Firstpage
912
Abstract
We treat natural language documents as we treat strongly-typed functional programming languages by introducing semantic categories as types. We use axioms to define primitive semantic categories and inference rules to capture the meaningful relationships among the primitive semantic categories. Primitive categories are nodes of a sense type decision tree. Axioms and inference rules are used to construct compound categories, validate category hypotheses, and eliminate ambiguities. The same categorization is obtained when this approach is applied to a text in a given natural language or to its translation into another language if a one-to-one mapping can be defined between the axioms and inference rules associated to the initial language and the axioms and inference rules associated to the other language
Keywords
computational linguistics; data mining; decision trees; inference mechanisms; language translation; natural languages; text analysis; ambiguities; axioms; category hypotheses; compound categories; inference rules; lambda calculus; meaningful relationships; natural language documents; natural language text; one-to-one mapping; primitive semantic categories; semantic categories; sense type decision tree; strongly-typed functional programming languages; text categorization; text mining; translation; Application software; Decision trees; Electronic mail; Functional programming; Law; Logic; Marine vehicles; Natural languages; Text categorization; Text mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
Conference_Location
Tokyo
ISSN
1062-922X
Print_ISBN
0-7803-5731-0
Type
conf
DOI
10.1109/ICSMC.1999.815675
Filename
815675
Link To Document