• DocumentCode
    350031
  • Title

    Preserving text categorization through translation

  • Author

    Bleyberg, Maria Zamfir

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Kansas State Univ., Manhattan, KS, USA
  • Volume
    5
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    912
  • Abstract
    We treat natural language documents the way we treat strongly-typed functional programming languages, by introducing semantic categories as types. We use axioms to define primitive semantic categories and inference rules to capture the meaningful relationships among the primitive semantic categories. Primitive categories are nodes of a sense type decision tree. Axioms and inference rules are used to construct compound categories, validate category hypotheses, and eliminate ambiguities. The same categorization is obtained when this approach is applied to a text in a given natural language or to its translation into another language, provided that a one-to-one mapping can be defined between the axioms and inference rules associated with the initial language and those associated with the other language.
  • Keywords
    computational linguistics; data mining; decision trees; inference mechanisms; language translation; natural languages; text analysis; ambiguities; axioms; category hypotheses; compound categories; inference rules; lambda calculus; meaningful relationships; natural language documents; natural language text; one-to-one mapping; primitive semantic categories; semantic categories; sense type decision tree; strongly-typed functional programming languages; text categorization; text mining; translation; Application software; Decision trees; Electronic mail; Functional programming; Law; Logic; Marine vehicles; Natural languages; Text categorization; Text mining
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Systems, Man, and Cybernetics, 1999. IEEE SMC '99 Conference Proceedings. 1999 IEEE International Conference on
  • Conference_Location
    Tokyo
  • ISSN
    1062-922X
  • Print_ISBN
    0-7803-5731-0
  • Type

    conf

  • DOI
    10.1109/ICSMC.1999.815675
  • Filename
    815675
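
The idea in the abstract can be illustrated with a small sketch. It is not the paper's system. The word lists, the category names, the left-to-right rule application, and the helpers `categorize`, `is_one_to_one`, and `rules_correspond` are all hypothetical, chosen only to show how a text and its translation could receive corresponding categories when the axioms and inference rules of the two languages are in one-to-one correspondence.

```python
from typing import Dict, List, Optional, Tuple

Axioms = Dict[str, str]             # word -> primitive semantic category
Rules = Dict[Tuple[str, str], str]  # (category, category) -> compound category

# Hypothetical axioms and inference rules for an English fragment.
EN_AXIOMS: Axioms = {"ship": "Vessel", "sails": "Motion"}
EN_RULES: Rules = {("Vessel", "Motion"): "Navigation"}

# Hypothetical counterparts for a French fragment.
FR_AXIOMS: Axioms = {"navire": "Navire", "navigue": "Mouvement"}
FR_RULES: Rules = {("Navire", "Mouvement"): "Navigation_FR"}

# Assumed one-to-one mapping between the two category systems.
EN_TO_FR: Dict[str, str] = {
    "Vessel": "Navire",
    "Motion": "Mouvement",
    "Navigation": "Navigation_FR",
}


def categorize(words: List[str], axioms: Axioms, rules: Rules) -> Optional[str]:
    """Assign a category to a word sequence, or None if it cannot be typed.

    Axioms give each word a primitive category; inference rules then
    combine categories pairwise, left to right (a simplifying assumption).
    """
    categories = []
    for word in words:
        if word not in axioms:
            return None  # no axiom covers this word
        categories.append(axioms[word])
    if not categories:
        return None
    result = categories[0]
    for nxt in categories[1:]:
        if (result, nxt) not in rules:
            return None  # no rule validates this combination
        result = rules[(result, nxt)]
    return result


def is_one_to_one(mapping: Dict[str, str]) -> bool:
    """True if no two source categories share a target category."""
    return len(set(mapping.values())) == len(mapping)


def rules_correspond(src: Rules, dst: Rules, mapping: Dict[str, str]) -> bool:
    """True if translating every source rule yields exactly the target rules."""
    translated = {(mapping[a], mapping[b]): mapping[c] for (a, b), c in src.items()}
    return translated == dst


en_category = categorize(["ship", "sails"], EN_AXIOMS, EN_RULES)
fr_category = categorize(["navire", "navigue"], FR_AXIOMS, FR_RULES)
preserved = en_category is not None and EN_TO_FR[en_category] == fr_category
```

In this toy setting `preserved` holds because `EN_TO_FR` is one-to-one and carries `EN_RULES` exactly onto `FR_RULES`. If either condition fails, the sketch gives no such guarantee.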