• DocumentCode
    1586669
  • Title

    Text classification in fragmented sublanguage domains

  • Author

    Frail, Robert P. ; Freedman, Roy S.

  • Author_Institution
    Dept. of Comput. Sci., Polytech. Univ., New York, NY, USA
  • fYear
    1991
  • Firstpage
    33
  • Lastpage
    36
  • Abstract
    The unique problems involved in developing text classification systems for texts that have low conceptual predictability are addressed. The authors present a shell called the FLUE (fragmented language understanding environment), which is capable of generating applications in fragmented sublanguage domains. The FLUE combines an expressive concept representation with a robust parsing technique called piecewise parsing. A common source of classification failure is unrecognized lexemes. The representation of concepts leverages differences in word class restrictions in order to learn unknown lexemes. The parser´s parallel search for concepts also gives it a large measure of immunity to the conceptual unpredictabilities of fragmented texts. Yet, the technique can be scaled to accommodate more grammatical texts, something not usually possible for other systems. Although the authors demonstrate the technique on English language texts, it is applicable to texts in other languages
  • Keywords
    classification; computational linguistics; grammars; natural languages; word processing; English language texts; FLUE; classification failure; conceptual unpredictabilities; expressive concept representation; fragmented language understanding environment; fragmented sublanguage domains; grammatical texts; low conceptual predictability; parallel search; piecewise parsing; robust parsing technique; text classification systems; unknown lexemes; unrecognized lexemes; word class restrictions; Artificial intelligence; Computer science; Content based retrieval; Data mining; Database languages; Electronic mail; Information retrieval; Natural language processing; Robustness; Text categorization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Artificial Intelligence Applications, 1991. Proceedings., Seventh IEEE Conference on
  • Conference_Location
    Miami Beach, FL
  • Print_ISBN
    0-8186-2135-4
  • Type

    conf

  • DOI
    10.1109/CAIA.1991.120842
  • Filename
    120842