• DocumentCode
    2750693
  • Title

    Segmentation of Mandarin Braille word and Braille translation based on multi-knowledge

  • Author

    Minghu, Jiang ; Xiaoyan, Zhu ; Ying, Xia ; Gang, Tan ; Baozong, Yuan ; Xiaofang, Tang

  • Author_Institution
    Inst. of Inf. Sci., Northern Jiaotong Univ., Beijing, China
  • Volume
    3
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    2070
  • Abstract
    This paper is about the segmentation of Braille words and the transformation from Mandarin Braille to Chinese characters. Braille word segmentation consists of the rules base, the signs base of segmentation and knowledge base for disambiguation and mistakes. By using adjacency constraints and bidirectional maximum matching with a dictionary, our system's segmentation precision is better than 99% for the common text. By incorporating a pinyin knowledge dictionary into the system, we perfectly solved the problem of ambiguity in the translation from Braille to pinyin and developed a statistical language model based on the transformation of pinyin into characters. By using a multi-knowledge base to carry out the disambiguation process for each pinyin sentence, we built a multi-level graph and used a Viterbi search to find the sequence of Chinese characters with maximum likelihood, and used an N-best algorithm to get the N most likely character sequences. The experimental results show that the system's overall precision for translation from Braille codes to Chinese characters is 94.38%.
  • Keywords
    knowledge based systems; language translation; sequences; Braille translation; Chinese characters; Mandarin Braille word; N-best algorithm; Viterbi search; adjacency constraints; bidirectional maximum matching; character sequences; disambiguation; knowledge base; maximum likelihood; mistakes; multi-knowledge; multi-level graph; pinyin knowledge dictionary; rules base; segmentation; segmentation precision; signs base; statistical language model; Dictionaries; Intelligent systems; Joining processes; Law; Legal factors; Natural languages; Smoothing methods; Statistics; Viterbi algorithm; Writing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Signal Processing Proceedings, 2000. WCCC-ICSP 2000. 5th International Conference on
  • Conference_Location
    Beijing
  • Print_ISBN
    0-7803-5747-7
  • Type

    conf

  • DOI
    10.1109/ICOSP.2000.893513
  • Filename
    893513
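
The abstract names bidirectional maximum matching with a dictionary as one ingredient of the segmentation step. The following is a minimal sketch of that general technique, not the authors' implementation. It assumes a plain set of strings as the lexicon, uses Chinese characters as stand-ins for Braille cells, and applies a common tie-break heuristic (fewer words, then fewer single-unit words, otherwise the backward result) that the abstract does not specify. All function names are hypothetical.

```python
def forward_max_match(text, lexicon, max_len):
    """Scan left to right, taking the longest lexicon entry at each position."""
    words, i = [], 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            # A single unit is always accepted so the scan cannot stall.
            if size == 1 or piece in lexicon:
                words.append(piece)
                i += size
                break
    return words


def backward_max_match(text, lexicon, max_len):
    """Scan right to left, taking the longest lexicon entry ending at each position."""
    words, j = [], len(text)
    while j > 0:
        for size in range(min(max_len, j), 0, -1):
            piece = text[j - size:j]
            if size == 1 or piece in lexicon:
                words.append(piece)
                j -= size
                break
    words.reverse()
    return words


def bidirectional_max_match(text, lexicon):
    """Return (segmentation, ambiguous) where ambiguous marks a forward/backward mismatch."""
    max_len = max((len(w) for w in lexicon), default=1)
    fwd = forward_max_match(text, lexicon, max_len)
    bwd = backward_max_match(text, lexicon, max_len)
    if fwd == bwd:
        return fwd, False

    # Assumed heuristic: prefer fewer words, then fewer single-unit words.
    def cost(seg):
        return (len(seg), sum(1 for w in seg if len(w) == 1))

    # On a tie the backward result is kept (also an assumption).
    return (fwd if cost(fwd) < cost(bwd) else bwd), True


# Toy lexicon; the two scans disagree on this input.
toy_lexicon = {"研究", "研究生", "生命", "起源"}
segments, ambiguous = bidirectional_max_match("研究生命起源", toy_lexicon)
```

In this toy input the forward scan yields 研究生 / 命 / 起源 and the backward scan yields 研究 / 生命 / 起源, so the mismatch is flagged. In a system like the one described, flagged spans would be the natural place to apply further disambiguation knowledge such as the adjacency constraints the abstract mentions.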