• DocumentCode
    476202
  • Title

    Cross sentence alignment based on singular value decomposition

  • Author

    Ho, Anna ; Wong, Fai ; de Oliveira, F. ; Li, Yi-ping

  • Author_Institution
    Fac. of Sci. & Technol., Univ. of Macau, Macao
  • Volume
    5
  • fYear
    2008
  • fDate
    12-15 July 2008
  • Firstpage
    2524
  • Lastpage
    2529
  • Abstract
    This paper describes a method for aligning Portuguese-Chinese bilingual sentence pairs from two given documents. Extracting the aligned pairs is a critical step in building a Portuguese-Chinese bilingual corpus for Example-Based Machine Translation (EBMT) systems. In short, the proposed alignment system performs four steps: break the documents down to the sentence level, score each pair of sentences using different features, apply Singular Value Decomposition to the results, and extract the aligned pairs based on a similarity function.
  • Keywords
    language translation; natural languages; singular value decomposition; text analysis; EBMT; Portuguese-Chinese bilingual corpus; bilingual document alignment; cross sentence alignment; example based machine translation system; similarity function; Cybernetics; Dictionaries; Government; Machine learning; Matrix decomposition; Noise measurement; Noise reduction; Testing; Segmentation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Machine Learning and Cybernetics, 2008 International Conference on
  • Conference_Location
    Kunming
  • Print_ISBN
    978-1-4244-2095-7
  • Electronic_ISBN
    978-1-4244-2096-4
  • Type

    conf

  • DOI
    10.1109/ICMLC.2008.4620833
  • Filename
    4620833
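
The four steps summarized in the abstract (sentence splitting, pair scoring, Singular Value Decomposition, and extraction by a similarity function) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not specify the features, the rank, or the similarity function, so the function names, the toy `lexicon` lookup, the length-ratio feature, the 0.8/0.2 weights, and the `rank` and `threshold` parameters below are all assumptions made for illustration. Only `numpy.linalg.svd` and the standard `re` module are relied on.

```python
import re
import numpy as np


def split_sentences(text):
    """Step 1: split a document after Western or CJK sentence-final punctuation."""
    parts = re.split(r"(?<=[.!?。！？])\s*", text)
    return [p.strip() for p in parts if p.strip()]


def score_pair(src, tgt, lexicon):
    """Step 2: score one sentence pair with two hypothetical features.

    lexical: share of source words with a lexicon translation found in tgt.
    length:  ratio of the shorter to the longer sentence, in characters.
    """
    words = re.findall(r"\w+", src.lower())
    hits = sum(1 for w in words if any(t in tgt for t in lexicon.get(w, ())))
    lexical = hits / len(words) if words else 0.0
    length = min(len(src), len(tgt)) / max(len(src), len(tgt))
    return 0.8 * lexical + 0.2 * length  # weights are an assumption


def denoise(scores, rank):
    """Step 3: keep only the `rank` largest singular values of the score matrix."""
    u, s, vt = np.linalg.svd(scores, full_matrices=False)
    k = min(rank, len(s))
    return (u[:, :k] * s[:k]) @ vt[:k, :]


def align(src_sents, tgt_sents, lexicon, rank=2, threshold=0.3):
    """Step 4: pair each source sentence with its best-scoring target sentence.

    The "similarity function" here is simply the smoothed score itself;
    pairs whose best score falls below `threshold` are left unaligned.
    """
    scores = np.array(
        [[score_pair(s, t, lexicon) for t in tgt_sents] for s in src_sents]
    )
    smooth = denoise(scores, rank)
    pairs = []
    for i, row in enumerate(smooth):
        j = int(np.argmax(row))
        if row[j] >= threshold:
            pairs.append((i, j))
    return pairs
```

In this sketch, `align` returns index pairs `(i, j)` linking source sentence `i` to target sentence `j`. Truncating the SVD to a rank lower than the matrix's full rank is what smooths noisy scores; with `rank` equal to the full rank the score matrix is reproduced unchanged, so the choice of rank would need tuning on real data.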