DocumentCode
476202
Title
Cross sentence alignment based on singular value decomposition
Author
Ho, Anna ; Wong, Fai ; de Oliveira, F. ; Li, Yi-ping
Author_Institution
Fac. of Sci. & Technol., Univ. of Macau, Macao
Volume
5
fYear
2008
fDate
12-15 July 2008
Firstpage
2524
Lastpage
2529
Abstract
This paper describes a method for aligning Portuguese-Chinese bilingual sentence pairs from two given documents. Extracting the aligned pairs is a critical step in building a Portuguese-Chinese bilingual corpus for Example-Based Machine Translation (EBMT) systems. In short, the proposed alignment system performs four steps: break the documents down to the sentence level, score each pair of sentences using different features, apply Singular Value Decomposition to the results, and extract the aligned pairs based on a similarity function.
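The last two steps of the abstract (SVD on the pair scores, then extraction) can be illustrated with a minimal sketch. It is not the authors' implementation: the score matrix is made up, the rank `k` is a hypothetical parameter, and a mutual-best-match rule stands in for the paper's similarity function, which the abstract does not specify.

```python
import numpy as np

def low_rank(scores, k):
    """Rank-k approximation of the score matrix via truncated SVD."""
    u, s, vt = np.linalg.svd(scores, full_matrices=False)
    # Keep the k largest singular values and their vectors.
    return (u[:, :k] * s[:k]) @ vt[:k, :]

def align(scores, k=2):
    """Return (source_index, target_index) pairs that are mutual best matches.

    Assumption: mutual argmax replaces the paper's unspecified similarity function.
    """
    smoothed = low_rank(np.asarray(scores, dtype=float), k)
    best_tgt = smoothed.argmax(axis=1)  # best target for each source sentence
    best_src = smoothed.argmax(axis=0)  # best source for each target sentence
    return [(i, int(j)) for i, j in enumerate(best_tgt) if best_src[j] == i]

# Hypothetical feature scores for 3 source and 3 target sentences.
scores = [[0.9, 0.1, 0.0],
          [0.2, 0.8, 0.1],
          [0.0, 0.2, 0.7]]
pairs = align(scores, k=3)
```

A smaller `k` discards the weakest singular directions, which is one plausible way to suppress noisy scores before extraction; the paper itself would need to be consulted for the rank and similarity function actually used.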
Keywords
language translation; natural languages; singular value decomposition; text analysis; EBMT; Portuguese-Chinese bilingual corpus; bilingual document alignment; cross sentence alignment; example based machine translation system; similarity function; Cybernetics; Dictionaries; Government; Machine learning; Matrix decomposition; Noise measurement; Noise reduction; Testing; Segmentation
fLanguage
English
Publisher
IEEE
Conference_Titel
Machine Learning and Cybernetics, 2008 International Conference on
Conference_Location
Kunming
Print_ISBN
978-1-4244-2095-7
Electronic_ISBN
978-1-4244-2096-4
Type
conf
DOI
10.1109/ICMLC.2008.4620833
Filename
4620833