Title :
Word ordering entropy difference of English and Chinese texts
Author :
To, Kiwing ; Wang, C.S. ; Liang, Tyne
Author_Institution :
Inst. of Phys., Acad. Sinica, Taipei, Taiwan
Abstract :
We compare the relative change in word ordering entropies of English and Chinese translations of the Bible by treating the texts as sequences of multi-grams. Using randomized texts with exactly the same segmentation structures and word statistics as the references, we find that the changes in the multi-gram entropies for the Chinese version are always larger than those for the English version. This suggests that the word ordering in the Chinese version is more ambiguous than that in the English version.
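The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; several choices are assumptions made only for illustration: the text is represented as a list of tokenized segments (for example, verses), the entropy is the Shannon entropy of the empirical n-gram distribution with n-grams not crossing segment boundaries, the randomized text is obtained by shuffling all words while keeping each segment's length, and the relative change is defined as (H_random - H_reference) / H_random. The function names are hypothetical.

```python
import math
import random
from collections import Counter


def ngram_entropy(segments, n):
    """Shannon entropy (in bits) of the empirical n-gram distribution.

    `segments` is a list of token lists; n-grams never cross a segment
    boundary (an assumption of this sketch).
    """
    counts = Counter(
        tuple(seg[i:i + n])
        for seg in segments
        for i in range(len(seg) - n + 1)
    )
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum(c / total * math.log2(c / total) for c in counts.values())


def shuffle_preserving_structure(segments, seed=0):
    """Shuffle all words across the text, keeping every segment's length.

    Word frequencies and segment lengths are unchanged; only the word
    order is randomized.
    """
    rng = random.Random(seed)
    words = [w for seg in segments for w in seg]
    rng.shuffle(words)
    shuffled, pos = [], 0
    for seg in segments:
        shuffled.append(words[pos:pos + len(seg)])
        pos += len(seg)
    return shuffled


def relative_entropy_change(segments, n, seed=0):
    """(H_random - H_reference) / H_random, an assumed definition."""
    h_ref = ngram_entropy(segments, n)
    h_rand = ngram_entropy(shuffle_preserving_structure(segments, seed), n)
    return (h_rand - h_ref) / h_rand if h_rand else 0.0
```

Under these assumptions, the unigram entropy (n = 1) is identical for the reference and the randomized text, so any difference for n ≥ 2 reflects word ordering alone. Applying `relative_entropy_change` to tokenized English and Chinese versions for several values of n would give the kind of comparison the abstract describes; Chinese text would first need a word segmenter, which this sketch does not provide.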
Keywords :
computational linguistics; entropy; text analysis; Bible; Chinese texts; Chinese version; English texts; English version; multigram sequence; randomized text; segmentation structures; word ordering entropy difference; words statistics; Books; Computers; Measurement uncertainty; Syntactics; Uncertainty; Vocabulary; multi-gram analysis
Conference_Title :
Computer Symposium (ICS), 2010 International
Conference_Location :
Tainan
Print_ISBN :
978-1-4244-7639-8
DOI :
10.1109/COMPSYM.2010.5685443