Title :
Segmentation of Greek Text by Dynamic Programming
Author :
Fragkou, P. ; Petridis, V. ; Kehagias, Athanasios
Author_Institution :
Aristotle Univ. of Thessaloniki, Thessaloniki
Abstract :
We introduce a dynamic programming algorithm to perform linear segmentation of concatenated texts by global minimization of a segmentation cost function which consists of: (a) within-segment word similarity (expressed in terms of the generalized density of the text dotplot) and (b) prior information regarding segment length. Our algorithm is evaluated on two Greek text collections and proves that it outperforms several other algorithms because it performs global optimization of a global cost function.
Keywords :
dynamic programming; text analysis; word processing; Greek text linear segmentation; concatenated texts; cost function; dynamic programming; global optimization; within-segment word similarity; Artificial intelligence; Concatenated codes; Cost function; Dynamic programming; Heuristic algorithms; Minimization methods; Performance evaluation; Statistics; Vocabulary;
Conference_Titel :
Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
Conference_Location :
Patras
Print_ISBN :
978-0-7695-3015-4
DOI :
10.1109/ICTAI.2007.25