DocumentCode :
1991529
Title :
A multisample criterion for changepoint analysis of texts
Author :
Zakrevskaya, N.S.
Author_Institution :
Novosibirsk State Tech. Univ., Russia
fYear :
2005
fDate :
26 June-2 July 2005
Firstpage :
749
Lastpage :
750
Abstract :
We construct a criterion for distinguishing homogeneous texts from non-homogeneous ones. The criterion is based on an analysis of triplet frequencies: among the corresponding empirical bridges we find the one that deviates most and analyze its deviation. The approach can distinguish homogeneous and non-homogeneous texts.
Keywords :
natural languages; text analysis; homogeneous texts; nonhomogeneous texts; text changepoint analysis; text identification; triplet frequencies analysis; Bridges; Frequency conversion; Libraries; Sections; Testing;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Science and Technology, 2005. KORUS 2005. Proceedings. The 9th Russian-Korean International Symposium on
Print_ISBN :
0-7803-8943-3
Type :
conf
DOI :
10.1109/KORUS.2005.1507893
Filename :
1507893