Title :
Rhythmical criteria of text meaningfulness
Author_Institution :
Linguistic Inst., Acad. of Sci., Moscow, Russia
Abstract :
The paper describes a close relationship discovered between the rhythm of printed prose text and its content. Processing the enormous number of texts on the WWW can be optimized using criteria that reflect their nature. One current problem is that programs processing texts on the Internet can never know for certain what kind of word material they are handling: the same result can be obtained by processing a meaningless mixture of words. Only word order, or word sequence, reflects the meaning of the whole text, but it is hard to convert into numerical indexes. The paper discusses the ancient idea of a close relation between the content of prose text and the flow of its rhythm. First, the rhythm of prose text really exists, and its rhythm primitives are regular verse rhythm sequences. These primitives must be variable, however: they should constantly change their elementary rhythm patterns, otherwise the whole text would be rejected by the human mind. The problem is to find a numeric index that reflects such flowing matter. The rhythmical indexes presented can be regarded as a statistical test measuring the meaningfulness of any sequence of words. Indeed, this statistical criterion can be called the razor of meaningfulness.
Keywords :
Internet; statistical analysis; text analysis; WWW; elementary rhythm patterns; flowing matter; numeric index; numerical indexes; printed prosaic text rhythm; prosaic rhythm primitives; prosaic text; regular verse rhythm sequences; rhythm primitives; rhythmical criteria; statistical criterion; statistical test; text meaningfulness; text processing; word mixture processing; word order; word sequence; Books; Humans; Indexing; Packaging; Rhythm; Speech processing; Testing; World Wide Web;
Conference_Title :
Systems, Man, and Cybernetics, 2001 IEEE International Conference on
Conference_Location :
Tucson, AZ
Print_ISBN :
0-7803-7087-2
DOI :
10.1109/ICSMC.2001.969858