Title of article :
Scale and time dependence of serial correlations in word-length time series of written texts
Author/Authors :
Rodriguez، نويسنده , , E. and Aguilar-Cornejo، نويسنده , , M. and Femat، نويسنده , , Juan R. and Alvarez-Ramirez، نويسنده , , J.، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2014
Pages :
9
From page :
378
To page :
386
Abstract :
This work considered the quantitative analysis of large written texts. To this end, the text was converted into a time series by taking the sequence of word lengths. The detrended fluctuation analysis (DFA) was used for characterizing long-range serial correlations of the time series. To this end, the DFA was implemented within a rolling window framework for estimating the variations of correlations, quantified in terms of the scaling exponent, strength along the text. Also, a filtering derivative was used to compute the dependence of the scaling exponent relative to the scale. The analysis was applied to three famous English-written literary narrations; namely, Alice in Wonderland (by Lewis Carrol), Dracula (by Bram Stoker) and Sense and Sensibility (by Jane Austen). The results showed that high correlations appear for scales of about 50–200 words, suggesting that at these scales the text contains the stronger coherence. The scaling exponent was not constant along the text, showing important variations with apparent cyclical behavior. An interesting coincidence between the scaling exponent variations and changes in narrative units (e.g., chapters) was found. This suggests that the scaling exponent obtained from the DFA is able to detect changes in narration structure as expressed by the usage of words of different lengths.
Keywords :
DFA , Written texts , Word length sequence , Quantitative analysis
Journal title :
Physica A Statistical Mechanics and its Applications
Serial Year :
2014
Journal title :
Physica A Statistical Mechanics and its Applications
Record number :
1738907
Link To Document :
بازگشت