Title of article
A top-down linguistic approach to the analysis of genomic sequences: The metabotropic glutamate receptors 1 and 5 in human and in mouse as a case study
Author/Authors
Menconi، نويسنده , , Giulia and Puliti، نويسنده , , Aldamaria and Sbrana، نويسنده , , Isabella and Conti، نويسنده , , Valerio and Marangoni، نويسنده , , Roberto، نويسنده ,
Issue Information
روزنامه با شماره پیاپی سال 2011
Pages
9
From page
134
To page
142
Abstract
This paper presents a top-down strategy to detect features in genomic sequences. The strategyʹs core is to exploit dictionary-based compression algorithms and analyse the content of the automatically generated dictionary. We classify the different over-represented segments and in the case study we correlate them to experimentally identified or theoretically forecasted biological features. A large spectrum analysis reveals that the only feature co-located with the a priori extracted segments is the torsional flexibility of DNA, while non-B DNA configurations are anti-localized and other features are mostly independent of the extracted sequences. This analysis unravels complex relationships between the linguistic structures investigated under our approach and some known biological features.
Keywords
Over-represented segments , DNA flexibility , Combinatorics on words
Journal title
Journal of Theoretical Biology
Serial Year
2011
Journal title
Journal of Theoretical Biology
Record number
1540487
Link To Document