  • Title of article

    A top-down linguistic approach to the analysis of genomic sequences: The metabotropic glutamate receptors 1 and 5 in human and in mouse as a case study

  • Author/Authors

    Menconi, Giulia; Puliti, Aldamaria; Sbrana, Isabella; Conti, Valerio; Marangoni, Roberto

  • Issue Information
    Journal with serial number, year 2011
  • Pages
    9
  • From page
    134
  • To page
    142
  • Abstract
    This paper presents a top-down strategy to detect features in genomic sequences. The strategy's core is to exploit dictionary-based compression algorithms and analyse the content of the automatically generated dictionary. We classify the different over-represented segments and in the case study we correlate them to experimentally identified or theoretically forecasted biological features. A large spectrum analysis reveals that the only feature co-located with the a priori extracted segments is the torsional flexibility of DNA, while non-B DNA configurations are anti-localized and other features are mostly independent of the extracted sequences. This analysis unravels complex relationships between the linguistic structures investigated under our approach and some known biological features.
  • Keywords
    Over-represented segments, DNA flexibility, Combinatorics on words
  • Journal title
    Journal of Theoretical Biology
  • Serial Year
    2011
  • Record number

    1540487
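
The abstract describes running a dictionary-based compression algorithm over a genomic sequence and then inspecting the dictionary it produces. The record does not say which algorithm the authors used, so the sketch below assumes a plain LZ78-style parse as a stand-in. The function names `lz78_dictionary` and `longest_phrases` and the toy input are hypothetical and serve only to illustrate the general idea.

```python
def lz78_dictionary(sequence):
    """Parse `sequence` LZ78-style and return the phrases in insertion order.

    Assumption: LZ78 stands in for the unspecified dictionary-based
    compressor mentioned in the abstract.
    """
    dictionary = {}  # phrase -> 1-based index; dicts keep insertion order
    phrase = ""
    for symbol in sequence:
        candidate = phrase + symbol
        if candidate in dictionary:
            # Keep extending the current phrase while it is already known.
            phrase = candidate
        else:
            # First unseen extension becomes a new dictionary entry.
            dictionary[candidate] = len(dictionary) + 1
            phrase = ""
    # A trailing, already-known phrase adds no new entry and is dropped.
    return list(dictionary)


def longest_phrases(sequence, top=5):
    """Return the `top` longest dictionary phrases (hypothetical helper)."""
    return sorted(lz78_dictionary(sequence), key=len, reverse=True)[:top]


if __name__ == "__main__":
    toy = "AACAACAAC"  # toy input, not real genomic data
    print(lz78_dictionary(toy))
    print(longest_phrases(toy, top=1))
```

Repeated segments tend to produce longer dictionary phrases, which is why the longest entries are a natural first place to look for over-represented material. A real analysis would need a more careful definition of over-representation than phrase length alone.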