• DocumentCode
    17815
  • Title

    Product Grammars for Alignment and Folding

  • Author

    Honer zu Siederdissen, Christian ; Hofacker, Ivo L. ; Stadler, Peter F.

  • Author_Institution
    Dept. of Theor. Chem., Univ. of Vienna, Vienna, Austria
  • Volume
    12
  • Issue
    3
  • fYear
    2015
  • fDate
    May-June 1 2015
  • Firstpage
    507
  • Lastpage
    519
  • Abstract
    We develop a theory of algebraic operations over linear and context-free grammars that makes it possible to combine simple “atomic” grammars operating on single sequences into complex, multi-dimensional grammars. We demonstrate the utility of this framework by constructing the search spaces of complex alignment problems on multiple input sequences explicitly as algebraic expressions of very simple one-dimensional grammars. In particular, we provide a fully worked frameshift-aware, semiglobal DNA-protein alignment algorithm whose grammar is composed of products of small, atomic grammars. The compiler accompanying our theory makes it easy to experiment with the combination of multiple grammars and different operations. Composite grammars can be written out in LATEX for documentation and as a guide to implementation of dynamic programming algorithms. An embedding in Haskell as a domain-specific language makes the theory directly accessible to writing and using grammar products without the detour of an external compiler. Software and supplemental files available here: http://www.bioinf. uni-leipzig.de/Software/gramprod/.
  • Keywords
    DNA; bioinformatics; context-free grammars; molecular biophysics; programming languages; proteins; LATEX; algebraic operations; complex alignment problems; complex multidimensional grammars; context-free grammars; domain-specific language; dynamic programming algorithms; folding; frameshift-aware semiglobal DNA-protein alignment algorithm; linear grammars; multiple input sequences; product grammars; simple atomic grammars; Bioinformatics; Computational biology; Dynamic programming; Grammar; Heuristic algorithms; IEEE transactions; Production; Haskell; context free grammar; linear grammar; multiple alignment; product structure;
  • fLanguage
    English
  • Journal_Title
    Computational Biology and Bioinformatics, IEEE/ACM Transactions on
  • Publisher
    ieee
  • ISSN
    1545-5963
  • Type

    jour

  • DOI
    10.1109/TCBB.2014.2326155
  • Filename
    6819790