• DocumentCode
    589963
  • Title

    Anatomy of a globally recursive embedded LINPACK benchmark

  • Author

    Dongarra, Jack ; Luszczek, Piotr

  • Author_Institution
    Innovative Comput. Lab., Univ. of Tennessee, Knoxville, TN, USA
  • fYear
    2012
  • fDate
    10-12 Sept. 2012
  • Firstpage
    1
  • Lastpage
    6
  • Abstract
    We present a complete bottom-up implementation of an embedded LINPACK benchmark on the iPad 2. We use a novel formulation of a recursive LU factorization that is recursive and parallel at the global scope. We believe our new algorithm presents an alternative to existing linear algebra parallelization techniques such as master-worker and DAG-based approaches. We show an assembly API that allows a much higher level of abstraction and provides rapid code development within the confines of the mobile device SDK. We use performance modeling to help with the limitations of the device and the limited access to the device from a development environment not geared for HPC application tuning.
  • Keywords
    application program interfaces; embedded systems; linear algebra; notebook computers; parallel processing; software libraries; DAG-based approach; assembly API; code development; globally recursive embedded LINPACK benchmark; iPad 2; linear algebra parallelization techniques; master-worker; mobile device SDK; recursive LU factorization; Assembly; Hardware; Performance evaluation; Registers; Synchronization; Tablet computers; Tiles;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    High Performance Extreme Computing (HPEC), 2012 IEEE Conference on
  • Conference_Location
    Waltham, MA
  • Print_ISBN
    978-1-4673-1577-7
  • Type

    conf

  • DOI
    10.1109/HPEC.2012.6408679
  • Filename
    6408679