DocumentCode
2442087
Title
Offline library adaptation using automatically generated heuristics
Author
De Mesmay, Frédéric ; Voronenko, Yevgen ; Püschel, Markus
Author_Institution
Dept. of Electr. & Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear
2010
fDate
19-23 April 2010
Firstpage
1
Lastpage
10
Abstract
Automatic tuning has emerged as a solution to provide high-performance libraries for fast changing, increasingly complex computer architectures. We distinguish offline adaptation (e.g., in ATLAS) that is performed during installation without the full problem description from online adaptation (e.g., in FFTW) that is performed at runtime. Offline adaptive libraries are simpler to use, but, unfortunately, writing the adaptation heuristics that power them is a daunting task. The overhead of online adaptive libraries, on the other hand, makes them unsuitable for a number of applications. In this paper, we propose to automatically generate heuristics in the form of decision trees using a statistical classifier, effectively converting an online adaptive library into an offline one. As testbed we use Spiral-generated adaptive transform libraries for current multicores with vector extensions. We show that replacing the online search with generated decision trees maintains a performance competitive with vendor libraries while allowing for a simpler interface and reduced computation overhead.
Keywords
decision trees; learning (artificial intelligence); pattern classification; software libraries; statistical analysis; ATLAS; FFTW; Spiral-generated adaptive transform libraries; automatic tuning; automatically generated heuristics; decision trees; high-performance libraries; offline adaptive libraries; statistical classifier; Classification tree analysis; Computer architecture; Decision trees; Discrete Fourier transforms; High performance computing; Linear algebra; Runtime; Software libraries; Timing; Writing; FFT; automatic performance tuning; decision trees; fast Fourier transform; high-performance computing; library generation; machine learning; statistical classifier;
fLanguage
English
Publisher
ieee
Conference_Titel
Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on
Conference_Location
Atlanta, GA
ISSN
1530-2075
Print_ISBN
978-1-4244-6442-5
Type
conf
DOI
10.1109/IPDPS.2010.5470479
Filename
5470479
Link To Document