DocumentCode
1748596
Title
Rapid profiling via stratified sampling
Author
Sastry, S. Subramanya ; Bodík, Rastislav ; Smith, James E.
Author_Institution
Dept. of Comput. Sci., Wisconsin Univ., Madison, WI, USA
fYear
2001
fDate
2001
Firstpage
278
Lastpage
289
Abstract
Sophisticated binary translators and dynamic optimizers demand a program profiler with low overhead, high accuracy, and the ability to collect a variety of profile types. A profiling scheme that achieves these goals is proposed. Conceptually, the hardware compresses a stream of profile data by counting identical events; the compressed profile data is passed to software for analysis. Compressing the high-bandwidth event stream greatly reduces software overhead. Because optimizations can tolerate some profiling errors, we allow the stream compressor to be lossy, thereby enabling a low-cost sampling-based hardware design. Because the hardware compressor is insensitive to the event content, it supports various profile types and can process multiple types simultaneously. Basic components of our framework are periodic and random samplers, counters, and hash functions. These components are composed to form a variety of stream compressors. One design is both simple and very effective: the input stream is hash-split into multiple substreams, each of which is fed into a simple periodic sampler that selects every kth event. This stratified periodic sampler performs better than conventional random sampling because it biases each substream towards a small number of unique events, thereby reducing sampling error and allowing faster convergence to an accurate profile. For example, convergence to a given level of accuracy is about twice as fast for gcc. When sampling overhead is considered, the stratified periodic profiler achieves less than 3% error while incurring an overhead of only 3.5% for gcc
Keywords
computer architecture; program interpreters; binary translators; dynamic optimizers; hash functions; high-bandwidth event stream; program profiler; random sampling; rapid profiling; software overhead; stratified periodic sampler; stratified sampling; Convergence; Counting circuits; Design optimization; Dynamic scheduling; Fluid flow measurement; Hardware; Information analysis; Magnetohydrodynamic power generation; Performance analysis; Sampling methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Architecture, 2001. Proceedings. 28th Annual International Symposium on
Conference_Location
Goteborg
ISSN
1063-6897
Print_ISBN
0-7695-1162-7
Type
conf
DOI
10.1109/ISCA.2001.937456
Filename
937456
Link To Document