Title :
Reproducibility of Experimental Results from a Highly Parallelized Classification Algorithm
Author :
Wiebe, Conrad ; Pizzi, Nick J.
Author_Institution :
Univ. of Manitoba, Winnipeg
Abstract :
The classification and interpretation of high-dimensional biomedical data is frequently a computationally expensive problem. When analyzing such data it is often advantageous to identify a subset of relevant features that minimize classification errors. During the discovery of such subsets, reproducibility (repeatability) of experimental results is an essential requirement. We present a load balanced parallel algorithm for identifying discriminatory feature subsets. Also presented are various techniques used to ensure the repeatability of the algorithm´s experimental results. Experiments conducted on biomedical spectra using a variety of the presented parallelization approaches show some techniques to be significantly more effective than others.
Keywords :
feature extraction; medical computing; parallel algorithms; pattern classification; biomedical spectra; discriminatory feature subsets; high-dimensional biomedical data; highly parallelized classification algorithm; load balanced parallel algorithm; parallelization approaches; reproducibility; Bioinformatics; Classification algorithms; Computer science; Data analysis; Iterative methods; Load management; Machine learning algorithms; Monitoring; Random number generation; Reproducibility of results;
Conference_Titel :
Electrical and Computer Engineering, 2007. CCECE 2007. Canadian Conference on
Conference_Location :
Vancouver, BC
Print_ISBN :
1-4244-1020-7
Electronic_ISBN :
0840-7789
DOI :
10.1109/CCECE.2007.153