Title :
Optimal preprocessing and FCM clustering of MIR, NIR and combined MIR-NIR spectra for classification of maize roots
Author :
Rammal, Abbas ; Perrin, Eric ; Vrabie, Valeriu ; Bertrand, Isabelle ; Habrant, Anouck ; Chabbert, Brigitte
Author_Institution :
CReSTIC, Univ. of Reims Champagne-Ardenne (URCA), Reims, France
fDate :
April 29 2014-May 1 2014
Abstract :
InfraRed spectroscopy (IR) provides useful information of the molecular composition of biological systems. Mid-InfraRed (MIR) spectroscopy reflects fundamental molecular vibrations whereas Near-InfraRed (NIR) spectroscopy exhibits the overtones and combinations of fundamental vibrations and bonds. In most applications, the samples are mixed with potassium bromide (KBr) powder, or simply unmixed. Two technics are investigated: IR absorption on mixed samples and Diffuse Reflectance IR Fourier Transform (DRIFT) on unmixed samples. IR spectra are collected in either MIR or NIR regions. However, the preprocessing of IR spectra, the choice of the spectral band and the combination of MIR-NIR information are important factors that could substantially influence analyses. This study investigates these factors while attempting to retrieve three different genotypes of maize roots via a Fuzzy C-Mean (FCM) classification of IR spectra. A bootstrapping procedure is used as the number of samples is limited. Results show that KBr spectroscopy is better than DRIFT spectroscopy for MIR region; MIR provides equivalent information as NIR for DRIFT spectroscopy; combination of MIR-NIR information gives preprocessing independent results. Several distances are tested in FCM classification. The city bloc distance gives optimal results compared with Euclidean, Chebyshev, correlation and diagonal distance.
Keywords :
Fourier transform infrared spectroscopy; botany; crops; fuzzy set theory; genomics; infrared spectra; pattern classification; pattern clustering; statistical analysis; Chebyshev distance; DRIFT spectroscopy; Euclidean distance; FCM classification; FCM clustering; IR absorption; IR spectra preprocessing; MIR regions; NIR regions; biological systems; bootstrapping procedure; city bloc distance; combined MIR-NIR spectra; correlation distance; diagonal distance; diffuse reflectance IR Fourier transform; fundamental bonds; fundamental molecular vibrations; fundamental vibrations; fuzzy c-mean classification; maize root classification; maize root genotypes; midinfrared spectroscopy; molecular composition; near-infrared spectroscopy; optimal preprocessing; potassium bromide powder; spectral band; Biomass; Classification algorithms; Clustering algorithms; Correlation; Polynomials; Soil; Spectroscopy; Bootstrapping; Classification of Lignocellulosic Biomass; Distance; FCM clustering; Maize Roots; Mid InfraRed; Near InfraRed;
Conference_Titel :
e-Technologies and Networks for Development (ICeND), 2014 Third International Conference on
Conference_Location :
Beirut
Print_ISBN :
978-1-4799-3165-1
DOI :
10.1109/ICeND.2014.6991363