Title :
Semi-supervised Machine Learning Algorithm in Near Infrared Spectral Calibration: A Case Study to Determine Cetane Number and Total Aromatics of Diesel Fuels
Author :
Wang, Songjing ; Wu, Di ; Liu, Kangsheng
Author_Institution :
Dept. of Math., Zhejiang Univ., Hangzhou, China
Abstract :
A new spectral calibration algorithm, Laplacian regularized least squares (LapRLS), was proposed. Commonly least squares support vector machine (LS-SVM) and partial least squares (PLS) are used for the spectral quantitative model establishment. However, LS-SVM and PLS are supervised machine learning algorithms which just make use of labeled data. LapRLS is a semi-supervised machine learning algorithm which makes use of both labeled and unlabeled data for training. In this study, LapRLS was used to establish the quantitative relationship between near infrared (NIR) spectra and cetane number (CN) and total aromatics of diesel fuels. Near infrared (NIR) spectroscopy is a widely used technique for monitoring chemical compounds in petroleum industry. A total of 381 obtained samples were randomly split into two sets under different proportion. One set was used as calibration set (labeled data) whereas the remaining samples were used as the prediction set (unlabeled data). LapRLS, LS-SVM, and PLS were used to establish determination models based on NIR spectra. Results show that the best performance of determination was achieved by LapRLS, which indicates that LapRLS can utilize unlabeled data effectively on NIR spectral data for the determination of the chemical compounds of diesel fuels.
Keywords :
calibration; infrared spectroscopy; learning (artificial intelligence); least squares approximations; petroleum; petroleum industry; support vector machines; LS-SVM; LapRLS; Laplacian regularized least squares; PLS; cetane number determination; chemical compounds monitoring; diesel fuels; labeled data; least squares support vector machine; near infrared spectral calibration; near infrared spectroscopy; partial least squares; petroleum industry; semi-supervised machine learning algorithm; total aromatics determination; unlabeled data; Algorithm design and analysis; Calibration; Fuels; Kernel; Laplace equations; Spectroscopy; Training; laplacian regularized least squares; near infrared spectroscopy; semi-supervised learning; supervised learning;
Conference_Titel :
Intelligent Computation Technology and Automation (ICICTA), 2012 Fifth International Conference on
Conference_Location :
Zhangjiajie, Hunan
Print_ISBN :
978-1-4673-0470-2
DOI :
10.1109/ICICTA.2012.84