• DocumentCode
    1944614
  • Title

    Regression in the Presence Missing Data Using Ensemble Methods

  • Author

    Hassan, Mostafa M. ; Atiya, Amir F. ; El-Gayar, Neamat ; El-Fouly, Raafat

  • Author_Institution
    Dept. of Comput. Eng., Cairo Univ., Cairo
  • fYear
    2007
  • fDate
    12-17 Aug. 2007
  • Firstpage
    1261
  • Lastpage
    1265
  • Abstract
    We consider the problem of missing data, and develop ensemble-network models for handling the missing data. The proposed method is based on utilizing the inherent uncertainty of the missing records in generating diverse training sets for the ensemble´s networks. The proposed method is based on generating the missing values using their probability density. We repeat this procedure many time thereby creating several complete data sets. A network is trained for each of these data sets, therefore obtaining an ensemble of networks. Several variants are proposed, including the univariate approach and the multivariate approach, which differ in the way missing values are generated. Simulation results confirm the general superiority of the proposed methods compared to the conventional approaches.
  • Keywords
    data handling; neural nets; probability; regression analysis; ensemble-network model; missing data handling; missing records; probability density; regression; Information technology; Learning systems; Linear regression; Machine learning algorithms; Maximum likelihood estimation; Neural networks; Parameter estimation; Statistics; Training data; Uncertainty;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2007. IJCNN 2007. International Joint Conference on
  • Conference_Location
    Orlando, FL
  • ISSN
    1098-7576
  • Print_ISBN
    978-1-4244-1379-9
  • Electronic_ISBN
    1098-7576
  • Type

    conf

  • DOI
    10.1109/IJCNN.2007.4371139
  • Filename
    4371139