A Strategy for Dealing With Missing Values by Using Selective Activation Neurons in a Multi-Topology Framework

Authors

Abstract

Neural Networks (NN) have proven to be able to successfully solve problems in many areas. However, for large scale real problems, data is often incomplete. This is a serious problem, because NN cannot handle directly missing values. The usual approach to solve this problem consists of removing attributes and/or samples containing unknown values. This strategy is very attractive since it is simple to implement and reduces the dimensionality of data, therefore potentially reducing the complexity of the problem. However removing features or instances containing vital information, which can not be compensated by the remaining data, may result in the unattainability of accurate NN models. Another strategy consists of estimating missing values. However, wrong estimations of crucial data can lead to unpredicted results. Moreover these techniques do not account for real situations (e.g. where sensors may fail) and the output of such NN models may cause instabilities in the whole process. In this paper we propose a technique for handling missing values that accounts for the creation of different transparent NN models with respect to the missing features instead of relying on tedious data pre-processing techniques. The resulting models are bounded to share information among them. Contrary to the imputation of data (estimate) our models take into account the uncertainty caused by unknown values. Moreover the presented technique is prepared to deal with faulty sensors. The preliminary results obtained in several datasets show the efficacy of the proposed approach.

Keywords

Missing Values, Machine Learning

Subject

Missing Values

Conference

IEEE World Congress on Computational Intelligence (WCCI 2010), July 2010

DOI

Cited by

Year 2015 : 1 citations

K.Karthikeyan, Sayantani Basu. Neural Network Technique in Data Mining for Prediction of Earth Quake. International Journal Of Pharmacy & Technology. Vol. 7, Issue 3, pp. 9543-9554 (2015)

Year 2012 : 1 citations

Andréa Martiniano da Silva. Aplicação de técnica da mineração de dados na identificação do perfil de empregados absenteístas e presenteístas em uma empresa de courier da cidade de são paulo. Master’s thesis, Universidade Nove de Julho – UNINOVE, 2012.