Set characterization-selection towards classification based on interaction index - INRAE - Institut national de recherche pour l’agriculture, l’alimentation et l’environnement
Journal article, Fuzzy Sets and Systems, Year: 2015

Set characterization-selection towards classification based on interaction index

Abstract

In many real-world datasets, both the individual and the coordinated action of features may be relevant for class identification. In this paper, a computational strategy for relevant feature selection based on the characterization of redundant or complementary features is proposed. The characterization is achieved using fuzzy measures and an interaction index computed from fuzzy measure coefficients. Fuzzy measure identification requires raw data to be turned into confidence degrees. This key step is carried out by considering the distributions of feature values across all the classes. Fuzzy measure coefficients are then estimated with an improved version of the Heuristic Least Mean Squares algorithm that includes efficient management of untouched coefficients. Then, a generalization of the Shapley index to an arbitrary number of features is used. Simulation experiments on synthetic datasets are performed to study the behavior of this generalized interaction index. For extreme datasets, containing either redundant or complementary features as well as noise, the index value is given by a mathematical formula. This result is used to motivate feature selection guidelines that take feature interactions into account. Experimental results on benchmark datasets show that the proposal allows for the design of compact, interpretable and competitive classification models.
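The abstract does not state the formula of the generalized index. As a minimal sketch, assuming it matches the interaction index commonly attributed to Grabisch (which reduces to the Shapley value for a single feature), the computation from fuzzy measure coefficients might look like the following. The function names and the three toy two-feature measures are hypothetical illustrations, not the authors' data or code, and the confidence-degree and Heuristic Least Mean Squares steps are not covered.

```python
from itertools import combinations
from math import factorial


def subsets(items):
    """Yield every subset of `items` as a frozenset."""
    items = tuple(items)
    for r in range(len(items) + 1):
        for combo in combinations(items, r):
            yield frozenset(combo)


def interaction_index(mu, features, A):
    """Interaction index of feature subset A for fuzzy measure `mu`.

    Assumed formula (not given in the abstract):
      I(A) = sum over B in N\\A of (n-|B|-|A|)! |B|! / (n-|A|+1)!
             * sum over C in A of (-1)^(|A|-|C|) * mu(B u C)
    `mu` maps each frozenset of features to its coefficient.
    """
    N = frozenset(features)
    A = frozenset(A)
    n, a = len(N), len(A)
    total = 0.0
    for B in subsets(N - A):
        b = len(B)
        weight = factorial(n - b - a) * factorial(b) / factorial(n - a + 1)
        delta = sum((-1) ** (a - len(C)) * mu[B | C] for C in subsets(A))
        total += weight * delta
    return total


# Hypothetical toy measures on two features, chosen for illustration only.
E, F1, F2, F12 = frozenset(), frozenset({1}), frozenset({2}), frozenset({1, 2})
redundant = {E: 0.0, F1: 1.0, F2: 1.0, F12: 1.0}      # either feature suffices
complementary = {E: 0.0, F1: 0.0, F2: 0.0, F12: 1.0}  # both features needed
additive = {E: 0.0, F1: 0.5, F2: 0.5, F12: 1.0}       # independent features

print(interaction_index(redundant, [1, 2], [1, 2]))      # -1.0
print(interaction_index(complementary, [1, 2], [1, 2]))  # 1.0
print(interaction_index(additive, [1, 2], [1, 2]))       # 0.0
print(interaction_index(redundant, [1, 2], [1]))         # 0.5 (Shapley value)
```

Under this assumed definition, a negative value signals redundancy, a positive value signals complementarity, and a value near zero suggests the features act independently, which is the kind of characterization the abstract refers to.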
Main file
mo2015-pub00043660.pdf (977.21 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-01357526, version 1 (30-08-2016)

Cite

J. Murillo, S. Guillaume, F. Spetale, E. Tapia, P. Bulacio. Set characterization-selection towards classification based on interaction index. Fuzzy Sets and Systems, 2015, 270, pp.74-89. ⟨10.1016/j.fss.2014.09.015⟩. ⟨hal-01357526⟩