Article

Feature subset selection in large dimensionality domains

Details

Citation

Gheyas IA & Smith L (2010) Feature subset selection in large dimensionality domains. Pattern Recognition, 43 (1), pp. 5-13. http://www.sciencedirect.com/science/journal/00313203; https://doi.org/10.1016/j.patcog.2009.06.009

Abstract
Searching for an optimal feature subset from a high dimensional feature space is known to be an NP-complete problem. We present a hybrid algorithm, SAGA, for this task. SAGA combines the ability to avoid being trapped in a local minimum of Simulated Annealing with the very high rate of convergence of the crossover operator of Genetic Algorithms, the strong local search ability of greedy algorithms and the high computational efficiency of Generalized Regression Neural Networks. We compare the performance over time of SAGA and well-known algorithms on synthetic and real datasets. The results show that SAGA outperforms existing algorithms.

Keywords
feature subset selection; high dimensional datasets; Neural networks (Computing science); Genetic algorithms

Journal
Pattern Recognition: Volume 43, Issue 1

StatusPublished
Publication date31/01/2010
URLhttp://hdl.handle.net/1893/1654
PublisherElsevier
Publisher URLhttp://www.sciencedirect.com/science/journal/00313203
ISSN0031-3203

People (1)

Professor Leslie Smith

Professor Leslie Smith

Emeritus Professor, Computing Science

Files (1)