Partical Swarm Optimization

Submitted by: Submitted by

Views: 704

Words: 14783

Pages: 60

Category: Science and Technology

Date Submitted: 04/25/2010 01:52 PM

Report This Essay

ARTICLE IN PRESS

European Journal of Operational Research xxx (2010) xxx–xxx

Contents lists available at ScienceDirect

European Journal of Operational Research

journal homepage: www.elsevier.com/locate/ejor

Discrete Optimization

A discrete particle swarm optimization method for feature selection in binary classification problems

Alper Unler *, Alper Murat

Department of Industrial and Manufacturing Engineering, Wayne State University, 4815 Fourth St. Detroit, MI 48202, USA

a r t i c l e

i n f o

a b s t r a c t

This paper investigates the feature subset selection problem for the binary classification problem using logistic regression model. We developed a modified discrete particle swarm optimization (PSO) algorithm for the feature subset selection problem. This approach embodies an adaptive feature selection procedure which dynamically accounts for the relevance and dependence of the features included the feature subset. We compare the proposed methodology with the tabu search and scatter search algorithms using publicly available datasets. The results show that the proposed discrete PSO algorithm is competitive in terms of both classification accuracy and computational performance. Ó 2010 Elsevier B.V. All rights reserved.

Article history: Received 29 April 2009 Accepted 23 February 2010 Available online xxxx Keywords: Feature selection Particle swarm optimization Metaheuristics Binary classification Logistic regression

1. Introduction Successful decision-making, whether done by an individual or as a group relies on the presence of many ingredients. Availability and quality of information is an essential element of successful decisions (O’Reilly, 1982). With the advent of key information technologies over the past several decades, decision makers have now access to vast amount of historical data. The extraction of valuable information from these data sources requires purposeful application of rigorous analysis techniques such as data...