Unsupervised feature selection using feature similarity

Mitra, P. ; Murthy, C. A. ; Pal, S. K. (2002) Unsupervised feature selection using feature similarity IEEE Transactions on Pattern Analysis and Machine Intelligence, 24 (3). pp. 301-312. ISSN 0162-8828

[img]
Preview
PDF - Publisher Version
301kB

Official URL: http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arn...

Related URL: http://dx.doi.org/10.1109/34.990133

Abstract

In this article, we describe an unsupervised feature selection algorithm suitable for data sets, large in both dimension and size. The method is based on measuring similarity between features whereby redundancy therein is removed. This does not need any search and, therefore, is fast. A new feature similarity measure, called maximum information compression index, is introduced. The algorithm is generic in nature and has the capability of multiscale representation of data sets. The superiority of the algorithm, in terms of speed and performance, is established extensively over various real-life data sets of different sizes and dimensions. It is also demonstrated how redundancy and information loss in feature selection can be quantified with an entropy measure.

Item Type:Article
Source:Copyright of this article belongs to Institute of Electrical and Electronic Engineers.
ID Code:26053
Deposited On:06 Dec 2010 13:10
Last Modified:17 May 2016 09:24

Repository Staff Only: item control page