|Table of Contents|

An Improved Algorithm for Feature Selection Based on Pairwise Constraint(PDF)

南京师范大学学报(工程技术版)[ISSN:1006-6977/CN:61-1281/TN]

Issue:
2011年01期
Page:
56-61
Research Field:
Publishing date:

Info

Title:
An Improved Algorithm for Feature Selection Based on Pairwise Constraint
Author(s):
Yang Yang1Liu Huidong2
1.Intensification Culture School,Nanjing Normal University,Nanjing 210046,China; 2.School of Computer Science and Technology,Nanjing Normal University,Nanjing 210046,China
Keywords:
machine learningfeature selectionpairwise constraintclassification
PACS:
TP181
DOI:
-
Abstract:
Feature selection is key issue in machine learning field. As compared with unsupervised feature selection methods,supervised feature selection approaches have more better performances. However,most of the existing supervised feature selection algorithms mainly aim at the cases using the labels as supervised information,here these methods are not applied to the cases with pairwise constraints. In the real application,it is more easier to get the pairwise constraints as comparing with getting labels. So the researchers proposed a feature selection based on pairwise constraint, the algorithm obtains a feature sequence by measuring the significance of each single feature,but in fact the feature subset combining by those more important features may be not an effective feature subset. Therefore,in this paper,we introduce an improved feature selection algorithm based on pairwise constraint,the newly developed algorithm focuses on evaluating the importance of a feature subset but not a single feature,that is,it uses the empty feature subset as starting point,and then gradually extends this feature subset by adding a most effective feature in every round,in this way an effective ranking feature list is obtained. Experimental results show that the newly proposed algorithm is flexible.

References:

[1]Liu H,Motoda H. Feature selection for knowledge discovery and data mining[M]. Boston: Kluwer,1998.
[2]Yu L,Liu H. Feature selection for high-dimensional data: a fast correlation-based filter solution[C]/ / Proceedings of the 20th International Conferences on Machine Learning. Washington DC,2003: 856-863.
[3]Kohavi R,John G. Wrappers for feature subset selection[J]. Artificial Intelligence,1997,19( 1 /2) : 273-324.
[4]毛勇,周晓波,夏铮,等. 特征选择算法研究综述[J]. 模式识别与人工智能,2007,20( 2) : 211-218. Mao Yong,Zhou Xiaobo,Xia Zheng,et al. A survey for study of feature selection algorithms[J]. Pattern Recognition & Artificial Intelligence,2007,20( 2) : 211-218. ( in Chinese)
[5]朱颢东,李红婵,钟勇. 新颖的无监督特征选择方法[J]. 电子科技大学学报,2010,39( 3) : 412-415. Zhu Haodong,Li Hongchan,Zhong Yong. New unsupervised feature selection method[J]. Journal of University of Electronic Science and Technology of China,2010,39( 3) : 412-415. ( in Chinese)
[6]Mitra P,Murthy C A,Pal S K. Unsupervised feature selection using feature similarity[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2002,24( 3) : 301-312.
[7]Bishop C M. Neural Networks for Pattern Recognition[M]. Oxford: Oxford University Press,1995.
[8]王博,黄九鸣,贾焰,等. 适用于多种监督模型的特征选择方法研究[J]. 计算机研究与发展,2010,47( 9) : 1 548-1 557. Wang Bo,Huang Jiuming,Jia Yan,et al. Research on a common feature selection method for multiple supervised models[J]. Journal of Computer Research and Development,2010,47( 9) : 1 548-1 557. ( in Chinese)
[9]Zhao Z,Liu H. Semi-supervised feature selection via spectral analysis[C]/ / Proceedings of the 7th SIAM International Conference on Data Mining. Minneapolis: MN,2007: 641-646.
[10]Xing E P,Ng A Y,Jordan M I,et al. Distance metric learning,with application to clustering with side-information[C]/ / Proceedings of the Conference on Advances in Neural Information Processing Systems( NIPS) . 2002: 505-512.
[11]Zhang D,Chen S,Zhou Z H. Constraint Score: A new filter method for feature selection with pairwise constraints[J]. Pattern Recognition,2008,41( 5) : 1 440-1 451.
[12]Witten I H,Frank E. Data Mining: Practical Machine Learning Tools and Techniques[M]. 2nd ed. San Francisco: Morgan Kaufmann,2005.

Memo

Memo:
-
Last Update: 2013-03-21