|Table of Contents|

Online Hierarchical Streaming Feature Selection Based on Neighborhood Decision Error Rate(PDF)

南京师范大学学报(工程技术版)[ISSN:1006-6977/CN:61-1281/TN]

Issue:
2022年04期
Page:
9-18
Research Field:
计算机科学与技术
Publishing date:

Info

Title:
Online Hierarchical Streaming Feature Selection Based on Neighborhood Decision Error Rate
Author(s):
Wang Chenxi12Liu Yuankui12Lv Yan12Lin Yaojin12
(1.School of Computer Science,Minnan Normal University,Zhangzhou 363000,China)
(2.Key Laboratory of Data Science and Intelligence Application,Minnan Normal University,Zhangzhou 363000,China)
Keywords:
online streaming feature selectionhierarchical classificationsibling relationshipsneighborhood decision error rate
PACS:
TP18
DOI:
10.3969/j.issn.1672-1292.2022.04.002
Abstract:
In many practical application fields,there are numerous scenes in which the entire feature space cannot be available in advance,candidate features flow into the feature space dynamically over time,and the number of samples is fixed. At the same time,there exists a hierarchical structure relationship between classes,and traditional feature selection methods cannot be able to meet the demand. Based on these,an online streaming feature selection algorithm for hierarchical classification learning is presented. Firstly,a decision error rate calculation formula is designed on the basis of the largest nearest neighbor according to sibling relationships. Secondly,two online evaluation criteria of online significance selection and online relevance analysis are proposed to select features with minimum decision error. Finally,experimental results on six hierarchical datasets manifest that the proposed algorithm is better than some existing online streaming feature selection algorithms.

References:

[1]胡清华,王煜,周玉灿,等. 大规模分类任务的分层学习方法综述[J]. 中国科学:信息科学,2018,48(5):7-20.
[2]赵红. 面向层次结构数据的特征选择方法[D]. 天津:天津大学,2019.
[3]FREEMAN C,KULIC D,BASIR O. Joint feature selection and hierarchical classifier design[C]//Proceedings of 2011 IEEE International Conference on Systems,Man and Cybernetics. Anchorage,USA:IEEE,2011.
[4]SONG J,ZHANG P Z,QIN S J,et al. A method of the feature selection in hierarchical text classification based on the category discrimination and position information[C]//Proceedings of 2015 International Conference on Industrial Informatics-Computing Technology,Intelligent Technology,Industrial Information Integration. Wuhan,China:ICIICII,2015.
[5]PAN S R,WU J,ZHU X Q. Cogboost:boosting for fast cost-sensitive graph classification[J]. IEEE Transactions on Knowledge & Data Engineering,2015,27(11):2933-2946.
[6]ZHAO H,ZHU P F,WANG P,et al. Hierarchical feature selection with recursive regularization[C]//Proceedings of the 26th International Joint Conference on Artificial Intelligence. Melbourne,Australia:AAAI Press,2017:3483-3489.
[7]ZHOU J,FOSTER D P,STINE R A,et al. Streamwise feature selection[J]. Journal of Machine Learning Research,2006,7(1):1861-1885.
[8]YU K,WU X D,DING W,et al. Scalable and accurate online feature selection for big data[J]. ACM Transactions on Knowledge Discovery from Data,2016,11(2):1-39.
[9]LIN Y J,HU Q H,LIU J H,et al. Streaming feature selection for multi-label learning based on fuzzy mutual information[J]. IEEE Transactions on Fuzzy Systems,2017,25(6):1491-1507.
[10]LIU J H,LIN Y J,WU S X,et al. Online multi-label group feature selection[J]. Knowledge-Based Systems,2018,143:42-57.
[11]LI H G,WU X D,LI Z,et al. Group feature selection with streaming features[C]//Proceedings of 2013 IEEE 13th International Conference on Data Mining. Dallas,USA:IEEE,2013.
[12]HU Q H,PEDRYCZ W,YU D R,et al. Selecting discrete and continuous features based on neighborhood decision error minimization[J]. IEEE Transactions on Systems Man & Cybernetics(Part B),2010,40(1):137-150.
[13]EISNER R,POULIN B,SZAFRON D,et al. Improving protein function prediction using the hierarchical structure of the gene ontology[C]//Procceedings of IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology. La Jolla,USA:IEEE,2005.
[14]CECI M,MALERBA D. Classifying web documents in a hierarchy of categories:a comprehensive study[J]. Journal of Intelligent Information Systems,2007,28(1):37-78.
[15]WU X D,YU K,DING W,et al. Online feature selection with streaming features[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence,2013,35(5):1178-1192.
[16]ZHOU P,HU X G,LI P P,et al. OFS-Density:A novel online streaming feature selection method[J]. Pattern Recognition,2019,86:48-61.
[17]ZHOU P,HU X G,LI P P,et al. Online feature selection for high-dimensional class-imbalanced data[J]. Knowledge-Based Systems,2017,136:187-199.
[18]ZHOU P,HU X G,LI P P. A new online feature selection method using neighborhood rough set[C]//Proceedings of 2017 IEEE International Conference on Big Knowledge. Hefei,China:IEEE,2017.

Memo

Memo:
-
Last Update: 2022-12-15