Issue |
JNWPU
Volume 38, Number 3, June 2020
|
|
---|---|---|
Page(s) | 471 - 477 | |
DOI | https://doi.org/10.1051/jnwpu/20203830471 | |
Published online | 06 August 2020 |
Feature Selection on Maximum Information Coefficient for Underwater Target Recognition
一种水下目标识别的最大信息系数特征选择方法
1
School of Marine Science and Technology, Northwestern Polytechnical University, Xi'an 710072, China
2
School of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'an 710021, China
Received:
1
July
2019
Feature selection is an essential process in the identification task because the irrelevant and redundant features contained in the unselected feature set can reduce both the performance and efficiency of recognition. However, when identifying the underwater targets based on their radiated noise, the diversity of targets, and the complexity of underwater acoustic channels introduce various complex relationships among the extracted acoustic features. For this problem, this paper employs the normalized maximum information coefficient (NMIC) to measure the correlations between features and categories and the redundancy among different features and further proposes an NMIC based feature selection method (NMIC-FS). Then, on the real-world dataset, the average classification accuracy estimated by models such as random forest and support vector machine is used to evaluate the performance of the NMIC-FS. The analysis results show that the feature subset obtained by NMIC-FS can achieve higher classification accuracy in a shorter time than that without selection. Compared with correlation-based feature selection, laplacian score, and lasso methods, the NMIC-FS improves the classification accuracy faster in the process of feature selection and requires the least acoustic features to obtain classification accuracy comparable to that of the full feature set.
摘要
由于未经选择的特征集合中包含的无关特征和冗余特征会导致识别性能和识别效率的下降,特征选择是识别任务中的重要步骤。然而,基于辐射噪声识别水下目标时,由于目标的多样性和水声信道的复杂性,提取的声学特征之间存在多种线性相关之外的复杂关系。针对此问题,以归一化最大信息系数度量特征与类别之间的相关度以及特征之间的冗余度,提出了基于归一化最大信息系数的特征选择方法(NMIC-FS),并在实测数据集上以随机森林和支持向量机等模型估计的平均分类精度评估其性能。水下目标数据分析结果表明,与未选择前相比,NMIC-FS所得特征子集性能在更短的分类时间得到更高的分类正确率。与相关特征选择法、拉普拉斯分数法和套索法等方法相比,NMIC-FS在特征选择过程中能更迅速地提升分类正确率,可用更少的特征得到与使用特征全集时相当的分类正确率。
Key words: feature selection / ship-radiated noise / maximum correlation coefficient
关键字 : 特征选择 / 舰船辐射噪声 / 最大相关系数
© 2019 Journal of Northwestern Polytechnical University. All rights reserved.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.