Issue |
JNWPU
Volume 37, Number 3, June 2019
|
|
---|---|---|
Page(s) | 465 - 470 | |
DOI | https://doi.org/10.1051/jnwpu/20193730465 | |
Published online | 20 September 2019 |
Classification of Few Labeled Images Based on Integrated GMM Clustering
基于集成GMM聚类的少标记样本图像分类
1
School of Astronautics, Northwestern Polytechnical University, Xi’an, 710072, China
2
Air Defense Academy, Air Force Engineering University, Xi'an 710043, China
Received:
4
April
2018
In order to improve the classifier classification accuracy of by using convolutional neural network training, a large amount of labeled data is often required, but sometimes labeled data is not easily obtained.This paper proposes a solution based on the idea of integrated GMM clustering and label delivery for classifying images with few labeled samples, assigning tags to unlabeled data through certain rules, and converting unlabeled data into labeled data for training of the model.In this paper, experiments are performed on hand-written digital recognition data sets. The results show that the present algorithm has a great improvement in the accuracy of model classification comparing with the method of using only labeled samples in the case of few labeled samples. The effectiveness of the present algorithm is validated.
摘要
为了提高卷积神经网络训练的分类器分类准确率,往往需要大量的已标记数据,但有时已标记数据并不容易获得。针对少标记样本图像分类问题,提出基于集成GMM聚类与标签传递思想的解决方案,通过一定的规则给未标记数据赋予标签,将未标记数据转换成已标记数据用于模型的训练。在手写数字识别数据集上进行实验,结果表明新算法在少标记样本的情况下,结合集成GMM聚类的方法比只采用有标记样本训练得到的模型分类准确率有着较大提高,验证了该算法的有效性。
Key words: integrated GMM clustering / few labeled samples / voting rules
关键字 : 集成GMM聚类 / 少标记样本 / 投票规则
© 2019 Journal of Northwestern Polytechnical University. All rights reserved.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Current usage metrics show cumulative count of Article Views (full-text article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 48-96 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.