Journal of Chongqing Technology and Business University (Natural Science Edition)

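Cite this article: ZHANG Hui-yi, ZHANG Jin, HUANG Jun. Multi-label Image Classification Model Based on Graph Attention Network[J]. Journal of Chongqing Technology and Business University (Natural Science Edition), 2022, 39(1): 34-41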
Multi-label Image Classification Model Based on Graph Attention Network

ZHANG Hui-yi, ZHANG Jin, HUANG Jun

School of Computer Science and Technology, Anhui University of Technology, Maanshan, Anhui 243000, China

Abstract:

ML-GCN has two shortcomings: the excessively high dimension of its label co-occurrence embedding degrades classification performance, and it does not fully exploit the asymmetric relationships between labels. To address both, a multi-label image classification model based on a graph attention network, ML-GAT, is proposed. ML-GAT first reduces the dimensionality of the high-dimensional label semantic embedding matrix. It then obtains the label co-occurrence embedding from the resulting low-dimensional label semantic embeddings and the label category co-occurrence graph. In parallel, ML-GAT feeds the original multi-label image into a convolutional neural network to extract general image features, and unifies the dimension of these features with that of the label co-occurrence embedding computed by the graph attention network. Finally, ML-GAT fuses the label co-occurrence embedding with the general image features to obtain a label prediction score for each multi-label image. Experimental results on VOC 2007 and MS-COCO 2014 show that ML-GAT achieves good results when training samples are sufficient and the number of label categories is large enough. Comparison with other models indicates that the strategies adopted by ML-GAT can improve multi-label image classification performance to a certain extent.

Key words: multi-label classification; graph attention network; convolutional neural network; deep learning
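
The pipeline described in the abstract can be sketched in a few lines of plain Python. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the abstract does not say how the label embeddings are reduced, how many attention heads or layers are used, or how the two branches are fused. Here the reduction is assumed to be a linear projection (`P_label`), the graph attention step is a single-head layer in the standard graph attention formulation, the CNN output is stood in for by a plain feature vector projected with `P_image`, and fusion is assumed to be a dot product. All function and parameter names are hypothetical.

```python
import math

def matvec(M, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x

def softmax(xs):
    m = max(xs)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def gat_layer(H, adj, W, a):
    """Single-head graph attention layer (assumed configuration).

    H:   one feature row per label
    adj: adj[i][j] is truthy if label i attends to label j; the matrix
         may be asymmetric, so the relation i -> j need not imply j -> i
    W:   shared linear projection applied to every label
    a:   attention vector of length 2 * (output dimension)
    """
    Z = [matvec(W, h) for h in H]
    out = []
    for i, zi in enumerate(Z):
        # Each label attends to itself and to its out-neighbours.
        nbrs = [j for j in range(len(Z)) if j == i or adj[i][j]]
        logits = [leaky_relu(sum(w * x for w, x in zip(a, zi + Z[j])))
                  for j in nbrs]
        alpha = softmax(logits)
        out.append([sum(al * Z[j][k] for al, j in zip(alpha, nbrs))
                    for k in range(len(zi))])
    return out

def predict_scores(label_emb, adj, image_feat, P_label, W, a, P_image):
    """Return one score per label for a single image feature vector."""
    # 1. Reduce the label semantic embeddings (assumed: linear projection).
    low = [matvec(P_label, e) for e in label_emb]
    # 2. Label co-occurrence embedding from the graph attention layer.
    co = gat_layer(low, adj, W, a)
    # 3. Bring the image feature to the same dimension as the label embedding.
    img = matvec(P_image, image_feat)
    # 4. Fuse the two branches (assumed: dot product per label).
    return [sum(c * x for c, x in zip(row, img)) for row in co]
```

Because `adj` is read row by row, setting `adj[0][1] = 1` while leaving `adj[1][0] = 0` lets label 0 draw on label 1 without the reverse, which is one simple way to represent the asymmetric label relationships the abstract mentions.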
