引用本文:张辉宜1,夏媛龙1,周克武2,包向华2,陶 陶1.一种融合标签间强相关性的多标签图像分类方法(J/M/D/N,J:杂志,M:书,D:论文,N:报纸).期刊名称,2023,40(5):8-15
CHEN X. Adap tive slidingmode contr ol for discrete2ti me multi2inputmulti2 out put systems[ J ]. Aut omatica, 2006, 42(6): 4272-435
【打印本页】   【下载PDF全文】   查看/发表评论  【EndNote】   【RefMan】   【BibTex】
←前一篇|后一篇→ 过刊浏览    高级检索
本文已被:浏览 887次   下载 2010 本文二维码信息
码上扫一扫!
分享到: 微信 更多
一种融合标签间强相关性的多标签图像分类方法
张辉宜1,夏媛龙1,周克武2,包向华2,陶 陶1
1. 安徽工业大学 计算机科学与技术学院,安徽 马鞍山 243000 2. 马鞍山市公安局,安徽 马鞍山 243000
摘要:
为了将标签间的语义相关性引入多标签图像分类模型中,传统的方法例如 ML-GCN 通过设置单阈值将标 签条件概率矩阵二值化为标签共现矩阵,然而,仅设置单阈值很难归纳所有的标签语义关系情况。 针对这一问题, 提出一种融合标签间强相关性的多标签图像分类方法—MGAN(Multiple Graph Convolutional Attention Networks), 通过设置多个阈值,将传统的标签条件概率矩阵按照不同的相关性程度分割为多个子图;同时,为了提升多标签分 类性能,也引入图像区域空间相关性。 另外,针对传统的“CNN+GCN”方法将标签与特征的融合张量视为预测分数 缺乏可解释性问题,将标签与特征的融合张量视为注意力分数;在 MS-COCO 和 PASCAL VOC 数据集上与其他主 流多标签图像分类方法进行了对比实验,平均准确率分别达到了 94. 9%和 83. 7%,相较于经典 ML-GCN 模型,分 别获得了 0. 9%和 0. 8%准确率提升,且在“Binary”和“Re-weighted”邻接矩阵模式下,MGAN 都有较好的表现,验证 了新的融合方法可以缓解图卷积神经网络过平滑问题对多标签图像分类的影响。
关键词:  多标签图像分类  语义相关性  图卷积网络  注意力机制  区域空间相关性
DOI:
分类号:
基金项目:
A Method of Multi-label Image Classification with Fusing Powerful Semantic Correlation
ZHANG Huiyi1, XIA Yuanlong1, ZHOU Kewu2, BAO Xianghua2, TAO Tao1
1. School of Computer Science and Technology Anhui University of Technology Anhui Maanshan 243000 China 2. Ma’anshan Public Security Bureau Anhui Maanshan 243000 China
Abstract:
In order to introduce semantic correlation between labels into multi-label image classification model traditional methods such as ML-GCN transform label conditional probability matrix into label co-occurrence matrix by using single threshold value. However it is difficult to sum up all semantic relationships of all labels by using single threshold value. To solve this problem a method of multi-label image classification with fusing powerful semantic correlation MGAN was proposed. By setting multiple thresholds the traditional conditional probability matrix of labels was divided into multiple subgraphs according to different degrees of correlation. Meanwhile in order to improve the performance of multi-label classification image region spatial correlation was also introduced. In addition the traditional ??CNN +GCN method regards the fusion tensor of label and feature as the lack of interpretability of the predicted fraction. To solve this problem MGAN regards the labels and feature?? s fusion tensor as the attention score. Compared with other mainstream multi-label image classification methods on MS-COCO and PASCAL VOC datasets the mAP were 94. 9% and 83. 7% respectively which were 0. 9% and 0. 8% higher than traditional ML-GCN model. And MGAN performed well in both ??Binary and??Re-weighted adjacency matrix mode which verified that the new fusion method can alleviate the influence of graph convolutional neural network?? s ??over smoothing problem on multi-label image classification.
Key words:  multi-label image classification  semantic correlation  graph convolutional network  attention mechanism  regional spatial correlation
重庆工商大学学报(自然科学版) 版权所有
地址:中国 重庆市 南岸区学府大道19号 重庆工商大学学术期刊社 邮编:400067
电话:023-62769495 传真:
您是第5738754位访客
关注微信二维码
重庆工商大学学报(自然科学版)
引用本文:
【打印本页】   【下载PDF全文】   查看/发表评论  【EndNote】   【RefMan】   【BibTex】
←前一篇|后一篇→ 过刊浏览    高级检索
本文已被:浏览次   下载  
分享到: 微信 更多
摘要:
关键词:  
DOI:
分类号:
基金项目:
Abstract:
Key words:  
重庆工商大学学报(自然科学版) 版权所有
地址:中国 重庆市 南岸区学府大道19号 重庆工商大学学术期刊社 邮编:400067
电话:023-62769495 传真:
您是第5743601位访客
关注微信二维码