基于级联注意力的结肠息肉图像分割算法研究
DOI:
作者:
作者单位:

作者简介:

通讯作者:

基金项目:


Research on Colon Polyp Image Segmentation Algorithm Based on Cascaded Attention
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    目的 针对现有 Transformer 模型在息肉图像分割中存在注意力分散以及作为编码器提取的多级特征在融合时易产生信息丢失导致的分割精度不高的问题,提出一种新的分割模型 PVT-CAMNet。 方法 在该模型中,使用金 字塔式 Transformer(Pyramid Vision Transformer, PVT)作为编码器,接着设计了多尺度特征注意力提取模块(Multiscale Feature Attention Extraction,MFAE)和层间注意力聚合模块(Inter-layer Attention Aggregation, IA)。 其中,PVT通过其自注意力机制保证了模型的泛化能力,MFAE 使用不同大小的滤波器多尺度提取特征,旨在缓解注意力分散问题;IA 交互融合不同层级特征,有效解决多级特征融合产生的信息丢失问题;最后引入全局上下文模块 (Global Context,GC) 使模型更好地理解特征图之间的像素依赖关系。 结果 在 Kvasir、CVC - ClinicDB、CVC -ColonDB 和 ETIS 数据集上进行了评估,相较于最优基线模型,mDice、mIoU 分别提高了 1. 76%、0. 81%、1. 51%、 1. 74%、3. 15%、2. 65% 和 1. 73%、3. 84%。 结论 PVT-CAMNet 的学习性能和泛化性能均优于其他先进方法,在息肉图像分割上具有一定的应用价值。

    Abstract:

    Objective Aiming at the problems of scattered attention in existing Transformer models for polyp image segmentation and the low segmentation accuracy caused by information loss during the fusion of multi-level features extracted by the encoder a new segmentation model named PVT-CAMNet is proposed. Methods In this model the Pyramid Vision Transformer PVT was used as the encoder. Then a Multi-scale Feature Attention Extraction MFAE module and an Inter-layer Attention Aggregation IA module were designed. Among them the PVT ensured the generalization ability of the model through its self-attention mechanism. The MFAE used filters of different sizes to extract features at multiple scales to alleviate the problem of scattered attention. The IA interactively fused features at different levels to effectively solve the problem of information loss caused by the fusion of multi-level features. Finally a Global Context GC module was introduced to enable the model to better understand the pixel dependency relationship between feature maps. Results Evaluations were carried out on the Kvasir CVC-ClinicDB CVC-ColonDB and ETIS datasets.Comparing the performance of the proposed PVT-CAMNet model with that of the optimal baseline model the mDice valuesof PVT-CAMNet increased by 1. 76% 1. 51% 3. 15% and 1. 73% respectively and the mIoU values of PVT-CAMNet increased by 0. 81% 1. 74% 2. 65% and 3. 84% respectively on these four datasets. Conclusion PVT-CAMNet is superior to other advanced methods in both learning performance and generalization capability demonstrating significant application value in polyp image segmentation.

    参考文献
    相似文献
    引证文献
引用本文

周孟然a, 陆 鹏b.基于级联注意力的结肠息肉图像分割算法研究[J].重庆工商大学学报(自然科学版),2026,43(1):1-10
ZHOU Mengrana, LU Pengb. Research on Colon Polyp Image Segmentation Algorithm Based on Cascaded Attention[J]. Journal of Chongqing Technology and Business University(Natural Science Edition),2026,43(1):1-10

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-03-09
×
2025年《中国学术期刊影响因子年报》发布