基于狄利克雷变分自编码的深度嵌入聚类
作者:

Deep Embedded Clustering Based on Dirichlet Variational Autoencoder
Author:
  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
    摘要:

    目的 基于深度神经网络的聚类模型由于能从原始数据中学习到有效特征,在各种无监督应用中受到了广 泛关注。 针对现有的基于自编码的聚类模型没有生成能力,且通常以高斯分布作为先验,限制了对多模态特征的 表达能力问题,提出一种深度嵌入聚类模型———DVADEC( Deep Embedded Clustering based on Dirichlet Variational Autoencoder) ,该模型将狄利克雷变分自编码器的表征学习能力和嵌入聚类的聚类能力结合到一个统一的模型中。 方法 首先,在预训练阶段,利用狄利克雷分布的多模态特性,将其作为先验分布来指导隐变量的学习过程;然后, 将训练好的权重加载到聚类模型中,并通过在隐藏空间中嵌入聚类层来进行类别分配;最后,通过交替优化目标函 数来微调网络,以提升聚类结果。 结果 实验结果显示:DVADEC 模型在 4 个基准数据集上展现出较好的聚类性能, 其中在 MNIST 图像数据集上达到了 97. 13%的准确率,在 REUTER-10k 文本数据集上达到了 80. 1%的准确率。 另 外,可视化结果显示潜在特征具有明显的可分性,且根据特征生成的样本轮廓清晰、平滑多样。 结论 DVADEC 模 型融合了生成能力和多模态特征的表达能力,并显著提高了特征提取和聚类性能,为数据挖掘和模式识别领域提 供了新的思路和技术手段。

    Abstract:

    Objective Deep neural network-based clustering models capable of learning effective features from raw data have received widespread attention in various unsupervised applications. Existing autoencoder-based clustering models lack generative ability and generally use Gaussian distribution as a prior limiting the expression of multimodal features. This paper proposes a deeply embedded clustering model—DVADEC Deep Embedded Clustering based on Dirichlet Variational Autoencoder which integrates the representation learning capability of Dirichlet variational autoencoder and the clustering capability of embedded clustering into a unified model. Methods Firstly during the pre-training phase the multimodal nature of Dirichlet distribution is utilized as a prior distribution to guide the learning process of latent variables. Then the trained weights are loaded into the clustering model and class assignments are performed by embedding clustering layers in the latent space. Finally the network is fine-tuned through alternating optimization of the objective function to enhance clustering results. Results Experimental results demonstrate that the DVADEC model exhibits good clustering performance on four benchmark datasets achieving an accuracy of 97. 13% on the MNIST image dataset and an accuracy of 80. 1% on the REUTER-10k text dataset. Furthermore visualization results demonstrate clear separability of latent features and samples generated based on features exhibit distinct smooth and diverse contours. Conclusion The DVADEC model integrates generative capability and the ability to express multimodal features significantly enhancing feature extraction and clustering performance. It provides new perspectives and technical means for the fields of data mining and pattern recognition.

    参考文献
    相似文献
    引证文献
引用本文

李必嘉 ,吴昊旻.基于狄利克雷变分自编码的深度嵌入聚类[J].重庆工商大学学报(自然科学版),2025,42(3):52-62
LI Bijia, WU Haomin. Deep Embedded Clustering Based on Dirichlet Variational Autoencoder[J]. Journal of Chongqing Technology and Business University(Natural Science Edition),2025,42(3):52-62

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 在线发布日期: 2025-05-14
×
2024年《重庆工商大学学报(自然科学版)》影响因子显著提升