Journal of Chongqing Technology and Business University (Natural Science Edition)

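Cite this article: LI Bijia (李必嘉), WU Haomin (吴昊旻). Deep Embedded Clustering Based on Dirichlet Variational Autoencoder[J]. Journal of Chongqing Technology and Business University (Natural Science Edition), 2025, 42(3): 52-62.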

Deep Embedded Clustering Based on Dirichlet Variational Autoencoder

LI Bijia¹, WU Haomin²

1. School of Mathematical Science, Chongqing Normal University, Chongqing 401331, China
2. National Center for Applied Mathematics in Chongqing, Chongqing Normal University, Chongqing 401331, China

Abstract:

Objective: Clustering models based on deep neural networks have received widespread attention in unsupervised applications because they can learn effective features from raw data. Existing autoencoder-based clustering models, however, lack generative ability and generally use a Gaussian prior, which limits their capacity to express multimodal features. This paper proposes a deep embedded clustering model, DVADEC (Deep Embedded Clustering based on Dirichlet Variational Autoencoder), which combines the representation learning capability of the Dirichlet variational autoencoder with the clustering capability of embedded clustering in a single unified model.

Methods: First, in the pre-training phase, the Dirichlet distribution is used as the prior, exploiting its multimodal nature to guide the learning of the latent variables. Then, the trained weights are loaded into the clustering model, and cluster assignments are made by embedding a clustering layer in the latent space. Finally, the network is fine-tuned by alternately optimizing the objective function to improve the clustering results.

Results: Experiments show that DVADEC achieves good clustering performance on four benchmark datasets, reaching an accuracy of 97.13% on the MNIST image dataset and 80.1% on the REUTER-10k text dataset. In addition, visualizations show that the latent features are clearly separable, and that samples generated from these features have clear outlines and are smooth and diverse.

Conclusion: DVADEC combines generative capability with the ability to express multimodal features and significantly improves feature extraction and clustering performance, offering new ideas and technical means for data mining and pattern recognition.
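The abstract does not spell out the form of the clustering layer or the fine-tuning objective. As a rough illustration only, the sketch below assumes the standard Deep Embedded Clustering recipe (Student's t soft assignments, a sharpened target distribution, and a KL divergence loss) applied to latent codes that lie on the simplex, as Dirichlet samples do. The toy data, the parameter `nu`, the centroid initialization, and all function names are hypothetical and are not taken from the paper.

```python
import numpy as np

def soft_assign(z, centroids, nu=1.0):
    """Student's t soft assignment q[i, j] of latent code i to cluster j."""
    # Squared Euclidean distance between every code and every centroid.
    d2 = ((z[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    q = (1.0 + d2 / nu) ** (-(nu + 1.0) / 2.0)
    return q / q.sum(axis=1, keepdims=True)

def target_distribution(q):
    """Sharpened targets: square q, divide by soft cluster size, renormalize."""
    w = q ** 2 / q.sum(axis=0)
    return w / w.sum(axis=1, keepdims=True)

def kl_clustering_loss(p, q, eps=1e-12):
    """KL(P || Q), averaged over samples."""
    return float(np.mean(np.sum(p * (np.log(p + eps) - np.log(q + eps)), axis=1)))

rng = np.random.default_rng(0)

# Toy stand-in for encoder outputs (assumption): two groups of Dirichlet
# samples concentrated near different vertices of the 3-simplex.
z = np.vstack([
    rng.dirichlet([8.0, 1.0, 1.0], size=50),
    rng.dirichlet([1.0, 1.0, 8.0], size=50),
])

# Hypothetical initialization: group means (k-means is the usual choice).
centroids = np.stack([z[:50].mean(axis=0), z[50:].mean(axis=0)])

q = soft_assign(z, centroids)      # soft cluster memberships
p = target_distribution(q)         # fixed targets for the next update steps
loss = kl_clustering_loss(p, q)    # quantity minimized during fine-tuning
labels = q.argmax(axis=1)          # hard cluster assignment per sample
```

In an alternating scheme of this kind, the targets `p` are typically held fixed for a number of gradient steps on the encoder and centroids, then recomputed from the updated `q`. Whether DVADEC follows exactly this schedule is not stated in the abstract.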

Key words: deep clustering; unsupervised learning; neural networks; Dirichlet distribution; variational autoencoder