基于词典和多特征融合的中文医学命名实体识别
DOI:
作者:
作者单位:

作者简介:

通讯作者:

基金项目:


Chinese Medical Named Entity Recognition Based on Lexicon and Multi-feature Fusion
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
    摘要:

    目的 针对现有方法中存在因分词导致级联错误从而影响实体识别效果,以及如何构建并融合高质量医学 实体特征的问题,提出一个基于词典和多特征融合的中文医学命名实体识别模型。 方法 该模型首先利用词典匹配 机制和 Lattice 点阵结构来融合字符与医学词汇信息,利用字词的相对位置信息获取相对位置嵌入,并对汉字拼音 进行编码得到拼音嵌入;然后提出一个融合 Transformer 模型来挖掘不同特征之间的互补性,以增强词汇信息并促 进字词信息和拼音信息更好地融合;最后,将融合多特征的字符表示输入到条件随机场中来获得预测的标签。 结果 在 CCKS-2019 和 Resume 数据集上的实验结果表明,该方法在多个指标上均得到了较好的提升。 结论 避免了分词 错误对命名实体识别效果造成的影响,通过融合 Transformer 模型更好地融合了多种医学实体特征,加强了模型识 别词边界的能力,进而提高了模型识别医学实体的准确率,为后续构建医学知识图谱,实现智能化医学诊断提供了 帮助。

    Abstract:

    Objective A Chinese medical named entity recognition model based on lexicon and multi-feature fusion is proposed to address the problems in existing methods including the cascading errors caused by word segmentation that affect the entity recognition effect and the issue of how to construct and fuse high-quality medical entity features. Methods Firstly the model fused the information of characters and medical vocabulary with the lexicon matching mechanism and Lattice structure. It obtained relative position embeddings by using the relative position information of words and characters and encoded Chinese character pinyin to get pinyin embeddings. Then a fusion Transformer model was proposed to mine the complementarity between different features so as to enhance the vocabulary information and promote bettter fusion of word and characyer information and pinyin information.Finally,the character representation fused with multiple festures was input into a conditional random field to obtain predicted labels.Result Experimental results on the CCKS-2019 and Resume datasets demonstrated that the proposed method achieved notable improvements across multiple evaluation metrics. Conclusion The proposed method effectively avoids the negative impact of word segmentation errors on named entity recognition. It achieves efficient fusion of various medical entity features through the fusion Transformer model. As a result it significantly enhanced the model?? s ability to recognize word boundaries thereby improving the accuracy of medical entity recognition. This provides strong support for the subsequent construction of medical knowledge graphs and the realization of intelligent medical diagnosis.

    参考文献
    相似文献
    引证文献
引用本文

雷宇翔,廖 涛.基于词典和多特征融合的中文医学命名实体识别[J].重庆工商大学学报(自然科学版),2026,43(2):27-34
LEI Yuxiang LIAO Tao. Chinese Medical Named Entity Recognition Based on Lexicon and Multi-feature Fusion[J]. Journal of Chongqing Technology and Business University(Natural Science Edition),2026,43(2):27-34

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
历史
  • 收稿日期:
  • 最后修改日期:
  • 录用日期:
  • 在线发布日期: 2026-04-03
×
2025年《中国学术期刊影响因子年报》发布