基于联邦多模态的中文医学影像报告自动生成模型

计算机工程与科学

• • 下一篇

基于联邦多模态的中文医学影像报告自动生成模型

骆练, 黄保华, 张志诚, 金强, 王晓华

准确、高效地自动生成医学影像报告成为提高诊疗效率、缓解医疗资源短缺之关键。目前，医学影像报告自动生成研究以英文为主，无法满足我国的临床需求，且存在医疗数据共享与患者隐私保护矛盾和全新训练模型成本高昂问题。为此，提出基于联邦多模态的中文医学影像报告自动生成模型，该模型以联邦学习为框架，以多模态大模型VisualGLM-6B为基座，采用低秩自适应大模型微调技术，允许医疗机构在不暴露本地数据前提下微调以优化模型性能。同时，本文设计了一种联邦自适应算法，用于动态计算不同机构模型聚合系数以提高模型的泛化性能。实验结果表明，本文模型在MIMIC-CXR内部测试集上的BLEU-1、BLEU-2、BLEU-3、BLEU-4、METEOR和ROUGE指标相较于基线XrayGLM分别提高了18.06%，17.85%，11.37%，1.61%，13.44%和5.06%，且在外部测试数据集OPEN-I上也表现出良好的泛化性能。

出版日期:2025-06-10 发布日期:2025-06-10

A federated multimodal model based on automatic generation of Chinese medical image reports

LUO Lian, HUANG Baohua, ZHANG Zhicheng, JIN Qiang, WANG Xiaohua

(1.School of Computer and Electronic Information, Guangxi University, Nanning 530004,Guangxi, China;
2.Information Department, Affiliated Hospital of Zunyi Medical University, Zunyi 563000, Guizhou, China;
3.Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen,518000,Guangdong,China;
4.School of Medical Information Engineering, Zunyi Medical University, Zunyi 563000, Guizhou, China)

Online:2025-06-10 Published:2025-06-10

摘要/Abstract

摘要： 医学影像报告自动生成; 联邦学习; 多模态大模型微调; 医疗数据共享; 患者隐私保护

关键词: automatic generation of medical image reports, federated learning, multimodal large model fine-tuning, medical data sharing, patient privacy protection

Abstract: Accurate and efficient automatic generation of medical image reports has become the key to improve diagnosis and treatment efficiency and alleviate the shortage of medical resources. At present, the research on automatic generation of medical image reports is mainly in English, which cannot meet the clinical needs of China, and there are conflicts between medical data sharing and patient privacy protection and high training cost of new models. To this end, we proposed a federated multimodal Chinese medical image report automatic generation model, which uses federated learning as the framework, the multimodal large model VisualGLM-6B as the base, and low-rank adaptive large model fine-tuning technology, which allows healthcare organisations to fine-tune to optimise the model performance without exposing the local data. Meanwhile, we designed a federated adaptive algorithm for dynamically calculating model aggregation coefficients for different institutions to improve the generalisation performance of the model. The experimental results show that the BLEU-1, BLEU-2, BLEU-3, BLEU-4, METEOR and ROUGE metrics of this paper's model on the internal test set of MIMIC-CXR are improved by 18.06%, 17.85%, 11.37%, 1.61%, 13.44%, and 5.06%, respectively, compared to the baseline XrayGLM, and that the performance of the model on the external test dataset OPEN-I also shows good generalisation performance.

骆练, 黄保华, 张志诚, 金强, 王晓华. 基于联邦多模态的中文医学影像报告自动生成模型[J]. 计算机工程与科学.

LUO Lian, HUANG Baohua, ZHANG Zhicheng, JIN Qiang, WANG Xiaohua. A federated multimodal model based on automatic generation of Chinese medical image reports[J]. Computer Engineering & Science.

基于联邦多模态的中文医学影像报告自动生成模型

A federated multimodal model based on automatic generation of Chinese medical image reports

PDF

可视化

摘要/Abstract

引用本文

使用本文

相关文章 0

编辑推荐

Metrics

本文评价