论文中文题名: | 基于小样本学习的医学图像分类方法研究 |
姓名: | |
学号: | 22207223093 |
保密级别: | 公开 |
论文语种: | chi |
学科代码: | 085400 |
学科名称: | 工学 - 电子信息 |
学生类型: | 硕士 |
学位级别: | 工学硕士 |
学位年度: | 2025 |
培养单位: | 西安科技大学 |
院系: | |
专业: | |
研究方向: | 计算机视觉 |
第一导师姓名: | |
第一导师单位: | |
论文提交日期: | 2025-06-16 |
论文答辩日期: | 2025-06-06 |
论文外文题名: | Research on Medical Image Classification Methods Based on Few-Shot Learning |
论文中文关键词: | |
论文外文关键词: | Few-shot learning ; Medical Image classification ; Spatial transfor mation network ; Wavelet transfonn ; Multi-scale feature extraction |
论文中文摘要: |
近年来,随着卷积神经网络的快速发展,基于深度学习的医学影像分析成为了计算机领域研究的热点。然而,由于保护患者隐私、标注成本高及病症样本不足等问题,制作大规模医学图像数据集是非常困难的,深度学习模型无法进行充分训练,从而导致特征提取能力不足。现有的方法主要通过数据增强来增加训练样本,这种做法不但费时费力,还可能引入噪声数据。此外,医学图像具有复杂的结构特征和较高的类间相似度,使得分类难度进一步增加。因此,如何利用有限标注样本对医学图像进行准确分类成为了当前的研究热点之一。 (1)针对医学图像标注数据较少导致模型特征提取能力不足的问题,本文提出了一种基于空间变换网络的小样本医学图像分类模型STW-ResNet。首先,模型采用了“预训练微调+元学习”的小样本学习架构,并设计了训练难度递进的两阶段预训练策略,使模型能够更有效地学习源域特征。其次引入空间变换网络模块,通过仿射变换自适应放大病变区域,增强网络对图像空间变化的适应能力,提高边缘特征提取能力。最后,提出结合特征分布校准和最邻近质心算法的特征变换分类器,简化分类过程的同时提升了分类精度。在ISIC2018皮肤病变数据集和Pap-smear宫颈细胞数据集上对模型进行验证,与基线模型相比,STW-ResNet在ISIC2018数据集上的平均分类精度提高了2.50%;在Pap-smear数据集上的平均分类精度提高了3.91%。 (2)针对医学图像复杂度高、类间相似度大导致模型分类精度不高的问题,本文提出了一种基于多尺度小波特征融合的双分支小样本医学图像分类模型MSTWs-ResNet。首先,通过引入多尺度特征提取网络,捕获图像在不同尺度下的特征信息,有效提高了模型对于复杂特征的识别能力,并结合小波变换特征融合模块,解决不同尺度特征融合时产生的信息丢失问题;其次,将多尺度特征提取网络与改进WideResNet结合,构建双分支特征提取网络来进一步增强模型的特征提取能力;最后,引入稀疏轴向MLP,降低模型参数量的同时,高效建立图像的全局依赖关系,增强对相似特征的区分能力。实验结果表明,MSTWs-ResNet能够在复杂特征和相似特征上提取到有效特征,在两个数据集上都取得了较好的分类精度。与基线模型相比,在ISIC2018数据集上的平均分类精度提高了3.20%;在Pap-smear数据集上的平均分类精度提高了6.73%。 |
论文外文摘要: |
In recent years, with the rapid development of convolutional neural networks, medical image analysis based on deep learning has become a hot topic in computer science research. However, due to issues such as protecting patient privacy, high annotation costs, and insufficient disease samples, it is very difficult to produce large-scale medical image datasets, and deep learning models cannot be fully trained, resulting in insufficient feature extraction capabilities. Existing methods mainly increase training samples through data augmentation, which is not only time-consuming and labor-intensive, but may also introduce noisy data. In addition, medical images have complex structural features and high inter-class similarity, which further increases the difficulty of classification. Therefore, how to accurately classify medical images using limited annotated samples has become one of the current research hotspots. (1) In response to the problem of insufficient model feature extraction capabilities due to the lack of annotated data for medical images, this paper proposes a few shot medical image classification model STW-ResNet based on a spatial transformer network. First, the model adopts a few shot learning architecture of "pre-training fine-tuning + meta-learning" and designs a two-stage pre-training strategy with progressive training difficulty, so that the model can learn source domain features more effectively. Secondly, the spatial transformer network module is introduced to adaptively enlarge the lesion area through affine transformation, enhance the network's adaptability to image spatial changes, and improve the edge feature extraction capability. Finally, a feature transformation classifier combining feature distribution calibration and nearest centroid algorithm is proposed to simplify the classification process and improve the classification accuracy. The model is verified on the ISIC2018 skin lesion dataset and the Pap-smear cervical cell dataset. Compared with the baseline model, the average classification accuracy of STW-ResNet on the ISIC2018 dataset is improved by 2.50%; the average classification accuracy on the Pap-smear dataset is improved by 3.91%. (2) In order to solve the problem of low model classification accuracy due to high complexity of medical images and large similarity between classes, this paper proposes a dual-branch few shot medical image classification model MSTWs-ResNet based on multi-scale wavelet feature fusion. Firstly, by introducing a multi-scale feature extraction network, the feature information of the image at different scales is captured, which effectively improves the model's ability to recognize complex features. In addition, the wavelet transform feature fusion module is combined to solve the problem of information loss when different scale features are fused. Secondly, the multi-scale feature extraction network is combined with the improved WideResNet to construct a dual-branch feature extraction network to further enhance the model's feature extraction ability. Finally, the sparse axial MLP is introduced to reduce the number of model parameters while efficiently establishing the global dependency of the image and enhancing the ability to distinguish similar features. Experimental results show that MSTWs-ResNet can extract effective features from complex features and similar features, and achieves good classification accuracy on both datasets. Compared with the baseline model, the average classification accuracy on the ISIC2018 dataset is improved by 3.20%; the average classification accuracy on the Pap-smear dataset is improved by 6.73%. |
参考文献: |
[2] 赵愉,王得旭,顾力栩.人工智能技术在计算机辅助诊断领域的发展新趋势[J].中国科学:生命科学,2020,50(11):1321-1334. [20] 袁媛,陈明惠,柯舒婷等.基于集成卷积神经网络和 Vit 的眼底图像分类研究[J].中国激光,2022,49(20):108-116. [38] 谢莉,舒卫平,耿俊杰,等.结合加权原型和自适应张量子空间的小样本宫颈细胞分类[J].计算机应用,2024,44(10):3200-3208. [46] 赵嘉晖.基于高维多目标优化的小样本皮肤癌元学习方法研究[D].太原科技大学,2024.DOI:10.27721/d.cnki.gyzjc.2024.000889. [53] 白文鑫.基于少样本学习的医学图像分类算法研究[D].北京邮电大学,2023.DOI:10.26969/d.cnki.gbydu.2023.000122. [63] Zagoruyko S, Komodakis N. Wide Residual Networks[J]. arXiv e-prints, 2016: arXiv: 1605.07146. [69] 韦世红, 刘红梅, 唐宏, 等. 多级度量网络的小样本学习[J]. 计算机工程与应用,2023,59(02): 94-101. |
中图分类号: | TP391.41 |
开放日期: | 2025-06-16 |