机构:[1]Hebei GEO Univ, Sch Informat Engn, Shijiazhuang 050031, Hebei, Peoples R China[2]Hebei GEO Univ, New Retail Joint Res Inst, Shijiazhuang 050031, Hebei, Peoples R China[3]Hebei Med Univ, Canc Res Inst, Hosp 4, Shijiazhuang 050000, Hebei, Peoples R China河北医科大学第四医院[4]Hebei Lanhui Technol Co Ltd, Shijiazhuang 050031, Hebei, Peoples R China
Early diagnosis and prevention of colorectal cancer rely on colonoscopic polyp examination.Accurate automated polyp segmentation technology can assist clinicians in precisely identifying polyp regions, thereby conserving medical resources. Although deep learning-based image processing methods have shown immense potential in the field of automatic polyp segmentation, current automatic segmentation methods for colorectal polyps are still limited by factors such as the complex and variable intestinal environment and issues related to detection equipment like glare and motion blur. These limitations result in an inability to accurately distinguish polyps from surrounding mucosal tissue and effectively identify tiny polyps. To address these challenges, we designed a multi-attention-based model, PVT-MA. Specifically, we developed the Cascading Attention Fusion (CAF) Module to accurately identify and locate polyps, reducing false positives caused by environmental factors and glare. Additionally, we introduced the Series Channels Coordinate Attention (SCC) Module to maximize the capture of polyp edge information. Furthermore, we incorporated the Receptive Field Block (RFB) Module to enhance polyp features and filter image noise.We conducted quantitative and qualitative evaluations using six metrics across four challenging datasets. Our PVT-MA model achieved top performance on three datasets and ranked second on one. The model has only 26.39M parameters, a computational cost of 10.33 GFlops, and delivers inference at a high speed of 47.6 frames per second (FPS).
基金:
Shijiazhuang Introducing High-level Talents' Startup Funding Project [248790067A]; Startup Foundation for PhD of Hebei GEO University [BQ201322]; Natural Science Foundation of Hebei Province [H2024403001]; Scientific Research Project of Hebei Provincial Department of Education [BJK2024099]
第一作者机构:[1]Hebei GEO Univ, Sch Informat Engn, Shijiazhuang 050031, Hebei, Peoples R China[2]Hebei GEO Univ, New Retail Joint Res Inst, Shijiazhuang 050031, Hebei, Peoples R China
共同第一作者:
通讯作者:
通讯机构:[1]Hebei GEO Univ, Sch Informat Engn, Shijiazhuang 050031, Hebei, Peoples R China[2]Hebei GEO Univ, New Retail Joint Res Inst, Shijiazhuang 050031, Hebei, Peoples R China
推荐引用方式(GB/T 7714):
Shang Xiao,Wu Siqi,Liu Yuhao,et al.PVT-MA: pyramid vision transformers with multi-attention fusion mechanism for polyp segmentation[J].APPLIED INTELLIGENCE.2025,55(1):doi:10.1007/s10489-024-06041-5.
APA:
Shang, Xiao,Wu, Siqi,Liu, Yuhao,Zhao, Zhenfeng&Wang, Shenwen.(2025).PVT-MA: pyramid vision transformers with multi-attention fusion mechanism for polyp segmentation.APPLIED INTELLIGENCE,55,(1)
MLA:
Shang, Xiao,et al."PVT-MA: pyramid vision transformers with multi-attention fusion mechanism for polyp segmentation".APPLIED INTELLIGENCE 55..1(2025)