SHPNeXt: A Novel Method of Multi-Scale and Variable Resolution AI-Based Tongue Image Segmentation
In the domain of Traditional Chinese Medicine, accurately segmenting tongue images is fundamental for computer-assisted diagnosis. Yet, current models often falter with images of diverse scales and clarity, impeding their widespread application. To address this challenge, we propose SHPNeXt, an inno...
Saved in:
| Main Authors: | , , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10938541/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | In the domain of Traditional Chinese Medicine, accurately segmenting tongue images is fundamental for computer-assisted diagnosis. Yet, current models often falter with images of diverse scales and clarity, impeding their widespread application. To address this challenge, we propose SHPNeXt, an innovative network designed to accurately segment tongue images across different scales and resolutions. This model blends PoolFormer and Hire-MLP to adeptly discern both overarching and nuanced details, ensuring accurate segmentation across varying tongue image sizes. Furthermore, SHPNeXt’s precision was further enhanced by integrating a Nuclear-Norm Non-negative Matrix Factorization (NMF) approach, which robustly counters noise in lower quality images. Experiments on three benchmark datasets demonstrate SHPNeXt’s superior performance, achieving mean Intersection over Union (mIoU) scores of 99.64%, 97.05%, and 96.82%. Balancing efficiency and accuracy, SHPNeXt’s architecture comprises 5.984 million parameters and operates at 1.22 GFLOPs, rendering it an exceptionally effective tool for real-world tongue diagnosis in TCM. The code has been released on github: (<uri>https://github.com/Kuanzhaipcx/SHPNeXt.git</uri>). |
|---|---|
| ISSN: | 2169-3536 |