SynPo: Boosting Training-Free Few-Shot Medical Segmentation via High-Quality Negative Prompts

1Xiamen University, 2Peking University, 3Kuaishou Inc.
MICCAI 2025 Early Accept

*Corresponding Author
Challenges in Few-Shot Medical Image Segmentation

Challenges in Training-Free Few-Shot Segmentation. (1) The general pipeline of point-promptable segmentation models for training-free few-shot segmentation. (2) Different Confidence Maps (C. Map) vs. Ground Truth. In the confidence map generated from DINOv2 features, irrelevant regions on the right are mistakenly identified as "similar", while the confidence map from SAM-ViT features shows less distinct value separation. Our synergy confidence map leverages the strengths of both while mitigating their respective weaknesses. (3) Pilot Experiment. Placing negative prompts outside the anatomical region, even with identical positive prompt locations, yields worse segmentation performance than placing them inside.

Abstract

The advent of Large Vision Models (LVMs) offers new opportunities for few-shot medical image segmentation. However, existing training-free methods based on LVMs fail to effectively utilize negative prompts, leading to poor performance on low-contrast medical images. To address this issue, we propose SynPo, a training-free few-shot method based on LVMs (e.g., SAM), with the core insight of improving the quality of negative prompts. To select point prompts from a more reliable confidence map, we design a novel Confidence Map Synergy Module that combines the strengths of DINOv2 and SAM. Based on this confidence map, we select the top-k pixels as the positive point set and choose the negative point set using a Gaussian distribution, followed by independent K-means clustering on each set. These selected points then serve as high-quality prompts for SAM to obtain the segmentation results. Extensive experiments demonstrate that SynPo achieves performance comparable to state-of-the-art training-based few-shot methods.
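As a rough illustration of this point-selection recipe, the sketch below picks the top-k pixels of a synergy map as positive candidates, draws negative candidates from pixels whose confidence falls near an assumed negative Gaussian N(mu_neg, sigma_neg), and reduces each set to cluster centers with K-means. The hyperparameters (k, the cluster count, the one-sigma band) and the helper names are illustrative assumptions, not the paper's exact settings.

# A minimal sketch of top-k positive / Gaussian negative point selection.
# Hyperparameters and the one-sigma negative band are assumptions for
# illustration; SynPo's exact thresholds may differ.
import numpy as np
from sklearn.cluster import KMeans

def select_points(syn_map, mu_neg, sigma_neg, k=200, n_clusters=5):
    """Return (positive, negative) prompt coordinates from a (H, W) synergy map.

    Assumes both candidate sets are non-empty; mu_neg and sigma_neg come from
    the modeled negative confidence distribution P_neg.
    """
    h, w = syn_map.shape
    flat = syn_map.ravel()

    # Positives: the k highest-confidence pixels.
    pos_idx = np.argsort(flat)[-k:]

    # Negatives: pixels within one std of the negative-confidence mean,
    # i.e. "confusable" background rather than trivially dissimilar regions.
    neg_idx = np.flatnonzero(np.abs(flat - mu_neg) < sigma_neg)

    def cluster_centers(idx):
        # Reduce a candidate set to a few representative (x, y) points.
        ys, xs = np.unravel_index(idx, (h, w))
        coords = np.stack([xs, ys], axis=1).astype(float)
        km = KMeans(n_clusters=min(n_clusters, len(coords)), n_init=10).fit(coords)
        return km.cluster_centers_

    return cluster_centers(pos_idx), cluster_centers(neg_idx)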

Pipeline of SynPo

SynPo, as shown in the figure below, consists of three key components: the Confidence Map Synergy Module (CMSM), the Point Selection Module (PSM), and the Noise-aware Refine Module (NRM). Given a support-query pair, SynPo first extracts zero-shot visual features using pre-trained vision models (SAM-ViT and DINOv2). In the CMSM, the feature maps, together with the support mask \( \mathcal{M}_S \in \mathbb{R}^{H \times W} \), are used to compute the synergy map \( SynMap \in \mathbb{R}^{H \times W} \) and to model the negative confidence distribution \( P_{neg} \), both of which guide prompt generation. In the PSM, the pixels of the synergy map are sorted by confidence score into a ranked list, which, together with the confidence distribution, determines the selection of point prompts. Finally, the generated point prompts and the query image \( I_Q \) are fed into SAM to predict the coarse mask \( \mathcal{M}_{coarse} \in \mathbb{R}^{H \times W} \), which the NRM then refines.
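The synergy-map computation in the CMSM can be sketched as a masked-prototype cosine similarity per backbone, followed by a weighted blend. The min-max normalization and the fusion weight alpha below are assumptions for illustration; the paper's exact formulation may differ.

# A minimal sketch of confidence-map synergy, assuming the SAM-ViT and DINOv2
# feature maps have already been extracted and resized to a common H x W grid.
# The fusion weight `alpha` and the min-max normalization are illustrative
# assumptions, not the paper's exact formulation.
import torch
import torch.nn.functional as F

def confidence_map(feat_q, feat_s, mask_s):
    """Cosine similarity between each query pixel and the masked support prototype.

    feat_q, feat_s: (C, H, W) feature maps; mask_s: (H, W) binary support mask.
    """
    proto = (feat_s * mask_s).sum(dim=(1, 2)) / mask_s.sum().clamp(min=1)  # (C,)
    sim = F.cosine_similarity(feat_q, proto[:, None, None], dim=0)         # (H, W)
    return (sim - sim.min()) / (sim.max() - sim.min() + 1e-8)              # scale to [0, 1]

def synergy_map(feat_q_sam, feat_s_sam, feat_q_dino, feat_s_dino, mask_s, alpha=0.5):
    """Blend the SAM-ViT and DINOv2 confidence maps into one synergy map."""
    c_sam = confidence_map(feat_q_sam, feat_s_sam, mask_s)
    c_dino = confidence_map(feat_q_dino, feat_s_dino, mask_s)
    return alpha * c_dino + (1 - alpha) * c_sam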

SynPo Pipeline Illustration

(1) Overview of the SynPo Architecture. SynPo processes a support-query pair through three key modules: the Confidence Map Synergy Module (CMSM), the Point Selection Module (PSM), and the Noise-aware Refine Module (NRM). Frozen vision models (SAM-ViT and DINOv2) extract visual features, and a frozen prompt-based segmentation model (e.g., SAM) performs mask prediction. (2) Illustration of Confidence Map Synergy. The synergy map and confidence statistics guide the PSM in selecting positive and negative point prompts through ranking and filtering. (3) Point Selection Module Diagram. The selected prompts drive an initial segmentation via SAM, which the NRM then refines into a more accurate mask prediction.
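Once the point prompts are selected, feeding them to a promptable segmentation model follows the public segment_anything API: positive points get label 1 and negative points label 0. The snippet below is a sketch; pos_pts, neg_pts, and query_image are placeholders, and the NRM refinement stage is omitted.

# Sketch of the SAM prompting step using the public `segment_anything` API.
# `pos_pts`, `neg_pts`, and `query_image` are placeholders for the outputs of
# the PSM and the query loader; NRM refinement is not shown.
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)
predictor.set_image(query_image)  # query image as an (H, W, 3) uint8 RGB array

point_coords = np.concatenate([pos_pts, neg_pts], axis=0)   # (N, 2) in (x, y) order
point_labels = np.concatenate([np.ones(len(pos_pts)),       # 1 = positive prompt
                               np.zeros(len(neg_pts))])     # 0 = negative prompt

masks, scores, _ = predictor.predict(point_coords=point_coords,
                                     point_labels=point_labels,
                                     multimask_output=False)
coarse_mask = masks[0]  # the coarse mask, to be refined by the NRM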

Experimental Results

Poster

BibTeX


@misc{liu2025synpo,
  title={SynPo: Boosting Training-Free Few-Shot Medical Segmentation via High-Quality Negative Prompts},
  author={Yufei Liu and Haoke Xiao and Jiaxing Chai and Yongcun Zhang and Rong Wang and Zijie Meng and Zhiming Luo},
  year={2025},
  eprint={2506.15153},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2506.15153},
}