arXiv Analytics

Sign in

arXiv:2303.17051 [cs.CV]AbstractReferencesReviewsResources

Transductive few-shot adapters for medical image segmentation

Julio Silva-Rodríguez, Jose Dolz, Ismail Ben Ayed

Published 2023-03-29Version 1

With the recent raise of foundation models in computer vision and NLP, the pretrain-and-adapt strategy, where a large-scale model is fine-tuned on downstream tasks, is gaining popularity. However, traditional fine-tuning approaches may still require significant resources and yield sub-optimal results when the labeled data of the target task is scarce. This is especially the case in clinical settings. To address this challenge, we formalize few-shot efficient fine-tuning (FSEFT), a novel and realistic setting for medical image segmentation. Furthermore, we introduce a novel parameter-efficient fine-tuning strategy tailored to medical image segmentation, with (a) spatial adapter modules that are more appropriate for dense prediction tasks; and (b) a constrained transductive inference, which leverages task-specific prior knowledge. Our comprehensive experiments on a collection of public CT datasets for organ segmentation reveal the limitations of standard fine-tuning methods in few-shot scenarios, point to the potential of vision adapters and transductive inference, and confirm the suitability of foundation models.

Comments: The project code is available in https://github.com/jusiro/fewshot-finetuning
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:1810.07810 [cs.CV] (Published 2018-10-17)
LadderNet: Multi-path networks based on U-Net for medical image segmentation
arXiv:2007.00525 [cs.CV] (Published 2020-07-01)
A Fast Algorithm for Geodesic Active Contours with Applications to Medical Image Segmentation
arXiv:2305.14093 [cs.CV] (Published 2023-05-23, updated 2023-05-24)
3D Open-vocabulary Segmentation with Foundation Models
Kunhao Liu et al.