arXiv Analytics

Sign in

arXiv:2404.12292 [cs.CV]AbstractReferencesReviewsResources

Reducing Bias in Pre-trained Models by Tuning while Penalizing Change

Niklas Penzel, Gideon Stein, Joachim Denzler

Published 2024-04-18Version 1

Deep models trained on large amounts of data often incorporate implicit biases present during training time. If later such a bias is discovered during inference or deployment, it is often necessary to acquire new data and retrain the model. This behavior is especially problematic in critical areas such as autonomous driving or medical decision-making. In these scenarios, new data is often expensive and hard to come by. In this work, we present a method based on change penalization that takes a pre-trained model and adapts the weights to mitigate a previously detected bias. We achieve this by tuning a zero-initialized copy of a frozen pre-trained network. Our method needs very few, in extreme cases only a single, examples that contradict the bias to increase performance. Additionally, we propose an early stopping criterion to modify baselines and reduce overfitting. We evaluate our approach on a well-known bias in skin lesion classification and three other datasets from the domain shift literature. We find that our approach works especially well with very few images. Simple fine-tuning combined with our early stopping also leads to performance benefits for a larger number of tuning samples.

Comments: 12 pages, 12 figures, presented at VISAPP 2024
Journal: Proceedings of the 19th International Joint Conference on Computer Vision (2024), Imaging and Computer Graphics Theory and Applications - Volume 2: VISAPP, ISBN 978-989-758-679-8, ISSN 2184-4321, SciTePress, pages 90-101
Categories: cs.CV
Related articles: Most relevant | Search more
arXiv:2103.01542 [cs.CV] (Published 2021-03-02)
TransTailor: Pruning the Pre-trained Model for Improved Transfer Learning
arXiv:2105.12151 [cs.CV] (Published 2021-05-25)
AutoReCon: Neural Architecture Search-based Reconstruction for Data-free Compression
arXiv:2312.14915 [cs.CV] (Published 2023-12-22)
PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF