Comparison of Out-of-Distribution Detection Performance of CLIP-based Fine-Tuning Methods

Jeonghyeon Kim, Jihyo Kim, Sangheum Hwang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

In recent years, large-scale vision-language models such as CLIP have shown remarkable performance on various zero-shot classification tasks. Inspired by these pretrained models, many studies have proposed effective fine-tuning methods to exploit the models' pre-trained knowledge. One common approach is to fine-tune the entire model to transfer the pre-trained knowledge to target tasks. On the other hand, given the high cost of fine-tuning large-scale models, parameter-efficient fine-tuning methods are also being explored. While there have been performance comparisons of existing fine-tuning methods, they often focus solely on classification performance. Such a singular focus is not sufficient to assess the quality of the transferred pre-trained knowledge comprehensively. For a more rigorous evaluation, other metrics, such as the detection of out-of-distribution samples, should be considered, given their importance for model reliability. However, the comparison of fine-tuning methods concerning model reliability has been less explored. Therefore, we aim to fill this gap by offering a comprehensive comparative analysis on the out-of-distribution detection performance of CLIP-based fine-tuning methods along with their in-distribution classification performance. Our experimental results on the OpenOOD v1.5 benchmark dataset suggest that fine-tuning the entire model provides superior performance in both classification and out-of-distribution detection in a few-shot setting.

Original languageEnglish
Title of host publication2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350371888
DOIs
StatePublished - 2024
Event2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 - Taipei, Taiwan, Province of China
Duration: 28 Jan 202431 Jan 2024

Publication series

Name2024 International Conference on Electronics, Information, and Communication, ICEIC 2024

Conference

Conference2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
Country/TerritoryTaiwan, Province of China
CityTaipei
Period28/01/2431/01/24

Keywords

  • CLIP
  • Fine-tuning
  • Multi-modal foundation models
  • Out-of-distribution detection

Fingerprint

Dive into the research topics of 'Comparison of Out-of-Distribution Detection Performance of CLIP-based Fine-Tuning Methods'. Together they form a unique fingerprint.

Cite this