Abstract
Conventional image restoration models are difficult to apply efficiently in real-world scenarios because they are designed to handle only specific types and levels of degradation. This study proposes a model that can handle multiple degradations with a single restoration model using prompt learning. Furthermore, we introduce a sub-network, pixel-level encoder, that modulates the encoder of the main network and prompts. In this process, the proposed model adaptively integrates features across spatial and channel spaces through deformable spatial cross-attention and multi-Dconv head transposed cross-attention. Moreover, this model exploits pixel-wise contrastive loss to capture the style and context information of target images. An experimental evaluation is conducted using a widely-used dataset in all-in-one image restoration, including dehazing, deraining, and denoising. Additionally, this study evaluates the robustness of models for monocular depth estimation and visual odometry using images reconstructed from noise degradation.
Original language | English |
---|---|
Pages (from-to) | 1015-1023 |
Number of pages | 9 |
Journal | Journal of Institute of Control, Robotics and Systems |
Volume | 30 |
Issue number | 9 |
DOIs | |
State | Published - 2024 |
Keywords
- attention mechanism
- Deep Learning
- Image Restoration
- Prompt
- Robust Visual Odmetry