TY - JOUR
T1 - Mitigating inappropriate concepts in text-to-image generation with attention-guided Image editing
AU - Oh, Jiyeon
AU - Jeong, Jae Yeop
AU - Hong, Yeong Gi
AU - Jeong, Jin Woo
N1 - Publisher Copyright:
© Copyright 2025 Oh et al. Distributed under Creative Commons CC-BY 4.0.
PY - 2025
Y1 - 2025
N2 - Text-to-image generative models have recently garnered a significant surge due to their ability to produce diverse images based on given text prompts. However, concerns regarding the occasional generation of inappropriate, offensive, or explicit content have arisen. To address this, we propose a simple yet effective method that leverages attention map to selectively suppress inappropriate concepts during image generation. Unlike existing approaches that often sacrifice original image context or demand substantial computational overhead, our method preserves image integrity without requiring additional model training or extensive engineering effort. To evaluate our method, we conducted comprehensive quantitative assessments on inappropriateness reduction, text fidelity, image consistency, and computational cost, alongside an online human perceptual study involving 20 participants. The results from our statistical analysis demonstrated that our method effectively removes inappropriate content while preserving the integrity of the original images with high computational efficiency.
AB - Text-to-image generative models have recently garnered a significant surge due to their ability to produce diverse images based on given text prompts. However, concerns regarding the occasional generation of inappropriate, offensive, or explicit content have arisen. To address this, we propose a simple yet effective method that leverages attention map to selectively suppress inappropriate concepts during image generation. Unlike existing approaches that often sacrifice original image context or demand substantial computational overhead, our method preserves image integrity without requiring additional model training or extensive engineering effort. To evaluate our method, we conducted comprehensive quantitative assessments on inappropriateness reduction, text fidelity, image consistency, and computational cost, alongside an online human perceptual study involving 20 participants. The results from our statistical analysis demonstrated that our method effectively removes inappropriate content while preserving the integrity of the original images with high computational efficiency.
KW - Attention map
KW - Deep learning
KW - Inappropriateness mitigation
KW - Text-to-image generation
UR - https://www.scopus.com/pages/publications/105018097033
U2 - 10.7717/peerj-cs.3170
DO - 10.7717/peerj-cs.3170
M3 - Article
AN - SCOPUS:105018097033
SN - 2376-5992
VL - 11
JO - PeerJ Computer Science
JF - PeerJ Computer Science
M1 - e3170
ER -