Score-Control for Hallucination Reduction in Diffusion Models
Abstract
Variance-Guided Score Modulation reduces hallucinations in diffusion models by controlling score function smoothness through Jacobian modulation while maintaining image quality.
Diffusion models have emerged as the backbone of modern generative AI, powering advances in vision, language, audio and other modalities. Despite their success, they suffer from hallucinations, implausible samples that lie outside the support of true data distribution, which degrade reliability and trust. In this work, we first empirically confirm previously proposed hypothesis that score smoothness causes hallucinations in Image Generation diffusion models and provide a density-based perspective. We further formalize this notion by linking the hallucinations probability mass to lipschitz constant of the learned score function. Motivated by this, we introduce a Variance-Guided Score Modulation (VSM) strategy that controls the score Jacobian, in turn reducing score smoothness and better approximating the ground truth score that decreases hallucinations. Empirical results on synthetic and real-world datasets demonstrate that our approach reduces hallucinations (up to ~25%) while maintaining high fidelity and diversity, providing a principled step toward more reliable diffusion-based image generation. We also propose two benchmark datasets with extreme semantic variation for systematic hallucination evaluation. Code and Datasets are publicly available at https://github.com/bhosalems/VSM.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- Broken Memories: Detecting and Mitigating Memorization in Diffusion Models with Degraded Generations (2026)
- FLaG: Fine-Grained Latent Grouping for Hallucination Detection (2026)
- HTDC: Hesitation-Triggered Differential Calibration for Mitigating Hallucination in Large Vision-Language Models (2026)
- Mitigating Entangled Steering in Large Vision-Language Models for Hallucination Reduction (2026)
- Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models (2026)
- VCE: A Zero-Cost Hallucination Mitigation Method of LVLMs via Visual Contrastive Editing (2026)
- Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend
Get this paper in your agent:
hf papers read 2606.00377 Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper
