Latent Diffusion Model Assisted Image Deblurring

20250 citationsJournal Article

Authors

Cheng-Chia Wu · Richtek Technology (Taiwan)

Chih-Wei Tang · National Central University

Abstract

Image deblurring aims at pixel-level prediction with high fidelity, while realistic details are indispensable to high-quality deblurred images. A large latent diffusion model (LDM) trained on large-scale datasets generates images with realistic details, while incorporating a LDM into the deblurring process cannot avoid the non-negligible cost during inference on consumer electronics. To verify the hypothesis that a pretrained large LDM helps a deblurring network learn to improve visual quality without the additional inference cost raised by LDM, this paper proposes a novel two-phase training framework. The pretrained LDM functions as the auxiliary network during training and helps a deblurring network learn to improve quality. For this, the proposed late textual and visual prompt tuning equips the pretrained LDM with prior knowledge of sharp images, enabling the use of a small number of learnable parameters. The LDM helps an existing deblurring network learn to generate realistic details. The inference phase only retains the deblurring network to avoid the cost of the LDM. We utilize synthetic and real image deblurring datasets as main tools for developing and evaluating the proposed method. Test results show that the proposed method enhances most of perceptual indices and keep balance between fidelity and quality.

Topics & Keywords

Advanced Image Processing Techniques Image and Video Quality Assessment Generative Adversarial Networks and Image Synthesis

UN Sustainable Development Goals

Industry, innovation and infrastructure

Publication Details

Published in: IEEE Transactions on Consumer Electronics

Volume 72, Issue 1, pp. 999-1010

DOI: 10.1109/tce.2025.3644426

Field-Weighted Citation Impact: 0.00