Diffusion-based models have shown great promise in real-world image super-resolution (Real-ISR), but often generate content with structural errors and spurious texture details due to the empirical priors and illusions of these models. To address this issue, we introduce StructSR, a simple, effective, and plug-and-play method that enhances structural fidelity and suppresses spurious details for diffusion-based Real-ISR. StructSR operates without the need for additional fine-tuning, external model priors, or high-level semantic knowledge. At its core is the Structure-Aware Screening (SAS) mechanism, which identifies the image with the highest structural similarity to the low-resolution (LR) input in the early inference stage, allowing us to leverage it as a historical structure knowledge to suppress the generation of spurious details. By intervening in the diffusion inference process, StructSR seamlessly integrates with existing diffusion-based Real-ISR models. Our experimental results demonstrate that StructSR significantly improves the fidelity of structure and texture, improving the PSNR and SSIM metrics by an average of 5.27% and 9.36% on a synthetic dataset (DIV2K-Val) and 4.13% and 8.64% on two real-world datasets (RealSR and DRealSR) when integrated with four state-of-the-art diffusion-based Real-ISR methods.
In the proposed StructSR, the Structure-Aware Screening (SAS) works in the early inference stage and screens out the structural embedding with the most consistent and clearer structure compared to the LR image. In the later inference stage, the Structure Condition Embedding (SCE) uses the structural embedding to guide noise prediction in conjunction with the LR image. The Image Details Embedding (IDE) inserts the structural embedding into the clean latent image at each timestep according to the degradation degree.
@article{li2025structsr,
title={StructSR: Refuse Spurious Details in Real-World Image Super-Resolution},
author={Li, Yachao and Liang, Dong and Ding, Tianyu and Huang, Sheng-Jun},
journal={arXiv preprint arXiv:2501.05777},
year={2025}
}