Image stitching, a fundamental problem in computer vision, aims to generate panoramic images with an extended field of view. However, existing methods often struggle with large parallax, weak textures, and illumination variations, leading to misalignment and visible seams. In this work, we propose semantics-guided hybrid stitching (SGHS), a framework that recasts image stitching as a coarse-to-fine generative inpainting problem. SGHS combines robust semantic alignment with a semantic-aware mask and multimodal conditioning to steer a pretrained diffusion model. A lightweight low-rank adaptation (LoRA) module adapts the generator to the stitching task, and an edge-guided enhancement module sharpens seams. This design is resilient to large parallax, texture sparsity, and illumination changes, producing geometry-faithful panoramas with fewer artifacts. Extensive experiments on the challenging unsupervised deep image stitching dataset across high-, medium-, and low-overlap regimes, together with cross-dataset evaluations on as-projective-as-possible and real unmanned aerial vehicle mosaics, demonstrate that SGHS consistently improves peak signal-to-noise ratio and structural similarity index measure over representative warping- and inpainting-based baselines while yielding visibly cleaner seams, better straight-line preservation, and stronger semantic consistency. Ablation studies further validate the effectiveness of the semantic alignment, the semantic-aware mask and conditioning, the LoRA adaptation, and the edge-guided enhancement components.
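For readers unfamiliar with the LoRA component mentioned above, the following is a minimal PyTorch sketch of the generic low-rank adaptation recipe applied to a single linear layer. It is an illustration of the general technique, not SGHS's implementation: the class name `LoRALinear`, the rank, and the scaling factor are assumptions, and the paper's actual placement of LoRA modules inside the diffusion generator is not specified in the abstract.

```python
# Generic low-rank adaptation (LoRA) sketch: a frozen pretrained linear layer
# augmented with a trainable low-rank update. Names and hyperparameters are
# illustrative assumptions, not taken from SGHS.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Computes y = W x + (alpha / r) * B A x, where W is frozen,
    A is (r x d_in), and B is (d_out x r); only A and B are trained."""
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():  # freeze the pretrained weights
            p.requires_grad_(False)
        d_in, d_out = base.in_features, base.out_features
        self.lora_A = nn.Parameter(torch.randn(rank, d_in) * 0.01)
        # B is zero-initialized so training starts from the pretrained behavior.
        self.lora_B = nn.Parameter(torch.zeros(d_out, rank))
        self.scale = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + self.scale * (x @ self.lora_A.T) @ self.lora_B.T

# Usage: only the low-rank factors receive gradients, which is what keeps
# this style of adaptation lightweight relative to full fine-tuning.
layer = LoRALinear(nn.Linear(320, 320), rank=8)
out = layer(torch.randn(4, 320))
```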