Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 1;15(1):21485.
doi: 10.1038/s41598-025-07610-5.

A dual-domain perception gate-controlled adaptive fusion algorithm for road crack detection

Affiliations

A dual-domain perception gate-controlled adaptive fusion algorithm for road crack detection

Ziyang Zhang et al. Sci Rep. .

Abstract

Road crack detection presents critical challenges, including diverse defect patterns and complex anomaly characteristics. The current object detection algorithms demonstrate deficiencies in considering feature redundancy across channel-spatial dimensions, employ indiscriminate fusion strategies for multi-stage feature information, and particularly neglect the high-frequency characteristics inherent in crack features, leading to inefficient network performance and a loss of crucial information. Building upon the identified limitations, this paper proposes a dual-domain perception gate-controlled adaptive fusion network (DP-DETR) that achieves dynamic perception of salient features across channel and spatial domains within latent space. To enhance focus on critical features, a dual-domain dynamic perception information distillation mechanism is constructed, which distills redundant features separately across channel and spatial domains, effectively reducing architectural processing redundancy while achieving discriminative characteristic representation efficiency. In order to address the challenge of coarse-grained fusion in multi-stage feature integration, a feature information gating-adaptive fusion module (FGAF-Fusion) is proposed, which facilitates interactive channel-spatial information fusion through mixed local channel attention while employing gated adaptive fusion operations to selectively retain critical semantic information of small-scale targets. In response to the persistent high-frequency signature identified within crack feature distributions, a dual-domain structural feature enhancement loss function is designed, which elevates the weighting of high-frequency information by leveraging a spectral weighting matrix, while complementarily enhancing crack edge texture features in the spatial domain through gradient map integration. The experimental results obtained on the public RDD2022 dataset demonstrate that the proposed DP-DETR (Dual-Domain Perception Gate-Controlled Adaptive Fusion Network) approach mAP50 and mAP50:95 values of 54.2% and 25.8%, respectively, representing improvements of 6.7 and 4.2 percentage points over RT-DETR. In road crack object detection tasks, the proposed DP-DETR method can effectively detect various types of road defects, demonstrating highly competitive detection results and good robustness. The code will be released at https://github.com/jiangsu415/DP-DETR .

Keywords: Defect detection; Dual-domain perception; Feature enhancement; Gate-controlled adaptive fusion; RT-DETR.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interest: The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
The main architecture and essential components of RT-DETR.
Fig. 2
Fig. 2
The Main architecture and key components of DP-DETR model.
Fig. 3
Fig. 3
Feature information gating-adaptive fusion module.
Fig. 4
Fig. 4
Training curves of the DP-DETR and partial models in this paper based on the mAP50 metric.
Fig. 5
Fig. 5
Test photos of representative scenarios.
Fig. 6
Fig. 6
Test photos of difficult scenarios.
Fig. 7
Fig. 7
Training curves of RT-DETRv2 and each evaluation index of this model.

Similar articles

References

    1. Shi, Y. et al. Automatic road crack detection using random structured forests. IEEE Trans. Intell. Transp. Syst.12, 3434–3445 (2016).
    1. Fan, L. L. et al. Pavement defect detection with deep learning: A comprehensive survey. IEEE Trans. Intell. Vehicles9(3), 4292–4311 (2023).
    1. Duan, W. et al. Triple-domain feature learning with frequency-aware memory enhancement for moving infrared small target detection. IEEE Trans. Geosci. Remote Sens., (2024).
    1. Duan, W. et al. Semi-supervised multi-view prototype learning with motion reconstruction for moving infrared small target detection. IEEE Trans. Geosci. Remote Sens., 2025.
    1. Zhu, S. et al. TMP: Temporal motion perception with spatial auxiliary enhancement for moving infrared dim-small target detection. Expert Syst. Appl.255, 124731 (2024).

LinkOut - more resources