To cite this paper use one of the standards below:
Pavement crack detection and segmentation are critical tasks for effective infrastructure maintenance. Despite promising advances driven by deep learning, significant challenges persist due to the inherent complexities of pavement crack characteristics. This paper introduces CrackFormerSV2, a novel encoder-decoder architecture specifically designed for robust pavement crack segmentation. A key feature of CrackFormerSV2 is its integration of the hierarchical feature extraction capabilities of Swin Transformer V2. Furthermore, the architecture incorporates dual attention mechanisms: the Convolutional Block Attention Module (CBAM) within the decoder blocks to refine feature maps, and a novel Skip Attention module that enhances traditional skip connections through a cross-attention strategy between corresponding encoder and decoder features. An Atrous Spatial Pyramid Pooling (ASPP) module is utilized at the bottleneck to effectively aggregate multi-scale contextual information crucial for capturing diverse crack patterns. The model is trained using a strategic learning rate schedule, employing distinct rates for the pre-trained encoder and the decoder. Evaluations conducted on established public benchmarks and a proprietary dataset demonstrate that CrackFormerSV2 achieves significant performance improvements across key metrics, including Intersection over Union (IoU), recall, precision, and F1 score, outperforming a baseline UNet-ResNet model. Continued optimization, including gradual unfreezing and the exploration of advanced loss functions, suggests a strong potential for CrackFormerSV2 to achieve or even surpass current state-of-the-art results in pavement crack segmentation.
With nearly 200,000 papers published, Galoá empowers scholars to share and discover cutting-edge research through our streamlined and accessible academic publishing platform.
Learn more about our products:
This proceedings is identified by a DOI , for use in citations or bibliographic references. Attention: this is not a DOI for the paper and as such cannot be used in Lattes to identify a particular work.
Check the link "How to cite" in the paper's page, to see how to properly cite the paper