Deep learning techniques for crack detection in St. Theodore Church, Cappadocia

Kilic, Ahmet; ÖZATA, ŞERİFE; KULAVUZ, Bahadır; BAKIRMAN, Tolga; BAYRAM, Bülent

doi:10.1016/j.culher.2026.05.003

Deep learning techniques for crack detection in St. Theodore Church, Cappadocia

Kilic A., ÖZATA Ş., KULAVUZ B., BAKIRMAN T., BAYRAM B.

Journal of Cultural Heritage, cilt.80, ss.58-70, 2026 (SCI-Expanded, AHCI, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 80
Basım Tarihi: 2026
Doi Numarası: 10.1016/j.culher.2026.05.003
Dergi Adı: Journal of Cultural Heritage
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Arts and Humanities Citation Index (AHCI), Scopus, Humanities Abstracts, Index Islamicus
Sayfa Sayıları: ss.58-70
Anahtar Kelimeler: Cappadocia, Conservation, Crack detection, Deep learning, Historic church, Instance segmentation, Rock-hewn heritage
Samsun Üniversitesi Adresli: Evet

Özet

The preservation of cultural heritage structures requires continuous monitoring and proactive damage assessment as cultural heritage structures are highly vulnerable to environmental and anthropogenic factors. Although deep learning has shown promise for crack detection, current studies often rely on bounding boxes and lack the precision needed for heritage surfaces, which are characterized by complex geometries and textures. This study investigates the applicability of YOLO-based models for instance segmentation of cracks in cultural heritage structures, using Saint Theodore Church in Cappadocia, Türkiye, as a case study. Saint Theodore Church, part of a UNESCO World Heritage site, exemplifies rock-hewn religious and historic architecture but has suffered significant deterioration over time. A novel domain-specific dataset, YTU-CrackIS, was developed, comprising high-resolution camera and UAV imagery to train and evaluate YOLOv8, YOLOv9 and YOLOv11 models. The study also investigates transfer learning using both COCO and general crack datasets to assess the influence of pre-training on performance. The results show that YOLOv8 outperforms other models, achieving a mean Average Precision (mAP) of 0.635 for detection and 0.553 for segmentation. Generalization tests on additional crack datasets from cultural heritage sites confirm that models trained on domain-specific data significantly outperform those trained on generic datasets. These findings emphasize the need for tailored deep learning approaches in cultural heritage preservation, where damage characteristics differ from conventional structural cracks.