https://doi.org/10.1051/epjpv/2025010
Original Article
Redefining failure detection in PV Systems: a comparative study of GPT-4o and ResNet's computer vision in aerial infrared imagery analysis
1
Institute for Renewable Energy, Eurac Research, Viale Druso 1, 39100 Bolzano, Italy
2
Università degli Studi di Modena e Reggio Emilia, Dipartimento di Ingegneria “Enzo Ferrari”, Via P. Vivarelli 10, 41125 Modena MO, Italy
* e-mail: sandra.gallmetzer@eurac.edu
Received:
9
December
2024
Accepted:
7
March
2025
Published online: 6 May 2025
The rapid growth of the solar photovoltaic industry underlines the importance of effective operation and maintenance strategies, particularly for large-scale systems. Aerial infrared thermography has become an essential tool for detecting anomalies in photovoltaic modules due to its cost-effectiveness and scalability. Continuous monitoring through advanced fault detection and classification methods can maintain optimal system performance and extend the life of PV modules. This study investigates the application of advanced artificial intelligence methods for fault detection and classification comparing the performance of GPT-4o, a multimodal large language model, and ResNet, a convolutional neural network renowned for image classification tasks. Our research evaluates the effectiveness of both models using infrared images, focusing on binary defect detection and multiclass classification. ResNet demonstrated advantages in terms of computational efficiency and ease of implementation. Conversely, GPT-4o offered superior adaptability and interpretability, effectively analysing multimodal data to identify and explain subtle anomalies in thermal imagery. However, its higher computational requirements limit its feasibility in resource-limited settings. The results highlight the complementary strengths of these models and provide valuable insights into their role in advancing automated fault diagnosis in photovoltaic systems.
Key words: Inspection / performance / failure detection and classification / photovoltaics / multimodal large language models / prompt engineering
© S. Gallmetzer et al., Published by EDP Sciences, 2025
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.