Introduction

Photovoltaic (PV) energy has been growing in the global energy market and will continue to grow with increases in durability and reliability and the subsequent reduction in cost1. PV modules typically have a three-layer polymer laminate as the backsheet of the module to provide an environmental protective barrier for the PV cells. Additionally, the backsheet provides safety for the module due to the high dielectric breakdown strength of polyethylene terephthalate (PET), which is often used as a core layer in backsheets2. PV backsheets are susceptible to degradation from environmental stressors such as irradiance, temperature, pollution, and humidity3,4,5,6. Various degradation mechanisms, such as delamination and crack formation in PV module backsheets, lead to the failure of the safety and environmental barrier provided by the backsheet, requiring repair or replacement7,8,9. A fundamental understanding of the underlying degradation mechanisms and pathways is largely missing and hinders the ability to predict the lifetime of PV module backsheets.

Currently, manual field surveys by technicians have been the main method to identify backsheet degradation and failure10,11. Automated field surveys have emerged as an alternative for large-scale commercial PV sites. These field surveys include techniques such as fluorescence, electroluminescence, photoluminescence, visual, and infrared imaging for the analysis of cell cracking, corrosion, and encapsulant degradation12,13,14,15. Although approaches to collect the data exist (e.g., via drones, airplanes, or by instruments that can be moved along racks), this progress has not been paralleled by data analytic tools to automate degradation detection and analysis. Additionally, there has been little focus on automated methods to detect failure or degradation of the backsheets. Such an automated way to identify degradation mechanisms would reduce the operation and maintenance costs of PV farms and provide insights into the degradation mechanisms of fielded backsheets. More importantly, a large amount of data collected and annotated during operation would accelerate the rate of establishing a reliable relationship between various environmental conditions and degradation rates of materials10,16,17.

Progress has been made in other applications, where various signals have been leveraged to detect defects in materials and structures. For example, in steel beams, cracks have been identified from vibrational changes using wavelet analysis18. Mechanical impedance19 and ultrasound have also been used to detect internal defects20. Progress has also been made in the area of machine vision for defect detection21,22,23,24. For example, Koch et al.22 investigated decision tree and support vector machine (SVM) methods for the task of defect detection in concrete and asphalt civil infrastructure. In that approach, fixed rules are used to select a subset of regions in the image for which handcrafted features are computed. However, handcrafted features require significant domain knowledge, effort, and often fine-tuning to perform efficiently in a particular scenario. The alternative approach involves automated feature development, which is considered the key advantage of deep learning based approaches. These approaches learn discriminative representations from the data without the need for handcrafted features. The learned representations are highly effective at performing the mapping between automated features and the output of interest. At the same time, it has been shown that the learned representations might be difficult for domain experts or conventional supervised learning methods to notice or deduce.

It is the unremitting success of deep learning techniques in image classification and object detection tasks that motivated researchers to explore the capabilities of such networks for pixel-level labeling tasks, such as scene labeling25,26 and semantic segmentation27,28,29. Many different models have been proposed for semantic segmentation30,31,32,33. The most successful state-of-the-art deep learning techniques for semantic segmentation spring from a common breakthrough: the fully convolutional neural network (F-CNN) by Long et al.29. This network is trained to learn hierarchies of features. The learned features are fused to achieve a non-linear, local-to-global feature representation that enables pixel-wise inference. The F-CNN framework has shown a significant improvement in segmentation accuracy over traditional methods on standard datasets such as the PASCAL VOC benchmark34.

Motivated by the excellent performance of F-CNNs on segmentation tasks reported in the literature and inspired by the ideas of flexibility in segmentation networks, we adapted the F-CNN for the task of degradation type detection in PV backsheets. Specifically, we customized a F-CNN for degradation mechanism type detection by altering the standard feature extraction and expansion structure of the F-CNN to improve the accuracy of results on PV degradation mechanism type detection. While previous work has attempted to address vision problems generally, this paper is concerned with the development of a semantic segmentation method that can be used for automated PV module degradation mechanism detection, e.g., crack inspection. In this paper, we outline the essential system components of an automated inspection system for PVs. The automated computer vision is based upon a deep learning architecture that falls under the family of fully convolutional neural networks (F-CNNs). Validation tests demonstrate the high speed and accuracy of the proposed F-CNN architecture. Our approach and networks are generic and can be used for segmentation and identification of other crack types and inspection of other material systems.

Results and Discussion

In this work, the detection of degradation modes in backsheets is discussed. Different types of surface patterns are observed in PV module backsheet films exposed to accelerated and real-world exposures (Klinke, et al.35). All of the degradation types are observed on the inner layer (i.e., the sun-side layer in a PV module), which was not directly exposed to irradiance. Fig. 1 depicts three representative images of the various categories of degradation. The observed patterns can be mainly grouped into six categories: no-cracking, parallel, delamination, transverse-branching, longitudinal-branching, and mudflat. The no-cracking region refers to a region with the absence of any type of crack. The no-cracking regions are desirable, but do not belong to any defect mechanism pattern. The parallel cracks in Fig. 1 are oriented in the vertical direction (along the Y-axis of the image). Large-scale loss of adhesion causes pieces of the inner layer to fall off, which in turn leads to the delamination regions shown in Fig. 1. The transverse-branching cracks are crack patterns that are perpendicular to parallel cracks (horizontal direction, or along the X-axis). If there are branches on the parallel cracks and the branches are in the same direction as the parallel cracks, the branch crack is annotated as longitudinal-branching. Finally, cracks are labeled as mudflat cracks when the branches of the cracks are oriented in multiple directions. Examples of transverse-branching, longitudinal-branching, and mudflat cracks are also shown in Fig. 1.

Figure 1

Examples of representative PV polymer backsheets with different categories of crack patterns, including parallel, mudflat, delamination, transverse-branching, and longitudinal-branching cracks. Three original representative images are shown in the top row. The corresponding labelled and annotated images are shown in the bottom row.

The dataset consists of 34 varying-resolution images of the inner layer (sun side) of backsheet films exposed to two accelerated exposures with eight steps of 500 h and two real-world exposures with six steps of 2 months35. Samples were exposed with the air side of the backsheet films facing the irradiance source in two different ASTM G154-04 cycle 4 exposures36 using UVA-340 fluorescent ultraviolet lamps (wavelengths 280–400 nm) in Q-Lab QUV accelerated weathering testers. One exposure was a cyclic exposure of 8 h of 1.55 W/m2/nm at a 70 °C chamber temperature followed by 4 h of darkness and condensing humidity at 50 °C; the other omitted the dark condensing humidity step. Real-world exposures were conducted in Cleveland, Ohio between July 2, 2013, and October 7, 2014, on two-axis trackers in sample trays with and without irradiance concentration. The images were collected using a PAX-it PAXcam camera in a photo lightbox. These samples and exposures are described in detail in Klinke et al.35.

The PV image dataset was annotated by human experts from Case Western Reserve University. Specifically, all images in the dataset were labeled manually. The image annotation tool LabelMe37 was used for labeling. The tool allows users to annotate a class by clicking along the boundary of the desired region and indicating its identity. In Fig. 1, the raw dataset images are depicted in the first row. The second-row images correspond to the manually annotated images of the various crack regions using LabelMe. The annotated images are considered the ground-truth labels of the raw images. As a result of annotation, six categories are identified: no-cracking, parallel, delamination, transverse-branching, longitudinal-branching, and mudflat, encoded as class labels 1, 2, 3, 4, 5, and 6, respectively. The remaining region of the image (not belonging to the first six classes) is considered background and assigned a separate class category (class 0). Therefore, there are in total seven classes (N = 7). The images obtained from the backsheet film study are of different resolutions. Furthermore, the initial dataset size of 34 images may be insufficient to train the F-CNN model. Therefore, the initial set of annotated images is split into image blocks; examples of image blocks are depicted in the first row of Fig. 1. Following this strategy, the initial set of 34 images is processed to generate 286 image block samples. Each image block sample is 320 pixels wide and 480 pixels high (320 × 480). The 286 image block samples are considered the input dataset, I, for the analysis. For training and evaluation of the model, the dataset I is shuffled and randomly split into three non-overlapping sets: a training set ITrain (170 examples), a validation set IVald (73 examples), and a test set ITest (43 examples). To avoid leakage between the training set and the test set, label-preserving transformations (horizontal and vertical mirroring) were applied to the training set only, after splitting, to inflate the size of the training dataset.
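
The block-splitting and mirroring steps translate directly into code. The following is a minimal sketch in Python/NumPy, assuming each annotated image is available as an image array paired with a per-pixel label array; the function names are ours for illustration and are not taken from the released code.

```python
import numpy as np

def split_into_blocks(image, labels, block_h=480, block_w=320):
    """Tile an annotated image into non-overlapping 320 x 480 blocks."""
    blocks = []
    H, W = labels.shape[:2]
    for y in range(0, H - block_h + 1, block_h):
        for x in range(0, W - block_w + 1, block_w):
            blocks.append((image[y:y + block_h, x:x + block_w],
                           labels[y:y + block_h, x:x + block_w]))
    return blocks

def mirror_augment(blocks):
    """Label-preserving augmentation: horizontal and vertical mirroring.
    Applied to the training set only, after the train/validation/test split."""
    out = []
    for img, lbl in blocks:
        out.append((img, lbl))
        out.append((img[:, ::-1], lbl[:, ::-1]))  # horizontal mirror
        out.append((img[::-1, :], lbl[::-1, :]))  # vertical mirror
    return out
```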

The annotated dataset is used to train the F-CNN networks. The architecture of the final F-CNN is determined through empirical studies, as generic design rules for constructing CNNs are still elusive. Specifically, our architecture is developed by varying the encoding and decoding configuration. In this sense, this paper makes two contributions. First, several F-CNN architectures are investigated. Next, the accuracy of the final architecture is discussed in the context of the application of interest. The corresponding results are discussed in the next two subsections.

Design of F-CNN for detection of backsheet degradation types

The proposed network consists of an encoding part and a decoding part. In the encoding part, high-level abstract feature maps, or representations, are extracted from input images. The extraction is achieved by applying a series of convolutional and pooling layers. In the decoding part, the abstract features are gradually reconstructed into a pixel-level prediction for the input images. The reconstruction is accomplished by relaying intermediate feature representations from the encoding part to the decoding part through fusion layers. The network architecture of the F-CNN for degradation mechanism detection in backsheets is illustrated in Fig. 2. The two structures of the F-CNN architecture, the encoder and the decoder, are detailed below. Table 1 summarizes the nomenclature used.

Figure 2

Overview of the F-CNN model for autonomous recognition of degradation mechanisms.

Table 1 Nomenclature used to describe the network architecture and parameters used during the experiments.

Encoder structures

The convolutional layer (CONV layer) is the basic building block of a deep neural network model. The CONV layer performs two-dimensional convolution of the input image using a set of filters W, generating a set of feature maps h. Mathematically, the operation is expressed as follows:

$$h=W\ast X+b$$

where b denotes the bias of the filter, and ‘*’ represents the convolution operation.

Activation function (ReLU layer): A non-linear activation function handles the non-linearities of the mapping between input and output. In general, the Rectified Linear Unit (ReLU(h) = max(0, h)) is used as the neuron activation function, as it performs well with respect to runtime and generalization error38. This function is added after each convolutional layer.

Pooling layer (POOL layer): The POOL layer receives feature maps and resizes them into smaller maps. The most common POOL layer choice is max-pooling, where each map is subsampled with the maximum value over np × np adjacent regions. Max-pooling is used as it introduces a small invariance to translation and distortion and leads to faster convergence and better generalization39,40.
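
Taken together, one CONV + ReLU + POOL stage of the encoder can be sketched in Keras (the library used in this work; see Method). The 3 × 3 kernel size and the stage signature below are illustrative assumptions, not the exact configuration of the published network.

```python
from keras.layers import Conv2D, MaxPooling2D

def encoder_stage(x, n_filters, n_convs=2):
    """One encoder stage: n_convs CONV layers (h = W * X + b), each
    followed by a ReLU activation, then 2 x 2 max-pooling."""
    for _ in range(n_convs):
        x = Conv2D(n_filters, (3, 3), padding='same', activation='relu')(x)
    return MaxPooling2D(pool_size=(2, 2))(x)
```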

Decoder structures

Upsampling layer (UPSAMPLING): Upsampling is a procedure to connect coarse outputs to dense pixels through interpolation. F-CNN-based architectures make use of learnable upsampling filters to upsample feature maps. The upsampling kernels are learned through the use of transposed convolution (deconvolution)41, in which zero padding and stride are specified to increase, rather than decrease, the size of the feature maps. Figure 3 illustrates the upsampling process through the deconvolutional layer.

Figure 3

Upsampling layer and feature fusion.

Feature fusion: Fusion is an element of the F-CNN that adds context information to a fully convolutional architecture. As demonstrated in Fig. 3, the upsampled feature maps generated by the deconvolutional layer are added elementwise to the corresponding feature maps generated by the convolutional layer in the encoder.
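
A decoder step combining learnable upsampling with feature fusion can be sketched as follows. The 4 × 4 deconvolution kernel and the 1 × 1 scoring convolution used to match channel counts are common F-CNN choices and are assumptions here, not values taken from the paper.

```python
from keras.layers import Conv2D, Conv2DTranspose, Add

def upsample_and_fuse(coarse, skip, n_classes):
    """Upsample coarse feature maps 2x with a transposed convolution and
    add them elementwise to the corresponding encoder feature maps."""
    up = Conv2DTranspose(n_classes, (4, 4), strides=(2, 2),
                         padding='same')(coarse)
    score = Conv2D(n_classes, (1, 1), padding='same')(skip)  # match channels
    return Add()([up, score])
```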

There is no unique approach to design the encoder and decoder architectures. In this paper, various designs are explored to deliver the highest validation accuracy and yield low generalization error for new data. Regardless of the details of the architecture, the parameters of the network are determined via the training process.

Training process

Once the architecture is decided, the final step is to train the model using the dataset. In this paper, the aim is to map the input image to the set of classes. Specifically, the goal is to determine the complex end-to-end mapping function that transforms the measured input image {Xi} to its corresponding multi-class image {Yi}. The output multi-class image consists of pixels annotated with the degradation category (0–6).

The network parameters are iteratively updated using backpropagation42,43 to minimize the loss. The categorical cross-entropy loss44 is applied for evaluating the output error. The output error is obtained by computing the deviation (error) of the network outcome Yi from the desired ground truth \(\{{Y^{\prime} }_{i}\}\). The cross-entropy loss is defined as:

$$Loss=-\,\sum \,{Y^{\prime} }_{i}\,\log \,({Y}_{i})$$

where Yi is a function of the input image {Xi} and the network parameters (i.e., W and b). The Adam optimization method45 is used as it offers faster convergence than standard stochastic gradient descent.
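
In Keras, this training configuration amounts to compiling the model with the categorical cross-entropy loss and the Adam optimizer; the learning rate of 0.001 is taken from Table 2, and the wrapper function below is our own illustration.

```python
from keras.optimizers import Adam

def compile_fcnn(model, learning_rate=0.001):
    """Configure training: per-pixel softmax outputs are compared against
    one-hot ground-truth labels via categorical cross-entropy, and the
    weights are updated by backpropagation with Adam."""
    model.compile(optimizer=Adam(lr=learning_rate),
                  loss='categorical_crossentropy',
                  metrics=['accuracy'])
    return model
```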

The training process heavily depends on the amount of data available. In many materials science problems, the cost of generating data is high, and various strategies are needed to address this issue. In our problem, there are two significant challenges. First, the dataset is small; we address this by splitting the collected images into image blocks, as described in the introduction of the Results section. This operation allowed us to increase the size of the dataset by a factor of 8. Another challenge stems from the data being unbalanced towards some classes of cracks: in our dataset, the seven classes are represented by significantly different numbers of pixels per region. Finally, whenever data is limited, overfitting may occur, hampering the generalizability of the F-CNN. Two strategies to address these issues are detailed below.

Data balancing strategy

The unbalanced data used in training can cause the learning algorithm to become biased towards the dominating class46. In order to balance the different class frequencies, and thus their contributions to the loss function, we introduce a weighting coefficient η for each semantic class. The coefficient is defined as:

$${\alpha }_{i}=\frac{{\sum }_{i=0}^{N-1}{p}_{i}}{{p}_{i}}$$
$${\eta }_{i}=\frac{{\alpha }_{i}}{{\sum }_{i=0}^{N-1}{\alpha }_{i}}$$

where pi is the number of pixels belonging to class i in the training set, and \({\sum }_{i=0}^{N-1}\,{p}_{i}\) is the total pixel count over all classes. The loss function is updated accordingly:

$$Los{s}_{weighted}=\eta \times Loss$$

In this way, the importance of sparse classes (in terms of pixel area) is corrected.
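
The weighting scheme above translates directly into code. A minimal NumPy sketch follows, where `labels` holds the stacked per-pixel class maps of the training set (the function name is illustrative).

```python
import numpy as np

def class_weights(labels, n_classes=7):
    """Per-class weights eta_i computed from pixel counts p_i."""
    p = np.array([(labels == i).sum() for i in range(n_classes)], dtype=float)
    alpha = p.sum() / p         # alpha_i = (sum_k p_k) / p_i
    return alpha / alpha.sum()  # eta_i = alpha_i / sum_k alpha_k
```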

Regularization strategy

Because our network architecture is relatively deep and the availability of data is limited, regularization is needed to mitigate the generalization error of the algorithm47. Among the variety of regularization techniques available, we applied L2 regularization and dropout. L2 regularization applies a penalty on large network parameters and forces them to be relatively small48. Dropout refers to a technique where a fraction of randomly selected activations is ignored during training. It helps to reduce overfitting by not allowing the model to become heavily dependent on the output of one or a few neurons. According to Srivastava et al.49, Gaussian dropout can perform better than classical Bernoulli dropout. Gaussian dropout is equivalent to multiplying the activations by Gaussian-distributed random noise with unit mean and a standard deviation defined as follows:

$$Drop=\sqrt{\frac{\lambda }{1-\lambda }},\lambda \in (0,1)$$

where λ is the drop probability.

In our proposed architectures, we utilized the L2 regularization and Gaussian dropout strategies. The L2 regularization is applied after each activation function. Dropout is added after the last two convolutional layers. As shown in Fig. 2, Gaussian dropout with λ = 0.5 is added after the convolutional layers which have 1024 kernels.
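
In Keras, the two regularizers attach to the layers as follows. The L2 penalty coefficient (here 1e-4) and the kernel size are not reported in the paper and are assumed values for illustration.

```python
from keras.layers import Conv2D, GaussianDropout
from keras.regularizers import l2

def regularized_wide_conv(x, weight_decay=1e-4):
    """A 1024-kernel CONV layer with an L2 penalty on its weights,
    followed by Gaussian dropout with drop probability lambda = 0.5
    (noise standard deviation sqrt(0.5 / (1 - 0.5)) = 1)."""
    x = Conv2D(1024, (3, 3), padding='same', activation='relu',
               kernel_regularizer=l2(weight_decay))(x)
    return GaussianDropout(rate=0.5)(x)
```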

Evaluation strategy

To assess the performance of different architectures, we computed several metrics50:

  1. Pixel Accuracy: the ratio between the number of correctly classified pixels and the total number of pixels.

    $$pixelAcc=\frac{{\sum }_{i=0}^{N-1}{p}_{ii}}{{\sum }_{i=0}^{N-1}{\sum }_{j=0}^{N-1}{p}_{ij}}$$
  2. Mean Intersection over Union (meanIU): measures the intersection over union of the labeled segments for each class and reports the average. It is the ratio between the number of true positives (intersection) and the sum of true positives, false negatives, and false positives (union).

    $$meanIU=\frac{1}{N}\mathop{\sum }\limits_{i=0}^{N-1}\frac{{p}_{ii}}{{\sum }_{j=0}^{N-1}{p}_{ij}+{\sum }_{j=0}^{N-1}{p}_{ji}-{p}_{ii}}$$
  3. Per-class accuracy: the proportion of correctly labeled pixels on a per-class basis. The \(perClassAcc\) for class \(i\) is defined as:

$$perClassAc{c}_{i}=\frac{{p}_{ii}}{{\sum }_{j=0}^{N-1}{p}_{ij}}$$

where N is the number of classes and pij is the number of pixels of class i inferred to belong to class j. For class i, pii represents the true positives (pixels correctly classified), pij (for j ≠ i) represents the false negatives (pixels of class i wrongly assigned to other classes), and pji (for j ≠ i) represents the false positives (pixels of other classes wrongly assigned to class i).
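
All three metrics can be computed from a single confusion matrix; a NumPy sketch follows, where `conf[i, j]` holds p_ij (the function name is ours).

```python
import numpy as np

def segmentation_metrics(conf):
    """pixelAcc, meanIU, and per-class accuracy from a confusion matrix
    conf with conf[i, j] = p_ij (pixels of class i predicted as class j)."""
    conf = conf.astype(float)
    tp = np.diag(conf)                                # p_ii
    pixel_acc = tp.sum() / conf.sum()
    union = conf.sum(axis=1) + conf.sum(axis=0) - tp  # TP + FN + FP
    mean_iu = np.mean(tp / union)
    per_class_acc = tp / conf.sum(axis=1)             # p_ii / sum_j p_ij
    return pixel_acc, mean_iu, per_class_acc
```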

Empirical studies to identify the F-CNN architecture

In this paper, we study different encoder and decoder architectures to identify the final architecture that delivers the highest validation accuracy and yields low generalization error for new data. We vary the details of the encoder and decoder structures independently. We first investigate the encoder structure, as it plays a crucial role in learning distinctive features from the input dataset and has a strong effect on the computational performance of the F-CNN. To develop an encoder architecture for the task of crack inspection, we evaluated the proposed F-CNN model P and two other models. The architectures of the two other models (Models A and B) are shown in Fig. 4. Across the three architectures, the number of CONV layers was varied: Models A, B, and P used 6, 13, and 16 CONV layers, respectively. The last convolutional layer is added to map the decoder output to the N categories.

Figure 4

Model A and Model B architectures.

The accuracy and loss plots for Models A and B and the proposed Model P are presented in Fig. 5. The evaluation results on the test data are listed in Table 2; the time shown in the table is the combined training and validation time. It was observed that the test accuracy is better for more complex models. A possible reason is the complexity of the crack features: as their complexity increases, the encoder structure needs more convolutional layers to extract the abstract contextual features from input images for the subsequent pixel-level prediction. It is worth noting that as the accuracy increases, the computing time also increases, since more parameters are trained in the system. Therefore, we did not increase the complexity of the model beyond this design. The accuracy and loss plots also demonstrate that the training and validation accuracy of Model P climbs faster than those of Models A and B. Thus, Model P was chosen for further improvement of our system.

Figure 5

Accuracy and loss plots as a function of epochs for several models.

Table 2 Performance metrics for the five investigated architectures. Three accuracy metrics were measured after 50 epochs with learning rate LR = 0.001.

Decoder structure development

While the encoder part of the architecture produces low-resolution image representations, or feature maps, the role of the decoder part is also significant, as it maps those low-resolution representations to pixel-wise predictions for segmentation. Two additional decoder structures, Models C and D in Fig. 6, are investigated in our experiments. Models C and D differ from our proposed Model P in the number of upsampling layers and feature fusions. In Model C, we upsampled the last convolutional layer, fused the feature information with the fourth pooling layer feature maps, and then upsampled to the size of the input image for pixel prediction. In Model D, more intermediate feature representations learned from the input images in the encoding part are fused into the decoding part for inference. The accuracy and loss plots for the training and validation datasets are presented in Fig. 5. The test results are listed in Table 2. Our results indicate that Model P is superior in both test accuracy and training time. Therefore, Model P is chosen as our final model.

Figure 6

Model C and Model D architectures.

The above studies resulted in the final F-CNN architecture shown in Fig. 2. The F-CNN contains 16 stacked convolutional layers, two feature fusion layers, and three upsampling layers. Each convolutional layer is followed by a ReLU activation function. The architecture has roughly 41.4 million parameters to be estimated. Such a high-dimensional model is prone to overfitting, given the relatively small dataset under consideration. To mitigate overfitting, we applied data augmentation during data preparation, L2 regularization after each convolutional layer, and Gaussian dropout on the last two convolutional layers during training. Finally, in the decoding part, we fused the feature information extracted from the last convolutional layer with the feature maps obtained after the third and fourth pooling layers for image prediction.
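
For concreteness, the sketch below assembles a network consistent with this description (16 CONV layers, fusion with the fourth and third pooling outputs, three upsampling layers), reusing the `encoder_stage` and `upsample_and_fuse` helpers sketched earlier. The per-stage filter counts and kernel sizes are our assumptions (a VGG-style layout is assumed), not the published configuration; L2 regularizers are omitted for brevity.

```python
from keras.layers import Input, Conv2D, Conv2DTranspose, GaussianDropout
from keras.models import Model

def build_fcnn(n_classes=7, input_shape=(480, 320, 3)):
    inp = Input(shape=input_shape)
    x, skips = inp, []
    # Five encoder stages: 13 CONV + ReLU layers with max-pooling.
    for n_convs, n_filters in [(2, 64), (2, 128), (3, 256), (3, 512), (3, 512)]:
        x = encoder_stage(x, n_filters, n_convs)
        skips.append(x)
    # Two wide CONV layers (1024 kernels) with Gaussian dropout (lambda = 0.5).
    x = Conv2D(1024, (7, 7), padding='same', activation='relu')(x)
    x = GaussianDropout(0.5)(x)
    x = Conv2D(1024, (1, 1), padding='same', activation='relu')(x)
    x = GaussianDropout(0.5)(x)
    # Final CONV layer scoring the N categories (16th convolution).
    x = Conv2D(n_classes, (1, 1), padding='same')(x)
    # Decoder: fuse with the 4th and 3rd pooling outputs, then upsample 8x
    # to the input resolution with a per-pixel softmax.
    x = upsample_and_fuse(x, skips[3], n_classes)
    x = upsample_and_fuse(x, skips[2], n_classes)
    x = Conv2DTranspose(n_classes, (16, 16), strides=(8, 8), padding='same',
                        activation='softmax')(x)
    return Model(inp, x)
```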

The final architecture is trained for 200 epochs to obtain an estimate of its generalization performance. The run time for the 200-epoch training process is 62 hours. The final model accuracy and loss plots are presented in Fig. 7. The plots show that the performance climbs until around 125 epochs and then plateaus. Although slight overfitting is observed in the last 50 epochs, the trained model is acceptable for the classification task at hand.

Figure 7

The accuracy and loss plots for our proposed Model P.

Detection of the degradation mechanism

The evaluation results on the test data demonstrate good performance, as shown in Table 3. The final pixel-level prediction accuracy achieved by the trained model is 92.8%, with a mean IU of 72.5%. Table 3 also lists the per-class accuracy for class 0 (background). The background class has been introduced to handle inaccuracies of the manual annotation. The accuracy for this class is relatively low; however, it does not affect the overall performance of the model.

Table 3 Class accuracies for the final architecture of the network.

Detailed results corresponding to the inspection of degradation mechanisms in example PV backsheets are depicted in Fig. 8. The three columns in Fig. 8 show examples of test images, the corresponding manually labeled images, and the outputs predicted by the trained Model P, respectively. The test images were selected to demonstrate the various types of mechanisms marked in the legend at the bottom of the figure. The different colors in the second and third columns indicate the different crack classes shown in the color bar in Fig. 8. For example, the test image in the first row simultaneously exhibits four cracking mechanisms (parallel cracking, delamination, transverse branching, and longitudinal branching). The F-CNN model correctly labeled all mechanisms regardless of the different sizes of the individual class regions. The predicted crack types and their locations correspond well to the manually labeled classes. All examples depicted in this figure consistently demonstrate the good performance of our model.

Figure 8

Six examples of the crack inspection task performed on test images (left column) using the trained Model P. The different colors in the middle and right columns indicate the different crack classes listed in the legend below.

In summary, in this work, the utility and efficacy of a fully convolutional neural network architecture for degradation mechanism type detection in PV backsheets has been demonstrated. One of the main contributions of the paper is the development of a F-CNN model that has demonstrated excellent performance in the task of identifying different types of degradation mechanisms in PV backsheets. The pixel-level prediction accuracy of the developed F-CNN model is 92.8%, and the test time per image is 2.1 seconds. The presented results demonstrate the applicability of fully convolutional networks in the defect detection domain. The proposed architecture was developed by varying the encoding and decoding configuration.

In our framework, we focus on developing a system that provides high prediction accuracy. Therefore, more emphasis is placed on the evaluation metrics than on execution speed. The execution time is also provided as a reference for further improvement of the system.

The developed F-CNN approach is generic and can be adapted to a broad class of segmentation tasks in materials science. Our approach could replace the manual annotation of micrographs performed by microscopy experts, or at least suggest initial annotations. In this sense, our model is applicable to any material system, as long as sufficient data is available for model training. Micrographs could also be automatically annotated with the underlying mechanisms, or a series of micrographs could be used to construct an entire phase diagram. Initial successes of machine learning in these areas have been reported recently. For example, micrographs of ultra-high carbon steel have been classified using machine learning51. In another steel system, dual-phase steel, the damage mechanism has been detected using deep learning52. Finally, machine learning has recently been leveraged to construct the phase diagram of low carbon steel53.

Method

The F-CNN was developed using Theano (version 1.0.2) and Keras (version 2.2.0). Keras is a high-level neural network application programming interface that enables fast experimentation; in particular, it supports prototyping of various convolutional neural network architectures. Theano is one of the backend engines for evaluating mathematical expressions involving multi-dimensional arrays. The code was developed in Python 2.7.13 and is available on GitHub (https://github.com/Binbin16/Degradation-Mechanism-Detection-By-FCNN). All the experiments in the presented work were conducted on a Linux OS with 12 × 2.66 GHz Intel Xeon X5650 processor cores and 2× Nvidia M2050 Tesla GPUs.
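
As a usage illustration, inference on a single image block looks like the following; the file names are hypothetical and not taken from the repository.

```python
import numpy as np
from keras.models import load_model

# Hypothetical usage: per-pixel degradation classes for one image block.
model = load_model('model_p.h5')               # trained F-CNN (illustrative name)
block = np.load('block.npy')                   # shape (480, 320, 3), scaled to [0, 1]
probs = model.predict(block[np.newaxis, ...])  # shape (1, 480, 320, 7)
pred = np.argmax(probs[0], axis=-1)            # class labels 0-6 per pixel
```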