Deep learning based atomic defect detection framework for two-dimensional materials

Chen, Fu-Xiang Rikudo; Lin, Chia-Yu; Siao, Hui-Ying; Jian, Cheng-Yuan; Yang, Yong-Cheng; Lin, Chun-Liang

doi:10.1038/s41597-023-02004-6

Download PDF

Data Descriptor
Open access
Published: 14 February 2023

Deep learning based atomic defect detection framework for two-dimensional materials

Scientific Data volume 10, Article number: 91 (2023) Cite this article

6845 Accesses
22 Citations
4 Altmetric
Metrics details

Subjects

Abstract

Defects to popular two-dimensional (2D) transition metal dichalcogenides (TMDs) seriously lower the efficiency of field-effect transistor (FET) and depress the development of 2D materials. These atomic defects are mainly identified and researched by scanning tunneling microscope (STM) because it can provide precise measurement without harming the samples. The long analysis time of STM for locating defects in images has been solved by combining feature detection with convolutional neural networks (CNN). However, the low signal-noise ratio, insufficient data, and a large amount of TMDs members make the automatic defect detection system hard to be applied. In this study, we propose a deep learning-based atomic defect detection framework (DL-ADD) to efficiently detect atomic defects in molybdenum disulfide (MoS₂) and generalize the model for defect detection in other TMD materials. We design DL-ADD with data augmentation, color preprocessing, noise filtering, and a detection model to improve detection quality. The DL-ADD provides precise detection in MoS₂ (F2-scores is 0.86 on average) and good generality to WS₂ (F2-scores is 0.89 on average).

Experimental and theoretical studies of native deep-level defects in transition metal dichalcogenides

Article Open access 29 October 2022

Rapid and accurate predictions of perfect and defective material properties in atomistic simulation using the power of 3D CNN-based trained artificial neural networks

Article Open access 02 January 2024

Recent advances and applications of deep learning methods in materials science

Article Open access 05 April 2022

Background & Summary

Transition metal dichalcogenides (TMDs), as two-dimensional (2D) materials, have been predicted to have huge application potential in the solar energy and semi-conductor industry^1,2,3,4,5. Nevertheless,TMDs usually obtain many defects because of the low formation energy of chalcogen vacancies⁶. When attempting to construct a high-performance field-effect transistor, defects can affect contact resistance⁷, result in Fermi-level pinning^7,8, and reduce carrier mobility^9,10.

To prevent defects from affecting the performance of TMD devices, researchers usually use optical methods^11,12, probing techniques^13,14,15,16 and transmission electron techniques^17,18 to measure the defect distribution on each TMD surface. Optical methods have advantages, such as large area and element detection, especially excellent in X-ray photoelectron spectroscopy measurement¹⁹. However, the spatial resolution of optical methods cannot exceed the diffraction limit of approximately 0.1 µm and can sometimes harm the sample because of heat generated from pumping light. The problem can be conquered by applying probing techniques, particularly scanning tunneling microscopy (STM) which can provide ultra-high-resolution images without harming the sample. Because of the extremely high resolution of the image, STM measurements exhibit zero deviations and element recognition in defect density estimation. Yet, the STM measurements require scanning hundreds of images and counting thousands of defects from images one by one, which takes a lot of time. Therefore, estimating the defect density of samples by STM will require roughly dozens hours for counting defects. Artificial intelligence techniques are used to solve this problem.

Deep learning-based STM analysis provides substantial advantages but also presents problems when applying sensing defect variance in TMDs-based field-effect transistor (FET). Based on many training images, most studies have achieved high accuracy in detecting defects^20,21,22. They used 3500 to 7500 images in their reports, depending on the signal-to-noise ratio. Using STM to collect this amount of data must take months. Furthermore, the STM tip is frequently exposed to residual chemical and poor conducting areas, while scanning FETs, resulting in images with highly unstable quality and low signal-to-noise ratio. Density functional theory (DFT) calculation is used to generate a large amount of mimic data to solve this problem²³. However, inadequate and low-quality experiment data can still result in model overfitting and showing low accuracy.

Furthermore, an increasing number of materials that can be applied to FET will require defect detection. A well-trained model must be retrained when applying to different materials²² because the appearance of defects are changed, resulting in a large data collection cost and model training time. The combination of these problems makes detecting defects in TMD-based FETs difficult.

In this study, we propose a deep learning-based atomic defect detection framework (DL-ADD) for diagnosing defects in TMD-based FETs to address the problem of insufficient and low-quality data and to improve model generality. In DL-ADD, we design a data augmentation module, a color preprocessing module, a noise filtering module, and a detection model to reduce analysis time and improve detection accuracy. We also develop a data augmentation module to generate more pseudo data for the training model to depress overfitting. Color preprocessing, and noise filtering modules are designed to improve the quality of data. We implement U-Net²⁴ as the detection model to accurately locate and identify the atomic defect because it can segment images faster and more precisely. Furthermore, we conduct experiments based on molybdenum disulfide (MoS₂) and tungsten disulfide (WS₂) images to evaluate the accuracy and the generality of DL-ADD. The defects are defined as impurities^8,25 and voids^13,26 since these two defects are frequently exist in MoS₂ and WS₂. The proposed achieves an F1-score of 0.89 for impurities and 0.80 for voids in MoS₂ with an extremely small amount of data (approximately 70 images). We use the MoS₂ model to differentiate the WS₂ defects without retraining. The F1-score for voids in WS₂ is 0.94, that is, the proposed DL-ADD can be effectively used for TMD-based FETs with limited and low-quality data and widely applied to other 2D materials.

Methods

As shown in Fig. 1, we propose a deep learning-based atomic defect detection framework (DL-ADD) for 2D materials. Three data preprocessing modules and a detection model were included. The code of DL-ADD is accessible at GitHub: https://github.com/MeatYuan/MOS2.

Experimental detail

This research applies room temperature STM (RT-STM) to scan MoS₂ and WS₂ surfaces at vapor pressure 1 × 10⁻¹⁰ torr. Single crystal WS₂ and MoS₂ bulk were both bought from Structure Probe Inc. The WS₂ and MoS₂ crystals were applied with the different processes before scanned to increase the number of impurities and voids. The MoS₂ sample was kept at normal pressure after cleaved by mechanical exfoliation and were transferred to make field effect transistor. The WS₂ sample was cleaved at an ultra-high vacuum chamber and heated for 200 for 12 hours to increase only the number of voids. All images are scanned in constant current mode with −1 V sample bias and 1 nA tunneling current. By the help of this method, statistical property of defect density was researched¹⁶.

Data augmentation

To train an accurate model, a certain amount of data is required. However, collecting 2D material images requires much labor and takes a long time. Therefore, we designed a data augmentation process to increase the amount of training data. The process of data augmentation is shown in Fig. 2. To augment data and maintain the diversity of data, we first manually create the defect data and then augment more data using traditional augmentation methods. We cut the voids and impurities from the training data. After that, we selected a clear background of the MoS₂ surface and pasted the cut defects into the background images. The proportions of impurities and voids are randomized to simulate real images. We used the Gaussian blur on images to reduce the discontinuity of the edge to make the boundary of the cut images and background smoother. Finally, we randomly choose horizontal flip, vertical flip, rotation, and shift methods to augment data.

Color preprocessing

Compared to color images, the gray images are easier to remove noise and reduce data difference. Therefore, we resize images to 256 × 256 pixels and convert them to gray in this module.

Noise filtering

In scanning probe microscope research, the quality of experiment data strongly depends on tip conditions. When dealing with a rough sample (root mean square roughness above 1 nm), a tip can easily hit the sample, resulting in a large amount of data scanned in bad conditions. Under this condition, high-frequency noise shows stronger intensity than signals on the image, making it more difficult for the model to detect defects. We first use Fast Fourier Transform (FFT) on the images, then a low pass filter to remove high-frequency signals before performing inverse FFT. Finally, we adjust the images contrast higher to make the impurity and void more visible.

Detection model

We choose U-Net as the defect detection model since it can be trained with few images and achieve precise segmentation. U-Net²⁴ is a convolutional network architecture that consists of a contracting path for downsampling and an expansive path for upsampling, as shown in Fig. 3. Six identical parts are used in the downsampling, each comprising two convolutional layers with a 3 × 3 kernel size and one max pooling layer. Also six identical parts are used in the upsampling, each comprising two convolutional layers with a 3 × 3 kernel size, one transposed convolution layer, and a concatenate with the same size as the previous feature map. Finally, we use three convolutional layers to output a 256 × 256 size image.

Data Records

The actual size of MoS₂ and WS₂ images are separately 50 × 50 nm² and 60 × 60 nm² stored in Bitmap format (BMP) with 256 pixels, which is the most suitable size to distinguish and detect those defects. All of the data is kept at Open Science Framework at OSF²⁷: https://doi.org/10.17605/OSF.IO/ZXGTJ

Technical Validation

In this section, we will evaluate DL-ADD. We first compare DL-ADD and YOLOv4²⁸ and then conduct ablation studies of image preprocessing module and data augmentation module of DL-ADD. Furthermore, to demonstrate the generality of DL-ADD, we apply it to WS₂ material which also contains voids and impurities. A void can be labeled as a black dip appears on the surface, while an impurity can be labeled as bright protrusion. We scanned the entire scannable area in one MoS₂ FET sample and obtained 90 images to evaluate DL-ADD in the following experiments. Seventy images are used as training dataset while twenty images are used as testing dataset.

The evaluation metrics are recall, precision, F1-score and F2-score. False Positive (FP) is referred to as pseudo defects. False Negative (FN) is the defects that the model does not detect. True Positive (TP) is the true defects that are correctly classified. F-Score is the harmonic average of precision and recall. Since the defect is more important, we add F2-score in the following result.

$$Recall=\frac{TP}{TP\,+\,FN}$$

(1)

$$Precision\,Rate=\frac{TP}{TP\,+\,FP}$$

(2)

$${F}_{\beta }=(1\,+\,{\beta }^{2})\frac{Precision\ast Recall}{({\beta }^{2}\ast Precision)\,+\,Recall}$$

(3)

Model evaluation

Table 1 compares DL-ADD with YOLOv4 to evaluate the model, indicating that F1-score of DL-ADD outperforms the YOLO model in terms of voids and impurities. As shown in Fig. 4, the loss of DL-ADD drops faster and more dramatically compared to YOLOv4. The difference of loss clearly indicates YOLOv4 is more difficult to converge. This may be due to the complexity of YOLOv4 increasing the necessity of data numbers. As such, the simple structure of the U-Net of DL-ADD can achieve better performance based on little data. Some of the DL-ADD and YOLO results are shown in Fig. 5.

Table 1 The accuracy of DL-ADD and YOLOv4 models.

Full size table

Data augmentation validation

In DL-ADD, we use data augmentation by manually imitating data. We validate the effect of data augmentation in Table 2. Both defects are detected more accurately after the process is used. The augmentation process primarily balance model precision and recall relief the miss-alarm problem.

Table 2 The detection accuracy of DL-ADD model with and without data augmentation.

Full size table

Noise filtering validation

The noise signals mainly exist in the high-frequency region, while the defect signals are mostly found in the low-frequency region. Thus, we design a low pass filter in DL-ADD to improve the quality of images. We show the effects of noise filtering methods in Table 3. The results show that our procedure successfully improves the signal-to-noise ratio and significantly improves accuracy regardless of recall or precision.

Table 3 The detection accuracy of DL-ADD with and without preprocessing.

Full size table

Generality validation

Some 2D materials are similar. If DL-ADD can be generalized to these materials, we can save much model training time. We apply DL-ADD which is trained with MoS₂ to detect the defects of WS₂. The results of detecting defects on 15 WS₂ images are shown in Fig. 6 and Table 4. The F1-Score and F2-Score of DL-ADD are significantly higher than those of YOLO. The impurity data of the two training models in the MoS₂ report are barely different, but the difference of void data is more obvious. Among the two materials, the accuracy of DL-ADD does not decrease too much, but the F1-Score of YOLO is less than 0.6, indicating that YOLO is completely overfitted on MoS₂. That is, the generality of DL-ADD is better than YOLO.

Table 4 Generality validation to WS₂ of each model.

Full size table

Training and inference time

In addition to improving accuracy, the detection time is essential to defect detection. We compare the training and inference times of the two models, as shown in Table 5. We use the RTX 3090 to train two models. DL-ADD takes 29 minutes to finish model training, and YOLOv4 takes 91 minutes, more than three times that of DL-ADD. In the inference speed, DL-ADD takes 0.17 seconds per image, and YOLOv4 takes 0.15 seconds per image, and there is not much difference between the two.

Table 5 Training and inference time of each model.

Full size table

Usage Notes

In this study, we proposed a DL-ADD framework to efficiently detect atomic defects in TMD-based FET. In DL-ADD, we designed an image preprocessing module, a data augmentation module, and a detection model. With low quality and small amounts of data, DL-ADD performed well in detecting defects (F-score > 0.85 on average) and resisting intense noise, significantly lowering the restriction to apply this model for FET research. The good generality of DL-ADD further increased the convenience of this model, allowing to be used as a simple label tool while changing to other materials in the TMD family. Overall, DL-ADD demonstrated strong noise resistance, high accuracy, and good generality. This study lowered the threshold for using CNN models with the aforementioned properties and significantly improved the efficiency of the atomic defect detection process.

Code availability

All the code to produce the results of this paper is accessible at: https://github.com/MeatYuan/MOS2.We all use Python and jupyter notebook.

References

Balis, N., Stratakis, E. & Kymakis, E. Graphene and transition metal dichalcogenide nanosheets as charge transport layers for solution processed solar cells. Materials Today 19, 580–594 (2016).
Article CAS Google Scholar
Singh, E., Kim, K. S., Yeom, G. Y. & Nalwa, H. S. Two-dimensional transition metal dichalcogenide-based counter electrodes for dye-sensitized solar cells. RSC Adv. 7, 28234–28290 (2017).
Article ADS CAS Google Scholar
Wu, K., Ma, H., Gao, Y., Hu, W. & Yang, J. Highly-efficient heterojunction solar cells based on two-dimensional tellurene and transition metal dichalcogenides. Journal of Materials Chemistry A 7, 7430–7436 (2019).
Article CAS Google Scholar
Rawat, B., Vinaya, M. & Paily, R. Transition metal dichalcogenide-based field-effect transistors for analog/mixed-signal applications. IEEE Trans Electron Devices 66, 2424–2430 (2019).
Article ADS CAS Google Scholar
Liu, X. et al. High performance field-effect transistor based on multilayer tungsten disulfide. ACS nano 8, 10396–10402 (2014).
Article CAS PubMed Google Scholar
Guo, Y., Liu, D. & Robertson, J. Chalcogen vacancies in monolayer transition metal dichalcogenides and fermi level pinning at contacts. Applied Physics Letters 106, 173106 (2015).
Article ADS Google Scholar
McDonnell, S., Addou, R., Buie, C., Wallace, R. M. & Hinkle, C. L. Defect-dominated doping and contact resistance in MoS₂. ACS nano 8, 2880–2888 (2014).
Article CAS PubMed Google Scholar
Bampoulis, P. et al. Defect dominated charge transport and fermi level pinning in MoS₂/metal contacts. ACS Appl. Mater. Interfaces 9, 19278–19286 (2017).
Article CAS PubMed PubMed Central Google Scholar
Wu, Z. et al. Defects as a factor limiting carrier mobility in WSe₂: A spectroscopic investigation. Nano Research 9, 3622–3631 (2016).
Article CAS Google Scholar
Lin, Z. et al. Defect engineering of two-dimensional transition metal dichalcogenides. 2D Materials 3, 022002 (2016).
Article Google Scholar
Verhagen, T., Guerra, V. L., Haider, G., Kalbac, M. & Vejpravova, J. Towards the evaluation of defects in MoS₂ using cryogenic photoluminescence spectroscopy. Nanoscale 12, 3019–3028 (2020).
Article CAS PubMed Google Scholar
Mignuzzi, S. et al. Effect of disorder on raman scattering of single-layer mo s 2. Physical Review B 91, 195411 (2015).
Article ADS Google Scholar
Vancsó, P. et al. The intrinsic defect structure of exfoliated MoS₂ single layers revealed by scanning tunneling microscopy. Sci. Rep. 6, 1–7 (2016).
Article Google Scholar
Barja, S. et al. Identifying substitutional oxygen as a prolific point defect in monolayer transition metal dichalcogenides. Nature Commun. 10, 1–8 (2019).
Article ADS CAS Google Scholar
Tumino, F., Casari, C. S., Li Bassi, A. & Tosoni, S. Nature of point defects in single-layer MoS₂ supported on Au (111). J. Phys. Chem. C 124, 12424–12431 (2020).
Article CAS Google Scholar
Chen, F.-X. R. et al. Visualizing correlation between carrier mobility and defect density in mos2 fet. Applied Physics Letters 121, 151601 (2022).
Article ADS CAS Google Scholar
Yang, S.-H. et al. Deep learning-assisted quantification of atomic dopants and defects in 2D materials. Advanced Science 8, 2101099 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, K. et al. Stem image analysis based on deep learning: Identification of vacancy defects and polymorphs of MoS₂. Nano Letters (2022).
Gali, S. M., Pershin, A., Lherbier, A., Charlier, J.-C. & Beljonne, D. Electronic and transport properties in defective MoS₂: impact of sulfur vacancies. J. Phys. Chem. C 124, 15076–15084 (2020).
Article CAS Google Scholar
Rashidi, M. & Wolkow, R. A. Autonomous scanning probe microscopy in situ tip conditioning through machine learning. ACS nano 12, 5185–5189 (2018).
Article CAS PubMed Google Scholar
Krull, A., Hirsch, P., Rother, C., Schiffrin, A. & Krull, C. Artificial-intelligence-driven scanning probe microscopy. Communications Physics 3, 1–8 (2020).
Article Google Scholar
Gordon, O. et al. Scanning tunneling state recognition with multi-class neural network ensembles. Rev. Sci. Instrum. 90, 103704 (2019).
Article ADS Google Scholar
Choudhary, K. et al. Computational scanning tunneling microscope image database. Scientific data 8, 1–9 (2021).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention, 234–241 (Springer, 2015).
Schuler, B. et al. Electrically driven photon emission from individual atomic defects in monolayer WS₂. Sci. Adv. 6, eabb5988 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Schuler, B. et al. How substitutional point defects in two-dimensional WS₂ induce charge localization, spin–orbit splitting, and strain. ACS nano 13, 10520–10534 (2019).
Article CAS PubMed Google Scholar
Chen, F.-X. R. et al. TMD-FET defect autodetection at STM images. OSF https://doi.org/10.17605/OSF.IO/ZXGTJ (2022).
Jiang, P., Ergu, D., Liu, F., Cai, Y. & Ma, B. A review of yolo algorithm developments. Procedia Computer Science 199, 1066–1073 (2022).
Article Google Scholar

Download references

Acknowledgements

This work is jointly sponsored by the National Yang Ming Chiao Tung University, National Central University, Yuan Ze University, Ministry of Education (MOE), National Science and Technology Council (NSTC), and Center for the Semiconductor Technology Research from The Featured Areas Research Center Program under the projects “Higher Education Sprout”, NSTC 110-2222-E-008 -008 -MY3, NSTC 109-2634-F-009-029 and NSTC 110-2112-M-A49 -013 -MY3, respectively.

Author information

Authors and Affiliations

Department of Electrophysics, National Yang Ming Chiao Tung University, Hsinchu City, Taiwan
Fu-Xiang Rikudo Chen, Yong-Cheng Yang & Chun-Liang Lin
Department of Computer Science and Information Engineering, National Central University, Taoyuan City, Taiwan
Chia-Yu Lin
Department of Electrical and Computer Engineering, University of California, Davis, CA, USA
Hui-Ying Siao
Department of Computer Science and Engineering, Yuan Ze University, Taoyuan City, Taiwan
Cheng-Yuan Jian

Authors

Fu-Xiang Rikudo Chen
View author publications
Search author on:PubMed Google Scholar
Chia-Yu Lin
View author publications
Search author on:PubMed Google Scholar
Hui-Ying Siao
View author publications
Search author on:PubMed Google Scholar
Cheng-Yuan Jian
View author publications
Search author on:PubMed Google Scholar
Yong-Cheng Yang
View author publications
Search author on:PubMed Google Scholar
Chun-Liang Lin
View author publications
Search author on:PubMed Google Scholar

Contributions

Under the supervision of Chia-Yu Lin (C.-Y. Lin) and Chun-Liang Lin (C.-L. Lin), Fu-Xiang Rikudo Chen (F.-X. R. Chen) and Hui-Ying Siao (H.-Y. Siao) constructed the idea of DL-ADD framework. The STM data was collected by Yong-Cheng Yang (Y.-C. Yang) and F.-X. R. Chen. The DL-ADD is firstly developed by H.-Y. Siao and updated by Cheng-Yuan Jian (C.-Y. Jian). The U-net model was trained by H.-Y. Siao and C.-Y. Jian, and the YOLO model was trained by F.-X.R. Chen. The manuscript was written by C.-Y Lin, C.-L. Lin, F.-X.R. Chen and C.-Y. Jian.

Corresponding author

Correspondence to Chia-Yu Lin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chen, FX.R., Lin, CY., Siao, HY. et al. Deep learning based atomic defect detection framework for two-dimensional materials. Sci Data 10, 91 (2023). https://doi.org/10.1038/s41597-023-02004-6

Download citation

Received: 12 October 2022
Accepted: 06 February 2023
Published: 14 February 2023
Version of record: 14 February 2023
DOI: https://doi.org/10.1038/s41597-023-02004-6