Abstract
Deep learning methods have shown significant potential in tool wear lifecycle analysis. However, there are fewer open source datasets due to the high cost of data collection and equipment time investment. Existing datasets often fail to capture cutting force changes directly. This paper introduces QIT-CEMC, a comprehensive dataset for the full lifecycle of titanium (Ti6Al4V) tool wear. QIT-CEMC utilizes complex circumferential milling paths and employs a rotary dynamometer to directly measure cutting force and torque, alongside multidimensional data from initial wear to severe wear. The dataset consists of 68 different samples with approximately 5 million rows each, includes vibration, sound, cutting force and torque. Detailed wear pictures and measurement values are also provided. It is a valuable resource for time series prediction, anomaly detection, and tool wear studies. We believe QIT-CEMC will be a crucial resource for smart manufacturing research.
Similar content being viewed by others
Background & Summary
Metal milling is crucial for producing high-precision parts, but tool wear over time reduces machine accuracy1. An effective tool replacement strategy is essential for maintaining high-quality production and prolonging tool life. Monitoring tool wear is crucial for tool replacement decisions, coating improvements, and new material development. Researchers assess machining performance of new materials by analyzing wear, surface roughness, and cutting temperature2,3. Many researchers use cutting force signals4, cutting temperature5, spindle current power6and other indirect signals in the machining process to obtain the mapping relationship between these signals and tool wear to achieve good monitoring results. The emergence of artificial intelligence (AI) has opened up new opportunities for tool forecasting, and many works are beginning to utilize AI for intelligent forecasting7,8.
With the rapid advancement of big data and deep learning, datasets have become indispensable in studying the full lifecycle of tool wear9,10,11. These datasets provide rich samples and diverse working conditions, enabling models to predict wear under various scenarios accurately. They capture different wear stages and subtle changes, offering extensive training data to improve model robustness and accuracy. Furthermore, the accumulation of datasets facilitates in-depth research and comparison of different evaluation methods, driving advancements in tool wear detection technologies12,13.
Many datasets on tool wear have been proposed in recent years. Table 1 compares existing tool wear datasets with the proposed QIT-CEMC dataset. As shown, open-source datasets are limited and predominantly use linear cutting methods. Only the IEEE NUAA IDEAHOUSE14 and SDU-QIT15 datasets feature non-linear cutting. The PHM2010 dataset16 focuses on tool wear and remaining useful life prediction under constant experimental conditions, terminating at a wear value of 150 micrometers. The Milling Data Set17 involves multi-layer coated carbide tools with signals like acoustic emission, vibration, and current, primarily addressing single-edge wear. The IEEE NUAA IDEAHOUSE dataset14 captures vibration, spindle current, and power signals, suitable for single-edge wear analysis in square spiral cutting paths, but lacks force signals. Other datasets also face limitations, such as single cutting paths or limited signal types, and some are not open-source.
This paper introduces QIT-CEMC, a comprehensive dataset on the full lifecycle wear of coated end mills. The data was collected during the milling of Ti6Al4V, a challenging material commonly used in aerospace and medical industries18,19. Unlike traditional linear milling, this study employs a complex circumferential milling path, generating periodic signals that offer new insights into tool path and wear analysis. The dataset includes direct measurements of cutting force and torque using a rotary dynamometer. It also contains multidimensional signals, including three-channel force, torque, vibration, and sound. QIT-CEMC is fully open-source, providing a valuable resource for research into tool wear under varied conditions. Here are the advantages of the QIT-CEMC Dataset:
-
Diverse Signal Types: QIT-CEMC captures a wide range of signals, including forces (in x, y, and z directions), torque, sound, and vibrations (in x, y, and z directions). In addition, offline images of various types of tool wear are provided. This provides comprehensive data for tool wear analysis. In contrast, datasets like PHM201016 and MATW120 focus on fewer signal types, such as cutting forces, vibrations, and acoustic emissions.
-
Advanced Sensor: QIT-CEMC employs a variety of advanced sensors, including a rotary dynamometer, sound sensors, and accelerometers, offering precise measurements. Other datasets, such as IEEE NUAA-IDEAHOUSE16 and SDU-QIT15, typically rely on one or two sensors, often excluding sound or force sensors.
-
Varied Machining Paths: QIT-CEMC uses a circular machining path to simulate complex real-world conditions, facilitating research on tool wear under different paths. Most other datasets, like MATW120 and NUJST21, employ linear paths, missing out on the unique wear information circular paths provide.
-
Comprehensive Wear Analysis: QIT-CEMC not only focuses on main cutting edge wear but also includes end teeth and four tool regions wear, offering a more holistic analysis. This is in stark contrast to existing datasets, such as MATW120 and Ross et al.22, which generally emphasize single-edge wear or surface morphology.
Methods
Data acquisition
The QIT-CEMC dataset was generated using a four-flute coated carbide end mill to machine Ti6Al4V titanium alloy along a circular path23,24, as shown in Fig. 1. This alloy is widely used in the aerospace industry for its excellent properties but presents significant machining challenges. The workpiece was a titanium alloy rod with a diameter of 11.3 cm and a height of 400 mm, yielding a base area of 100 cm2. The end mill had a diameter of 10 mm, made from YG3X carbide (composed of 96.5% WC, 0.5% TaC/NbC, and 3% Co), with a TiAlN coating and a 55° helix angle.
The milling cutter used for the experiment has four edges, we measured the wear characterization on the back face of the four edges on the peripheral edge of the milling cutter including the wear value at the maximum wear point, the wear value at the 1/2ap point, and the area of the wear area; in addition, we measured the wear characterization on the back face of the four edges on the bottom edge including the maximum wear value and the area of the wear area. The milling tool used had a rake angle of 10, a flank angle of 15, and a corner radius of 0.8 mm. As shown in Fig. 2, the milling tool has a diameter of 10 mm with a tolerance of ±0.020 mm, an overall length of 75 mm, a cutting length of 25 mm, and helix angles of 38 and 41.
The process involves three main stages: equipment setup and calibration, signal acquisition, and tool wear measurement. As shown in Fig. 3, the experiment used a cylindrical titanium alloy workpiece. First, the rotary dynamometer was mounted on the spindle of the vertical machining center, with accelerometers and a microphone positioned to accurately capture vibration and audio signals. The tool and workpiece were then securely mounted on the worktable. During signal acquisition, the rotary dynamometer measured cutting forces, accelerometers captured vibration signals, and the microphone recorded sound signals. These signals were wirelessly transmitted to a PC, where they were recorded and displayed in real-time for preliminary analysis. Then the PC software processed the data. It recorded cutting forces in the x, y, and z directions, as well as torque. Vibration parameters, including accelerations in the x, y, and z directions, were also recorded. Additionally, the software captured sound signal characteristics. After machining, the tool was examined under a microscope to measure wear on the side and end teeth, with metrics such as maximum wear width (VBmax) and wear area (SVB) documented. The specific parameters of the conditions are shown in Table 2.
The tool wear measurement procedure is illustrated in Fig. 4. On the right side, the workflow for wear measurement is presented. First, the system, including the microscope, camera, and lighting, is calibrated. The tool is positioned on the platform, aligning the wear area with X- and Y-axis controllers. Lighting and focus adjustments are made for optimal image clarity. A CCD color camera captures high-resolution images of the wear area, which are then analyzed by software to quantify key wear metrics, such as maximum wear and wear area, allowing for accurate tool wear assessment.
For wear measurement, we used a 19JPC-V microscope with a resolution of 0.0005 mm (glass grating). The maximum inaccuracy is defined as \(1+\frac{L}{100},\mu {\rm{m}}\), where L represents the measurement length in mm, and the measurement range is 200 × 100 mm. To maintain accuracy, the environment was controlled at a temperature of 20 ± 1° C, with fluctuations limited to 0. 5° C per hour and a temperature difference of no more than 0. 5° C between the measured part and the instrument. We recorded maximum wear values for each of the four circumferential edges, wear values at mid-depth, and the wear zone area. Additionally, we measured maximum wear and wear zone area on the bottom edge and calculated average wear values across the four edges.
The QIT-CEMC dataset was gathered using a Kistler 9170B251 dynamometer and includes 68 milling samples, each generating approximately 5 million records. These records contain data on vibration, sound, cutting force, torque, and detailed tool wear measurements. The milling parameters used are listed in Table 3. Tool wear measurements are divided into online and offline methods. Online methods are efficient but may suffer from noise and vibration-related inaccuracies. By contrast, the QIT-CEMC dataset employs offline methods, where the machining process is halted to allow precise measurements with high-accuracy instruments. For cutting force measurements, the Kistler 9170B251 dynamometer was operated at a 10 kHz sampling rate to capture dynamic force variations accurately. Vibration data was recorded with an accelerometer near the tool holder, also sampled at 10 kHz, and acoustic emission (AE) signals were captured by a AE sensor mounted on the spindle housing. To ensure signal clarity, a low-pass filter at 1 kHz was applied to the force and vibration data to minimize high-frequency noise, while a high-pass filter at 50 kHz was used on AE data to isolate relevant emission events.
Data process
The raw data collection process includes invalid signals and unavoidable baseline wander25, necessitating signal processing. Figure 5 illustrates the distribution of these invalid signals, which primarily occur during the initial tool engagement and withdrawal phases. Additionally, due to the circular path used during cutting, invalid signals are also observed during path transitions. Figure 6 shows the data processing workflow. A binary Gaussian mixture model (GMM)26 is employed to cluster the signals based on energy frequency. As shown in Fig. 6 invalid signals of raw data mentioned above are eliminated.
To address the presence of baseline wandering problem, this work uses the moving average method for correction27. Figure 7 shows the data signal after the moving average method, and it can be seen that the baseline wander is well corrected.
Data Records
The QIT-CEMC dataset records four-dimensional data: vibration, sound, cutting force, and torque, along with measurements of end and side cutting edge wear. The vibration and sound data are collected using Donghua DH5922D sensors. These data are stored in .csv files. The cutting force and torque data are gathered with the Kistler 9170B251 force gauge. These data are saved in .txt files. The detailed content and structure of the QIT-CEMC dataset are shown in Fig. 8. The records for each file of the dataset are listed below:
-
Force and torque data folder: The collected data was renamed and saved in txt format. Each file contains five columns of data: the first column represents time, while the remaining four columns correspond to the X, Y, and Z direction cutting force and the torque.
-
Vibration and sound data folder: Each sample acquisition is recorded in a csv file containing five columns of data. The first column represents time, while the remaining five columns the first column represents time, while the remaining four columns correspond to the X, Y, and Z direction vibration and the sound. The data for the second sample was lost because the acquisition software disconnected.
-
Image data folder: The folder contains 68 subfolders, each documenting 8 images of tool wear from both the end and side teeth.
-
Tool Wear Health indicators: The CSV file comprises 12 initial columns, each representing the maximum flank wear width (VBmax), wear width at half the depth of cut (VB in 1/2 ap) and wear area (S), for each of the tool’s four cutting edges. The subsequent columns contain the maximum wear width (VBmax) and wear area (S) for the end teeth of these cutting edges.
Technical Validation
To verify the data’s usability, we analyzed the distribution of the collected data and the tool wear trends. Figure 9 displays data distribution histograms and KDE distributions. Each distribution is generally symmetrical, indicating a near-normal shape without significant skewness or extreme outliers. This pattern suggests that the data is consistent and free from major anomalies, affirming the dataset’s reliability and usability for further analysis. The absence of irregularities or unexpected patterns in the distributions confirms that the data acquisition process was conducted properly. This validation supports the dataset’s suitability for subsequent computational modeling and ensures its readiness for high-precision applications.
Figure 10 illustrates the tool wear trend in the dataset. The wear rate is slow during the initial stage, remains relatively stable in the middle stage, and accelerates sharply in the final stage. This pattern aligns with real-world observations and matches the wear curves from datasets in reference16 and reference20, confirming the dataset’s reliability.
Usage Notes
This dataset facilitates a comprehensive, data-driven study on the full life cycle of tool wear. The raw dataset is available for download at https://doi.org/10.6084/m9.figshare.2732334628. Additionally, we provide a Python script, data processing.py, which handles the processing of raw data, including the removal of invalid data and baseline wander correction. The tool wear.csv file contains wear measurements for the four tool edges—side and end edges, respectively. All data and script are accessible in the repository.
Code availability
All Python code scripts for data processing are publicly accessible in figshare28.
References
Li, Y., Meng, N., Zhuang, H., Li, X. & Ye, W. Physics-informed meta learning for machining tool wear prediction. J. Manuf. Syst. 62, 17–27, https://doi.org/10.1016/j.jmsy.2021.10.013 (2022).
Usca, Ü. A. et al. Tool wear, surface roughness, cutting temperature and chips morphology evaluation of Al/TiN coated carbide cutting tools in milling of Cu-B-CrC based ceramic matrix composites. Journal of Materials Research and Technology 12, 1243–1259, https://doi.org/10.1016/j.jmrt.2021.12.063 (2022).
Korkmaz, M. E. et al. Indirect monitoring of machining characteristics via advanced sensor systems: a critical review. International Journal of Advanced Manufacturing Technology 120, 7043–7078, https://doi.org/10.1007/s00170-022-09286-x (2022).
Zhang, Y. et al. Tool wear condition monitoring method based on deep learning with force signals. Sensors 23, 4595, https://doi.org/10.3390/s23104595 (2023).
Liu, D. et al. Tool wear monitoring through online measured cutting force and cutting temperature during face milling Inconel 718. The International Journal of Advanced Manufacturing Technology 122, 729–740, https://doi.org/10.1007/s00170-022-09950-2 (2022).
Kundu, P. et al. A novel current sensor indicator enabled WAFTR model for tool wear prediction under variable operating conditions. Journal of Manufacturing Processes 82, 777–791, https://doi.org/10.1016/j.jmapro.2022.08.036 (2022).
Pimenov, D. Y. et al. Artificial intelligence systems for tool condition monitoring in machining: analysis and critical review. Journal of Intelligent Manufacturing 34, 2079–2121, https://doi.org/10.1007/s10845-022-01923-2 (2023).
Tran, M.-Q. et al. Machine learning and IoT-based approach for tool condition monitoring: A review and future prospects. Measurement 207, 112351, https://doi.org/10.1016/j.measurement.2022.112351 (2023).
Zhu, K., Meng, N., Zhang, H., Sun, X. & Ye, W. Physics-informed hidden Markov model for tool wear monitoring. J. Manuf. Syst. 72, 308–322, https://doi.org/10.1016/j.jmsy.2023.11.003 (2024).
Li, H. et al. A review on physics-informed data-driven remaining useful life prediction: Challenges and opportunities. Mech. Syst. Signal Process. 209, 111120, https://doi.org/10.1016/j.ymssp.2024.111120 (2024).
Yang, J. et al. Tool wear prediction based on parallel dual-channel adaptive feature fusion. Int. J. Adv. Manuf. Technol. 128, 145–165, https://doi.org/10.1007/s00170-023-11832-0 (2023).
Huang, Z. et al. Tool wear prediction based on multi-information fusion and genetic algorithm-optimized Gaussian process regression in milling. IEEE Trans. Instrum. Meas. 72, 1–16, https://doi.org/10.1109/TIM.2023.3280531 (2023).
He, J. et al. Cross-domain adaptation network based on attention mechanism for tool wear prediction. J. Intell. Manuf. 34, 3365–3387, https://doi.org/10.1007/s10845-022-02005-z (2023).
Liu, C., Li, Y., Li, J. & Hua, J. A Meta-Invariant Feature Space Method for Accurate Tool Wear Prediction Under Cross Conditions. IEEE Trans. Ind. Inform. 18, 922–931, https://doi.org/10.1109/TII.2021.3070109 (2022).
Xin, M. et al. SDU-QIT End Mill Tool Wear Experimental Dataset. J. Mech. Eng. 58, 166–171 (2022).
Li, X. et al. Fuzzy neural network modelling for tool wear estimation in dry milling operation. Ann. Conf. PHM Soc. 1, 1 (2009).
Agogino, A. & Goebel, K. Milling Data Set. BEST Lab, UC Berkeley, NASA Prognostics Data Repository, NASA Ames Research Center, Moffett Field, CA, (2007).
Peta, K. et al. Scale-dependent wetting behavior of bioinspired lubricants on electrical discharge machined Ti6Al4V surfaces. Tribol. Int. 194, 109562, https://doi.org/10.1016/j.triboint.2024.109562 (2024).
Buhairi, M. A. et al. Review on volumetric energy density: influence on morphology and mechanical properties of Ti6Al4V manufactured via laser powder bed fusion. Prog. Addit. Manuf. 8, 265–283, https://doi.org/10.1007/s40964-022-00328-0 (2023).
De Pauw, L., Jacobs, T. & Goedemé, T. MATWI: A Multimodal Automatic Tool Wear Inspection Dataset and Baseline Algorithms. In Computer Vision Systems, Christensen, H. I., Corke, P., Detry, R., Weibel, J.-B., & Vincze, M. (Eds.), 14253, 255–269, https://doi.org/10.1007/978-3-031-44137-0_22 (2023).
Pan, Y. et al. NJUST-CCTD: An Image Database for Milling Tool Wear Classification with Deep Learning. Int. J. Adv. Manuf. Technol. 127, 3681–3698, https://doi.org/10.1007/s00170-023-11418-w (2023).
Ross, N. S. et al. A Novel Approach of Tool Condition Monitoring in Sustainable Machining of Ni Alloy with Transfer Learning Models. J. Intell. Manuf. 35, 757–775, https://doi.org/10.1007/s10845-023-02074-8 (2024).
Bakar, H. N. A. et al. Wear mechanisms of solid carbide cutting tools in dry and cryogenic machining of AISI H13 steel with varying cutting-edge radius. Wear 523, 204758, https://doi.org/10.1016/j.wear.2023.204758 (2023).
Inácio, R. H. et al. Suppressing notch wear by changing the tool path in the side milling of a Ti-6Al-4V alloy. Int. J. Adv. Manuf. Technol. 125, 453–463, https://doi.org/10.1007/s00170-022-10741-y (2023).
Li, H. et al. Descod-ecg: Deep score-based diffusion model for ecg baseline wander and noise removal. IEEE J. Biomed. Health Inform., https://doi.org/10.1109/JBHI.2023.3237712 (2023).
Reynolds, D. A. Gaussian mixture models. Encyclopedia of Biometrics 741, 659–663 (2009).
Luzia, R., Rubio, L. & Velasquez, C. E. Sensitivity analysis for forecasting Brazilian electricity demand using artificial neural networks and hybrid models based on Autoregressive Integrated Moving Average. Energy 274, 127365, https://doi.org/10.1016/j.energy.2023.127365 (2023).
Li, N. et al. A multi-feature dataset of coated end milling cutter tool wear whole life cycle. figshare https://doi.org/10.6084/m9.figshare.27323346 (2024).
Truchan, H., Naumov, E., Abedin, R., Palmer, G. & Ahmadi, Z. Multimodal Isotropic Neural Architecture with Patch Embedding. In Neural Information Processing, Luo, B., Cheng, L., Wu, Z.-G., Li, H., & Li, C. (Eds.), 14447, 173–187, https://doi.org/10.1007/978-981-99-8079-6_14 (2024).
You, Z. et al. A Novel Evaluation Metric Based on Dispersion of Wear Distance for In Situ Tool Condition Monitoring. IEEE Trans. Instrum. Meas. 72, 1–10, https://doi.org/10.1109/TIM.2022.3224995 (2023).
Peng, Y. et al. A tool wear condition monitoring method for non-specific sensing signals. Int. J. Mech. Sci. 263, 108769, https://doi.org/10.1016/j.ijmecsci.2023.108769 (2024).
Acknowledgements
This work was founded by the Shandong Provincial Key Research and Development Program (2024CXPT011), the National Natural Science Foundation of China (62176050), the National Key Research and Development Program of China (2024YFB3312302), the Qilu Institute of Technology for Talent Project (QIT24TP027), the Regional Joint Fund for Basic and Applied Basic Research Fund of Guangdong Province Key Project (2021B1515120066), and the Shandong Provincial Natural Science Foundation (ZR2023MF085).
Author information
Authors and Affiliations
Contributions
Na Li, Xiao Wang, Wanzhen Wang and Miaomiao Xin contributed to data collection and analyze. Wanzhen Wang wrote the code for data processing. Xiao Wang and Mingqiang Zhang reviewed the statements of data records. Xiao Wang and Dongfeng Yuan analyzed the results and designed the experiments. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Li, N., Wang, X., Wang, W. et al. A multi-feature dataset of coated end milling cutter tool wear whole life cycle. Sci Data 12, 16 (2025). https://doi.org/10.1038/s41597-024-04345-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41597-024-04345-2