Abstract
This paper describes a long-term cycling dataset of repurposed lithium-ion batteries originally used in electric vehicles. After their initial automotive use – referred to as their “first life” – these batteries were redeployed for stationary energy storage applications, representing their “second life”. The dataset covers six distinct use cases modelled to represent real-world energy storage applications. Unlike other published datasets, which focus on new or first-life cells, this work exclusively features second-life batteries with no available data from prior usage. It is aimed at supporting research into battery degradation, state of health prediction, and performance benchmarking under aged conditions. Data was collected using a Chroma 17020 batter cycler and originally stored as large, continuous log files. This paper outlines the steps taken to reorganise and clean the data – handling missing values and segmenting it into individual cycles – while preserving its raw experimental content. Cycling occurred intermittently between 2019 and 2025, offering a uniquely long observation period. The dataset supports both academic and industrial research into battery ageing and second-life applications.
Similar content being viewed by others
Background & Summary
Lithium-ion batteries are widely used across various applications, including consumer electronics, electric vehicles (EVs), domestic energy storage systems (ESS) and grid-scale ESS’s1. As the adoption of EVs has grown, this has in turn resulted in an increasing number of EV batteries reaching the end of their first life within vehicles2,3,4. However, these batteries are typically retired when they have around 70–80% of their original capacity remaining, making them unsuitable for automotive use, but still viable for less demanding second-life applications. Instead of being immediately recycled, they can be repurposed for uses such as grid support, renewable energy storage, and backup power solutions, extending their overall usability and reducing waste5,6,7.
There is extensive research on tracking degradation and capacity fade in new, first-life lithium-ion batteries. These studies are mainly conducted under controlled laboratory conditions, using consistent charge-discharge patterns, and full depth of discharge (DOD) cycles. Publicly available datasets also focus on small-capacity cells (≤2 Ah), which are cheaper to test in bulk, but do not reflect real-world operational conditions. These datasets commonly assume a battery starts at full capacity and degrades predictably to an end-of-life (EOL) threshold of 80% capacity8,9. While this has enabled the development of various state of health (SOH) estimation models, they are largely tailored to idealised scenarios involving new batteries.
In contrast, second-life batteries (SLBs), which are repurposed after their initial use in electric vehicles, often enter service with an already reduced and unknown capacity. Their degradation trajectories are shaped by their first-life usage patterns, cycling histories, and varying DOD levels, which are often undocumented. This makes it difficult to apply traditional SOH estimation methods directly. Although SLBs may still retain 70–80% of their original capacity and are viable for stationary applications such as grid support, backup power, and renewable energy storage, there are very few publicly available long-term cycling datasets that reflect the potential of second-life usage.
A key limitation in SLB research is the lack of standardised and diverse datasets. In practice, SLBs are sourced from a wide range of EV manufacturers, and come with varying chemistries, capacities, size, and shapes. To address this, there is a growing need for datasets that reflect the full range of SLB characteristics and usage histories. By expanding the diversity and scale of available data, more robust and generalisable models for SOH estimation and degradation can be developed. Table 1 below provides an overview of some publicly available datasets with details of the number of cells, the nominal capacity, cell chemistry, test conditions, and the lowest recorded state of health (SOH):
The primary aim of this paper is to provide researchers with access to detailed, long-term cycling data from SLBs operating in realistic application scenarios. Given the limited availability of such datasets, particularly for batteries beyond their first life, this work supports efforts in SOH estimation, degradation analysis, and the evaluation of SLB performance across varied operating conditions. The dataset is particularly suited for developing and validating predictive models and guiding operational strategies for repurposed energy storage.
This dataset delivers on the need to address gaps in test data for larger cells and second life application by providing long-term cycling data for SLBs across six different use cases. Each use case represents a unique operational scenario, incorporating variations in charge-discharge profiles, rest periods, and cycling intensities. Six SLBs were tested, each corresponding to a specific use case, allowing for a broad range of experimental conditions. Capacity tests were performed periodically, with full capacity values recorded at the beginning and end of each use case. The dataset was recorded at a sampling rate of approximately every 2.8 seconds, offering high-resolution insights into performance trends and degradation over time.
This dataset has also been utilized in other published studies. The first study set the foundation for this dataset by developing the experimental framework for testing these SLBs10 – “Sweat Testing Cycles of Batteries for Different Electrical Power Applications”. This study explored six different use cases for a domestically located battery system and used MATLAB to generate charge and discharge profile that represented real-world operating conditions over a year-long period. Statistical analysis methods, including the Haar transform and a pragmatic approach, were applied to condense these waveforms into programmable steps, forming the basis for the cycling protocols used in this dataset10.
Further analysis using this dataset was conducted in the following research paper11 - “Second-Life Battery Capacity Estimation and Method Comparison”. This research investigated the effectiveness of different heuristic state-of-health (SOH) estimation methods originally designed for first-life capacity estimations. The study evaluated various algorithms to assess their suitability for forecasting battery capacity fade in second-life applications. The findings highlighted that a modified Gaussian regression technique could provide a practical and reliable approach for estimating battery capacity in SLBs.
Finally, this dataset was also used in the development of a standardized methodology for generating incremental capacity curves12 – “A Review of Methods of Generating Incremental Capacity-Differential Voltage Curves for Battery Health Determination”. The proposed method was validated using three datasets: two publicly available datasets13,14, and one of the use cases from this dataset. By establishing a consistent approach to deriving incremental capacity curves, this study aimed to improve the reliability and repeatability of SOH assessments for Li-ion batteries. These prior studies demonstrate the broad application of this dataset and its potential to support further research into second-life battery performance especially in the field of AI and machine learning approaches to degradation.
Sweat Testing Methodology
The battery cycling tests were conducted using a Chroma 17020 battery cycler, which served as the central platform for programmable charge-discharge operations. The raw data were logged by the Chroma system and saved as.DB files, which were later extracted and converted into.csv format. Each.csv file corresponds to a single cycle and contains key parameters, including voltage, current, charge/discharge duration, and temperature.
Figure 1 shows the Chroma 17020 system, comprising multiple test channels capable of parallel cycling. Each channel is controlled via a central PC, allowing real-time acquisition and management of test parameters. The cleaning and processing steps for the data extraction is further explained afterwards.
Experimental setup for the sweat test cycling using a Chroma 17020 battery cycler.
Each test channel was configured with an individual battery module houses within a custom-designed mechanical rig. The rig securely clamps the modules in place, ensuring a stable electrical connection throughout the testing period. Figure 2 shows a closer view of this fixture, where current and voltage leads were directly attached to the positive and negative terminals of each module.
Chroma bespoke test rig for loading batteries.
These electrical connections were interfaced with the power condition units (for executing the charging-discharging routines) and the data acquisition system (for recording voltage, current, and auxiliary measurements), both integral to the Chroma tester. A central PC Current running the Chroma software controlled the test profiles, logged the data, and managed channel-level communication. Figure 1 also highlights this interface, including the screen showing active cycling parameters.
Figure 3 presents a schematic representation of a single test channel. This includes voltage and current sensing wires connected directly to the battery terminals, as well as thermocouples paced on the side of each module to capture surface temperature data. The sensors and power conditioning circuity are routed back to the Chroma’s internal monitoring system and then to the central control PC for logging and feedback control. Each channel in the multi-channel setup follows the same configuration.
Illustration of a singular channel of the Chroma system for testing the SLBs.
The batteries tested were retired lithium-ion modules previously used in an EV. Each module consists of two pouch cells in series and two in parallel (2S2P configuration). As the first-life history of these modules is not available, all data presented in this work was collected from the beginning of their second-life usage.
Key specifications of the modules are as follows:
-
Cell chemistry – Lithium Manganese Oxide (LMO) and Lithium Nickel Oxide (LNO)
-
Nominal capacity (new) – 66 Ah
-
Minimum voltage – 5 V per module
-
Nominal voltage – 7.5 V per module
-
Maximum voltage – 8.3 V per module
-
Dimensions: 303 mm (Length) × 223 mm (Width) × 35 mm (Height)
-
Weight: 3.8 kg
Figs. 4 and 5 display images of the SLB modules tested in this study, with Fig. 5 specifically highlighting the positive and negative terminals of a singular module.
Plan view image of the second-life lithium-ion modules being tested.
Plan view of the terminals of the second-life lithium-ion modules being tested.
The term “sweat testing” refers to a method for replicating battery usage profiles over extended periods, enabling accelerated evaluation of battery behaviour under practical operating conditions. In this study, testing was conducted in a factory environment rather than a temperature-controlled laboratory, providing a closer approximation to real-world use. As a result, ambient temperature was not strictly regulated, and the SLBs were exposed to natural room temperature fluctuations across seasons. These ambient temperatures were recorded via the thermocouples attached to each module, and the corresponding temperature data is included in the dataset.
Rather than applying simple, uniform cycling protocols, sweat testing involves encoding diverse, real-world energy use profiles into programmable charge-discharge sequences. This allows for the simulation of operational stresses from applications such as residential storage, grid-services, and market trading, which helps capture the long-term effects of irregular cycling and varying load intensities under ambient environmental conditions.
The dataset includes six distinct use cases, each designed to simulate different real-world applications. More details of each use can be found in the previous research paper10.
-
1.
PV: A house with four people and a photovoltaic (PV) panel using the battery module to absorb excess energy when the PV panel generates more power than is consumed in the house. The battery module then releases this stored energy when needed.
-
2.
PV-TOU: A house with four people and PV panels operating a time-of-use (TOU) tariff, where energy is stored and discharged based on tariff fluctuations.
-
3.
TOU: A house with four people but no PV, where the battery module charges during low-tariff periods and discharges during peak tariff times to reduce electricity costs.
-
4.
FFR: The battery participates in an aggregated firm frequency response (FFR) system, charging and discharging dynamically in response to grid frequency fluctuations.
-
5.
EFR: Similar to FFR but with more rapid cycling, optimizing the battery module for performance on the enhance frequency response (EFR) market.
-
6.
Day Ahead: The battery module is used as part of an aggregated system to trade energy in the day-ahead electricity market, charging when prices are low and discharging when prices increase.
The first three use cases follow a quarterly cycle, with each quarter having a different number of charge-discharge steps, whilst the last three use cases following a year-long cycling profile. The charge-discharge sequences consist of constant current (CC) discharges, constant-current constant-voltage (CC-CV) charges, and intermittent rest periods. Each ‘step’ in the testing profile equates to a sequence of charge, discharge, or a rest period. A brief overview of each testing profile can be found in Table 2 below, whilst more details of each step can be found in the supplementary information.
The sweat testing methodology was designed to generate representative charge-discharge waveforms for each use case based on their real-world cycling behaviour as outlined in prior work10. The development of these testing waveforms, including the statistical method used to encode them, was originally presented in the previous work10 and is summarised here to support understanding of the testing procedures used to generate this dataset.
The waveforms were extracted from MATLAB-generated charge-discharge profiles and refined to eliminate inactive periods, ensuring that the resulting dataset only reflected active battery operation.
To convert these realistic-use case waveforms into programmable test sequences, two transformation techniques were employed. The Haar wavelet transform allowed for efficient signal compression and the regeneration of transient cycle behaviour, while a statistical averaging method provided an alternative approach for battery tested that do not support high-speed transitions. Both techniques preserved the variability and stress characteristics of real-world use cases, enabling robust evaluation of SLB degradation under different operational scenarios.
The two methods are summarised as follows:
-
1.
Haar wavelet transformation
The Haar transform was used decompose the charge-discharge waveforms into a sequence of square-shaped functions. This is similar to a Fourier transform but is more optimised for capturing discrete transitions in cycling behaviour. This method is suitable for battery tested capable of high-speed cycling transitions. The resulting Haar coefficients used to extract key cycling components, which were then recombined to reconstruct a simplified but representative charge-discharge pattern that closely matched the original waveform.
-
2.
Statistical averaging
As not all battery testing equipment can support high-frequency transitions, an alternative approach was also developed. This method involved statistically analysing the total time the battery operated at different charge and discharge rates over a given period. These distributions were plotted as histograms, which were then averaged to generate a representative daily cycle for that month. This approach allowed for an approximate but practical reconstruction of the charge-discharge behaviour whilst maintaining consistency with the original cycling conditions.
In all six use cases, the first cycle was programmed to begin with a capacity test designed to establish an initial reference for battery performance. This test typically involves charging the battery module to an upper voltage threshold, followed by a controlled discharge, allowing for capacity measurement. Figure 6 illustrates this behaviour by plotting the Power against Time profiles for all six use cases for cycle 1.
Power against Time plots for all six use cases.
Additionally, Figs. 7 to 12 present the current-time profiles for each application based on their respective complete sweat test recipes. For PV, PV-TOU, and TOU, the figures show the entire January to March testing recipes, while for Day Ahead, FFR, and EFR, the full test recipes are shown in their entirety. Each figure consists of three panels: the top panel visualises the full current profile, whilst the bottom left and right panels highlight the first and final looping segments, respectively. These plots are designed to reflect realistic daily cycling behaviour for each application, enabling their direct use in long-term degradation testing. These can then be applied in long-term testing scenarios to evaluate battery performance and degradation across second-life use cases.
PV use case – Current versus time for January to March testing.
PV-TOU use case – Current versus time for January to March testing.
TOU use case – Current versus time for January to March testing.
EFR use case – Current versus time for the full test recipe.
FFR use case – Current versus time for the full test recipe.
Day ahead use case – Current versus time for the full test recipe.
Figure 7 corresponds to use case 1 (PV with FIT maximisation), where excess solar energy is stored and used later in the day. This dataset is valuable for modelling solar self-consumption in domestic settings. Figure 8 shows use case 2 (PV with FIT and time-of-use tariff), which demonstrates how batteries can be used to optimise both renewable generation and electricity pricing, providing insight into dual-incentive optimisation strategies. Figure 9 covers used case 3 (TOU tariff only, no PV), capturing charge-discharge cycles based solely on tariff periods. This can be useful for analysing battery performance in non-solar, tariff-driven applications.
Figure 10 illustrates use case 4 (FFR), showing short one-minute cycling patterns (in line with the National Grid data) aligned with grid frequency events, which can be used for understanding degradation under high-frequency response services. Figure 11 presents use case 5 (EFR), where dynamic frequency response leads to repeated deep discharges as the system maintains a nominal state of charge. Finally, Fig. 12 depicts use case 6 (Day Ahead market), which models energy trading based on price signals, supporting the analysis of SLB suitability for energy arbitrage in market-based applications10.
Taken together, these figures highlight the diversity of cycling behaviours represented in the dataset. The PV and TOU-based use cases (Figs. 7–9) show structured daily profiles with moderate depth-of-discharge variations, reflecting energy use in residential settings. In contrast, the grid response use cases (Figs. 10–11) exhibit high-frequency cycling with minimal rest periods, which may impose greater mechanical and thermal stress on the cells. The Day Ahead profile (Fig. 12) displays more irregular cycling, driven by market volatility, with both shallow and deep discharges occurring intermittently. This range of cycling patterns demonstrates the dataset’s utility for exploring how SLBs perform under varied operational stressors.
Data Extraction and Processing
To illustrate the complete processing workflow, Fig. 13 below provides a visual overview of full data extraction and processing workflow. The diagram summarises the end-to-end process, from acquiring raw.DB files through to saving the processed data and generating metadata. This flowchart serves as a high-level map, helping reader follow the structure of the following sections, which describe each stage in greater detail.
Overview of the data extraction and processing workflow.
The raw data were downloaded from the PC connected to the Chroma battery tester as.DB files and were converted into.txt format as some files exceeded the.csv size limit. Converting these files into.txt format allowed the dataset to then be processed and cleaned. The data processing and cleaning were performed using Python (version 3.12.4) within Jupyter Notebook (version 7.0.8). The primary aim of this processing was to restructure the continuous raw data into smaller, individual cycles, making it easier to interpret, analyse, and apply to different tasks such as performance evaluation, degradation analysis and other tasks. Although the data remains unfiltered, and retains its raw measurement values, key features were also further extracted from existing features to organise the data into a more usable format. The key processing steps included:
Segmentation of Testing Profile
The ‘STEP_NO’ column was used to track the sequence of charge-discharge steps over time. Some steps exhibited fluctuations between alternating short charge and discharge periods, requiring separation of cycles that consisted of repeated charge-discharge pulses from those representing longer, continuous test sequences. The STEP_NO and CYCLE_NO columns were analysed to categorize and extract structured charge-discharge periods. To define when a complete cycle ended, segmentation was performed based on the first detected minimum voltage point. This ensured that each recorded cycle captured both the initial charge and discharge behaviour, as well as the stabilization phase following voltage drops.
The dataset contained multiple experimental parameters stored within the EX_PARAMS_NAME_RECORD and EX_PARAMS_VALUE_RECORD columns. These parameters were extracted, standardized, and renamed for consistency. Some parameters included maximum and minimum voltage, average voltage, voltage difference, and ambient temperature. This transformation enabled a structured and uniform representation of additional test conditions.
Data Cleaning
Empty columns, columns with NaN values, and specific unwanted columns were removed to streamline the dataset. The following columns were excluded from the dataset as they provided no meaningful insights for analysis and were largely empty across all files: Removing these columns had no impact on the integrity or usefulness of the dataset.
-
‘DETAILCORRECTIONTIME’, ‘EXTCORRECTIONTIME’, ‘CORRECTIONFFSET’, ‘EXTREALTIME’
-
‘RECOVERY_CYCLE1_NO’, ‘EX_PARAMS_NAME’, ‘EX_PARAMS_VALUE’
-
‘REMARK’, ‘DESCRIPTION’, ‘CYCLE3_NO’, ‘CYCLE4_NO’, ‘CYCLE5_NO’
-
‘RECOVERY_CYCLE2_NO’, ‘RECOVERY_CYCLE3_NO’, ‘RECOVERY_CYCLE4_NO’, ‘RECOVERY_CYCLE5_NO’
Final Cycle Processing
The cleaned and segmented cycles were saved as individual.csv files. Each file included time-series data for the parameters defined in below. A feature column labelled ‘COMBINED’ was also added to indicate whether a given cycle had been merged from multiple datasets due to interruptions or restarts during long-term cycling tests.
Defined Parameters
The dataset includes a wide range of parameters, and Tables 3 to 5 below defines and describes all of the parameters available in the dataset. Some parameters, such as maximum voltage, minimum voltage, average voltage, voltage difference, max position, and min position, are not always present in every dataset. These parameters are extracted from the EX_PARAMS_NAME_RECORD and EX_PARAMS_VALUE_RECORD fields, but their availability depends on whether the experimental data was recorded during that specific test cycle. If these fields are missing or incomplete, the corresponding parameters cannot be extracted.
Throughout the cycling, there were some cases where the test cycles were restarted due to undervoltage protection being activated, potentially introducing additional aging effects not explicitly visible in the dataset. Testing was also paused on weekends and long holidays. To address the resulting data gaps and maintain a continuous representation of cycling, some datasets were manually merged. A feature labelled “COMBINED” (yes/no) has been added to indicate whether multiple files were combined in this way. However, it is important to note that while this merging helps to fix the discontinuities, it assumes that no significant changes occurred during these pauses, and there may still be residual inconsistencies.
The ‘Source’ column in Tables 3 to 5 indicates whether a parameter is directly from the raw Chroma output (raw), calculated or extracted from the existing raw fields (Derived), or newly created during the processing phase (Generated) to support further analysis or feature engineering.
Following the extraction and processing of the raw .DB files, the dataset sizes were compared before and after the cleaning and transformation steps. The initial sizes represent the raw extracted and converted .CSV files, while the processed sizes reflect changes due to the removal of irrelevant columns and the generation of additional cycle-level features. In some cases, the data sizes decreased due to the removal of empty or redundant fields from the raw Chroma output. However, in other cases, such as PV-TOU, TOU, and FFR, the dataset size increased slightly after processing. This increasing is primarily due to the successful extraction of additional parameters from the ‘EX_PARAMS_NAME_RECORD’ and ‘EX_PARAMS_VALUE_RECORD’ fields. When present, these fields contain embedded experimental metadata, which was parsed and used to generate new columns (e.g., MAX VOLTAGE, MIN VOLTAGE, AVG VOLTAGE, etc.). Since the presence of this metadata varies across test cycles and use cases, the resulting file size changes depend on the availability of these values during each test cycle. Table 6 below summarises the change in file size for each use case in gigabytes (GB), along with the corresponding percentage change.
Data Records
All data files associated with this study have been made publicly available through the Loughborough University Research Repository (LUPIN)15: https://doi.org/10.17028/rd.lboro.28732490.v2.
The dataset is provided as a compressed. zip archive, which contains separate folders for each of the six use cases: PV, PV-TOU, TOU, EFR, FFR, and Day Ahead. Each folder contains pre-processed cycle-level data in. csv or. txt format.
For the PV, PV-TOU, and TOU use cases, data is further organised by quarter. Within each use case folder, subfolders are labelled according to the actual test dates, using a “YYYY_Month_Month” format (e.g., 2022_April_June) to reflect the period during which testing was conducted. This allows users to easily identify and access data for specific time ranges within a given use case. The EFR, FFR, and Day Ahead use cases follow a year-long cycle format and are stored as single folders per use case, without further subdivision.
In addition to the main datasets a supplementary information folder is included, containing detailed test recipes used in the Chroma 17020 battery cycle for the six use cases. These documents provide cycle-by-cycle test descriptions for users who wish to reference or replicate specific testing protocols.
Two additional .txt files are included at the root level:
-
Changelog.txt – summarises dataset updates and file additions, including date stamps and relevant notes.
-
Requirements.txt – lists the python package dependencies used in the data extraction and processing scripts.
Technical Validation
The dataset has been validated by comparing reconstructed waveforms with original charge/discharge profiles, performing standard error checks to identify inconsistencies, preprocessing to align time-series measurements and remove anomalies, and conducting internal consistency checks to ensure accurate representation of battery behaviour.
Known incidents during testing
Several factors impacted the consistency of the dataset. This included a change in testing location of the batteries and subsequent storage over the period of the move. Test restrictions during COVID and the requirement to check calibration of the sensors approximately mid-way through the testing. Capacity tests were attempted at regular intervals, but these did not always register correctly as the CC-CV charge that precedes a full discharge did not always complete the CV part due to issues with midpoint protection11. As a result, only the start and end capacity values are reported here as these were reliably obtained. Additionally, one of the PV tests was conducted under an incorrect testing profile for an unspecified duration. This discrepancy was not actively tracked at the time and may have introduced variability or inconsistencies in the recorded charge-discharge behaviour for the particular use case.
The initial start and end capacity values for each use case are provided in Table 7 below.
Usage Notes
This dataset is intended for research applications in battery state of health prediction, aging analysis, performance benchmarking, and energy market research. It supports the development of machine learning models for SOH estimation, enables detailed tracking of degradation trends over time, provides a reference for comparing battery degradation across different cycling conditions, and is useful for evaluating the economic feasibility of second-life batteries in various applications. The dataset does not include sophisticated demand-side management or electric vehicle fast charging applications, as these were not market-ready at the time of testing. Microgrid functionality and large-scale peak load shaving were also not included due to the absence of relevant market mechanisms.
Code availability
Example Jupyter Notebook scripts for converting the raw.DB data into.csv or.txt formats are freely available in the online repository at https://doi.org/10.17028/rd.lboro.28732490.v215. These scripts serve as the starting point for data extraction and enable users to work with files that exceed.csv size limits. The repository also includes processing scripts tailored to each use case – separate scripts are provided to segment the PV-TOU, TOU, EFR and Day Ahead data into individual cycles, the PV dataset into smaller cycles, and the FFR dataset into one-minute response cycles. An additional script is provided for feature engineering, which derives and generates new columns to support further analysis. The required and optional Python libraries are listed in the ‘requirements.txt’ file in the online repository.
References
Grey, C. P. & Hall, D. S. Prospects for lithium-ion batteries and beyond - a 2030 vision. Nature Communications 11, 6279, https://doi.org/10.1038/s41467-020-19991-4 (2020).
Beatty, M., Strickland, D. & Ferreira, P. Estimating the future numbers of EV&HV battery cells available for second life repurposing. in 11th International Conference on Power Electronics, Machines and Drives (PEMD 2022) https://doi.org/10.1049/icp.2022.1020 (IET, Newcastle, 2022).
Dong, Q. et al. Cost, energy, and carbon footprint benefits of second-life electric vehicle battery use. iScience 26, 107195 (2023).
Tankou, A., Bieker, G. & Hall, D. Scaling up Reuse and Recycling of Electric Vehicle Batteries: Assessing Challenges and Policy Approaches. https://theicct.org/publication/recycling-electric-vehicle-batteries-feb-23/ (2023).
Harper, G. et al. Recycling lithium-ion batteries from electric vehicles. Nature 575, 75–86, https://doi.org/10.1038/s41586-019-1682-5 (2019).
Aguilar Lopez, F., Lauinger, D., Vuille, F. & Müller, D. B. On the potential of vehicle-to-grid and second-life batteries to provide energy and material security. Nat Commun 15 (2024).
Montes, T. et al. Procedure for assessing the suitability of battery second life applications after EV first life. Batteries 8 (2022).
Tawfik, M. I., Ali, A. & Asfoor, M. A combined trade-off strategy of battery degradation, charge retention, and driveability for electric vehicles. Sci Rep 14, 21995 (2024).
Yang, F., Xie, Y., Deng, Y. & Yuan, C. Predictive modeling of battery degradation and greenhouse gas emissions from U.S. state-level electric vehicle operation. Nat Commun 9 (2018).
Abedi Varnosfaderani, M., Strickland, D., Ruse, M. & Brana Castillo, E. Sweat testing cycles of batteries for different electrical power applications. IEEE Access 7, 132333–132342 (2019).
Yang, J., Beatty, M., Strickland, D., Abedi-Varnosfaderani, M. & Warren, J. Second-Life battery capacity estimation and method comparison. Energies (Basel) 16 (2023).
Beatty, M., Strickland, D., Ferreira, P. A Review of Methods of Generating Incremental Capacity–Differential Voltage Curves for Battery Health Determination. Energies 17, 4309, https://doi.org/10.3390/en17174309 (2024).
Oxford Battery Degradation Dataset 1 - ORA - Oxford University Research Archive. https://ora.ox.ac.uk/objects/uuid:03ba4b01-cfed-46d3-9b1a-7d4a7bdf6fac.
Severson, K. A. et al. Data-driven prediction of battery cycle life before capacity degradation. Nat Energy 4, 383–391 (2019).
Beatty, M., Strickland, D. & Ferreira, P. Long-Term Sweat Testing Data for Second-Life Batteries. Loughborough University Research Repository https://doi.org/10.17028/rd.lboro.28732490.v2 (2025).
NASA Battery Dataset. https://www.kaggle.com/datasets/patrickfleith/nasa-battery-dataset.
Scikit Learn Developers. Scikit Learn confusion matrix documentation. Scikit Learn https://scikit-learn.org/stable/auto_examples/model_selection/plot_confusion_matrix.html (2025).
Centre for Advanced Life Cycle Engineering. Battery Data | Center for Advanced Life Cycle Engineering. University of Maryland CALCE https://calce.umd.edu/battery-data (2011).
Stroebl, F. et al. A multi-stage lithium-ion battery aging dataset using various experimental design methodologies. Sci Data 11, 1020 (2024).
Luh, M. & Blank, T. Comprehensive battery aging dataset: capacity and impedance fade measurements of a lithium-ion NMC/C-SiO cell. Sci Data 11 (2024).
Acknowledgements
The authors would like to acknowledge the help provided by Powervault for access to second-life batteries and their facilities to continue to gather data for the dataset provided in this study. Furthermore, the authors would like to acknowledge the help from the following project - Innovate UK grant, Faraday challenge, 103951 project “An ultra-low cost, resource efficient second life battery solution unlocking the mass-market for home energy storage and Innovate UK project “Li.2”.
Author information
Authors and Affiliations
Contributions
Conceptualization, J.W. and D.S.; methodology M.B., D.S. and J.W.; formal analysis M.B.; investigation, M.B.; data curation, M.B. and J.C.; software, M.B.; validation D.S. and P.F.; visualisation D.S., and M.B.; M.B.; writing – original draft preparation; writing – review and editing, D.S., and P.F.; supervision, D.S., and P.F.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Beatty, M., Strickland, D., Warren, J. et al. Long-Term Sweat Testing Dataset for Second-Life Batteries. Sci Data 12, 1068 (2025). https://doi.org/10.1038/s41597-025-05360-7
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s41597-025-05360-7















