Operational data of lithium-ion batteries from battery electric vehicles can be logged and used to model lithium-ion battery aging, i.e., the state of health. Here, we discuss future State of Health definitions, the use of data from battery production beyond production, the logging & aggregation of operational data and challenges of the State of Health in automotive applications. Our suggestions could improve data transfer efficiency and data storage costs.
Introduction
Lithium-ion batteries (LIBs) are attracting increasing attention by media, customers, researchers, and industrials due to rising worldwide sales of new battery electric vehicles (BEVs) 1,2. Manufacturers of BEVs, known as original equipment manufacturers (OEMs), as well as LIB cell manufacturers & suppliers need to ensure that the battery of the BEV fulfills the warranty specifications (e.g.: USABC (US)3: 15 years and 1000 cycles; EUCAR (Europe)4: BEV life equals 150,000 km and 22–24 MWh; NEDO (Japan)5: 10–15 years and 1000–1500 cycles6). This is often accomplished by limiting the usable battery capacity through the battery management system (BMS) to accommodate for capacity loss over the lifetime of the battery. In this case, the total (gross) capacity of the battery is greater than the usable (net) battery capacity7.
When LIBs age their performance characteristics like the capacity and internal resistance change. For example, the aging-induced reduction of the capacity results in a decrease of the range of BEVs which is a key factor for customers. Commonly, the aging state of LIBs is called State of Health (SOH): the SOH compares the current state of the battery to the state of a new battery at its beginning of life (BOL). It depends on the usage and environmental conditions of the battery8,9,10. The SOH is also relevant for other battery applications like grid energy storage and electric trains11,12. The rising connectivity of machines in the context of the Internet of Things and Industry 4.0 also includes vehicle fleets so that real-world battery data from automotive applications become available on a large scale13.
Based on this situation, this paper contributes a re-evaluation on battery data and SOH with suggestions for the future development in the field, with a focus on automotive applications. Each of the suggestions is covered in a separate section of this paper covering different aspects of data, SOH, and battery.
The remainder of this paper is structured as follows and illustrated by Fig. 1:
-
The most frequently used SOH definition is the relative capacity, but also the relative storable energy content and the relative internal resistance are used13. For a more descriptive quantification and characterization of the SOH, we discuss the suitability of these definitions and propose another variant. This could lead to an improvement of SOH estimation and SOH forecasting methods.
-
Given the task split of battery cell manufacturers and BEV manufacturers, data sharing beyond general cell specification sheets is currently uncommon due to silo-thinking. We argue that it would be beneficial to expand battery-cell classification after cell production and to use data from battery manufacturing beyond cell production.
-
Research on SOH estimation, SOH forecasting, and remaining useful life (RUL) prediction has mostly been focused on data from LIB cells operated in the laboratory setting. However, in automotive applications, LIB systems combining hundreds (sometimes thousands) of cells operated under real-world conditions are subject to customers habits. We describe these differences in the operational domain (laboratory vs. real world) and the battery configuration (cell vs. system). Also, we provide examples of current challenges.
-
When intending to conduct research on operational battery data, i.e., time-series data of current, temperature, voltage, and state of charge (SOC) from BEVs, suitable data logging, storage, and potentially aggregation need to be considered with the constraints of cost and mobile connectivity.
Operational data of LIBs from BEVs can be logged and used to model LIB aging, i.e., the SOH. Here, we discuss alternative SOH definitions which could reduce ambiguity in battery research. We also discuss the use of data from battery production beyond production itself, the logging and aggregation of operational data and challenges of the SOH in automotive applications. Our suggestions could improve data transfer efficiency and data storage costs.
Minimal viable characterization of battery aging
The SOH compares the current state of the battery to the state of a new battery at its BOL8,9,10. The SOH can be defined differently depending on the view point, e.g., of a manufacturer or battery user10. Proposed state variables for the definition of the SOH that change with aging are: the number of charging/discharging cycles14, the voltage change caused by the load of a power or current profile15, AC impedance, self-discharge rate, and power density8,16,17. When using the term SOH of a battery, very often it is synonymously used for the \({{SOH}}_{{{\rm{C}}}}\), which is the relative capacity18:
with the remaining capacity \(C\left(t\right)\) relative to the nominal capacity \({C}_{{{\rm{nom}}}}\) which is specified by the battery manufacturer or OEM.
The relative storable energy (\({{SOH}}_{{\mbox{E}}}\)) is correlated to the \({{SOH}}_{{{\rm{C}}}}\) by the voltage19:
with the total energy \({E}_{\max }(t)\) and the nominal total energy \({E}_{{{\rm{nom}}}}\). The United Nations (UN) Global Technical Regulation (GTR) No. 2220 accelerates the importance of the energy and not the capacity as a measure of the SOH for developers, users, vendors, and buyers of BEVs by defining the usable battery energy (UBE), i.e., the net energy of the battery. The net energy is more relevant in practice to users of BEVs than the gross energy as it reflects the accessible and usable energy of the battery. The corresponding definition of a UBE-based \({{SOH}}_{{{\rm{UBE}}}}(t)\) similar to Eq. (2) but with explicit net energy:
has been used very rarely, e.g., by Weng et al.21. Compared to the \({{SOH}}_{{{\rm{C}}}}\) fewer research works use the \({{SOH}}_{{{\rm{E}}}}\) and \({{SOH}}_{{{\rm{UBE}}}}\) health metrics. This may become problematic in case of a potential disconnect of academia from the upcoming standards and SOH definitions used in industrial and automotive applications.
Less frequently used is the relative resistance (SOHR)22,23,24:
with the current internal ohmic resistance \(R\left(t\right)\) and the nominal internal ohmic resistance \({R}_{{{\rm{nom}}}}\). An overview of further SOHR definitions is presented in Supplementary Note 1. For further differences and advantages of \({{SOH}}_{{{\rm{C}}}}\), \({{SOH}}_{{\mbox{E}}}\), and \({{SOH}}_{{\mbox{R}}}\) see Chapter 2.1.3 in ref. 13.
Unfortunately, these definitions may falter under certain scenarios which may lead to an ambiguous description of the SOH. This is illustrated by two examples: example 1 considers two batteries that have experienced different operations in the past as shown in Fig. 1. One battery has experienced only calendar aging (blue), and the other only cyclic aging (black). At a certain point in time, using the \({{SOH}}_{{\mbox{C}}}\) definition they are equivalent, but the batteries will likely continue on a different trajectory even when operated identically (green dotted). As a similar example, Hartmann et al.25 stored two cells for 150 days at marginally different conditions (100% SOC, 45 °C vs. 80% SOC, 60 °C). Afterwards, the capacity of both cells was similar, but during following fast-charging one cell experienced sudden death (rapid capacity degradation). These examples underline that the description of the \({{SOH}}_{{{\rm{C}}}}\) fails to account for different aging mechanisms triggered throughout the operations in the past.
Example 2 considers a single battery that is subjected to high-current cycling and capacity loss, but then recovers capacity after a long rest period26,27,28. As depicted in Fig. 3, this means the battery has the same \({{SOH}}_{{\mbox{C}}}\) more than once (red opaque area) during its lifetime. This means the description of the SOH fails in this ambiguous time where \({{SOH}}_{{\mbox{C}}}\) values are not decreasing monotonically.
These examples show the limitations of relying on a single scalar, like the relative capacity (\({{SOH}}_{{\mbox{C}}}\)), to describe the SOH of a battery as noted by Baure et al.29, Rogge et al.30, and Hartmann et al.25. Also, Weng et al.21 note that similarly even accurate single point measurements of the SOH are not sufficient to predict the RUL.
When analyzing the meaning of “state of health”, and specifically “state”, definitions and perceptions of this term come from control engineering31,32, thermodynamics33, Markov decision processes (MDPs)34, and reinforcement learning34. These state variables are “a set of variables that completely describe a system”32. In other words “the state must include information about all aspects of the past […] that make a difference for the future”34. This is also called Markov property34. States “always take on the same value when the state of the system is the same again”33. In other words, state variables only depend on the system’s state and should be path independent, i.e., they do not depend on the way how a state was reached33. A common example in thermodynamics is the two independent state variables temperature and pressure of pure liquid water (one component, one phase). They can be varied independently as degrees of freedom of pure liquid water to describe the state of the system as visible in the corresponding phase diagram35,36.
When transferring this aspiration to completely describe a battery system regarding the progress of its aging mechanisms and overall state of aging (SOA), this may be very ambitious given the complexity of battery aging. Still, capturing the SOA more precisely than only by a single variable like the relative capacity or the relative resistance seems advisable. It seems that a good balance of high level, e.g., on operation level, and low-level states, e.g., on the level of aging mechanisms, is advantageous.
In essence, a “state of health” should capture the different battery aging paths and the relevant complexity of the battery’s state while avoiding too much simplicity. This aims at improving the distinguishability of the SOH of batteries with the same relative capacity, like those in the examples in Figs. 2 and 3.
a Different future SOHC trajectories (currently): SOHC trajectories of two batteries that either first face only calendar (blue) or cyclic aging (black), respectively, but have the same \({{SOH}}_{{\mbox{C}}}\) at a certain point (gray dashed). From there on, they are cycled with the same operational load. b Same or similar future \({{SOH}}\) trajectories (desired): the same operational load leads to different future \({{SOH}}_{{\mbox{C}}}\) trajectories (green dotted) despite the same operational load. However, the same future operational load should lead to the same or very similar future \({SOH}\) trajectories (green dotted). Thus, a more suitable SOH definition should be selected (illustrative examples).
How could the aging state be described more precisely? The idea of “fusing”17 different SOH variants/definitions like \({{SOH}}_{{\mbox{C}}}\), \({{SOH}}_{{\mbox{E}}}\), and \({{SOH}}_{{\mbox{R}}}\) has already been brought up by Saxena et al.17, but has not been applied widely. Fusion, in our interpretation, refers to a single scalar built from several scalars, e.g., by applying an arithmetic, harmonic, or geometric mean as defined in Supplementary Note 218. However, when calculating the mean, some scaling, e.g., min–max normalization to the range \([{\mathrm{0,1}}]\) or usage of only relative values, might be sensible to balance the importance of all scalars. Furthermore, a single scalar compresses information which eases interpretability and comparability but hardens the distinguishability of the health of different battery cells. The latter is a limitation we would like to minimize.
We promote a more precise description of SOH in the form of a vector. The objective is to concatenate several scalars to one vector so that no information is aggregated or compressed:
The vector may consist of the currently dominant \({{SOH}}_{{\mbox{C}}}\), \({{SOH}}_{{\mbox{E}}}\), and \({{SOH}}_{{\mbox{R}}}\), but also application-dependent parameters that contain information about the operational context. For a BEV this could include the mileage in kilometers and the number of equivalent full cycles. More application-independent parameters like the total energy throughput \({E}_{{{\rm{trhpt}}}}\) as well as distributional information on the SOC, voltage, and temperature during rest phases may indicate to which extend calendar aging mechanisms have been triggered. The same applies for cyclic aging mechanisms characterized by distributional information on the SOC, voltage, temperature, and current19. Including the share of rest time in the last X weeks and months may make sense to account for capacity recovery effects. Furthermore, harmful events like exceeding certain temperatures could be counted. Other examples can be found in the EU battery passport data attributes on battery “performance and durability”37.
Also, new battery materials may require a new or a more precise description of the SOA because of new different aging mechanisms. In this case again, capacity and resistance may not be sufficient to describe the SOH.
It might be sensible to store multiple capacity values obtained at different temperatures and different charging profiles as these variables influence the measured capacity. This would lead to a vector or matrix of capacities in which each scalar gets updated once a new capacity measurement at that temperature and charging profile gets measured during regular customer operation. This could tackle the problem that capacity measurements via reference cycles are more difficult to obtain in real-world BEV fleet operation than in the laboratory. Current challenges are time-consuming measurements and expensive test equipment with limited availablility38,39,40. A further improvement would be a method to make capacities comparable that have been determined under different conditions.
Figure 4 visualizes different target audience groups and characteristics of a suitable SOH definition. On OEM side the BMS designers might be more interested in the influence of voltage margins in the BMS on the SOH, while the cell designers look into path dependency and aging mechanisms. The cell manufacturer is often concerned about cell quality and sudden deaths. In contrast, a leasing company, financier of BEVs, and insurance may be interested in the future value of a battery.
An SOH defined in vector form is certainly less easy to interpret for BEV customers or fleet managers. Most likely it will be difficult for them to retrieve operationally relevant characteristics like a standardized maximum range at fixed unpredictable variables, like road conditions, wind, and small energy consumers, from an SOH vector. Thus, the target group for the vector-based SOH would not be the BEV customer or a fleet manager. However, for methods of SOH estimation, SOH forecasting, and RUL prediction, a more elaborate and precise characterization of the SOH may be helpful to better distinguish different battery states from each other. This could also benefit BEV customers or a fleet managers if the methods perform better on a vector-based SOH, even though they keep only visible access to the \({{SOH}}_{{\mbox{C}}}\) or \({{SOH}}_{{\mbox{E}}}\).
Using data from battery production after the beginning of life
Currently, at the end of battery cell production cells do not have equal properties due to the variety of production process steps and impurities in cells’ raw materials41,42,43,44. These initial cell-to-cell variations are coupled when the cells are assembled in a battery system. Then cell-to-cell variations of the capacity and impedance result in heterogeneous cell currents causing different heat generation of the cells. Then temperature gradients between cells can cause heat transfer to adjacent cells45,46. Overall, this cell-to-cell variance of aging stress factors leads to different SOH trajectories during battery life47,48,49. Despite, battery cells are often classified by the cell manufacturer into an ordinal scale of only three groups (grade A, B, and C)50,51. Heimes et al.52 also indicate three classes on page 21 in the center figure. However, there is no overall standard for this ABC classification. Every battery cell manufacturer may have its own regulations. Some cell manufacturers distinguish still fully functional grade-B and grade-C cells from grade-A cells by a higher storage time in a warehouse of several months50. However, the storage duration, storage temperature, and storage SOC are not further distinguished, even though it is known that they influence calendar aging53,54.
Another example is the battery data set presented by Sauer et al.55 which uses 48 nominally identical battery cells. The cells were graded into group C from the cell manufacturer and are drawn from the same production lot. However, despite this and cycling with the same operational protocol for charging, discharging, and rest, the initially quite small variability of the capacity increases from the mid-life at around 800–1000 cycles onwards as shown in Fig. 5. This means that there was more variance regarding the health state of the battery cells initially than captured by the grouping into group C and the \({{SOH}}_{{\mbox{C}}}\).
Given the problem of precise battery cell grading after production, we want to initiate a discussion of an adapted grading system to end silo-thinking of battery cell manufacturing and operation: we propose either maintaining ordinal grading, but introducing more groups, or with a non-ordinal, but continuous grading using ratio scales. Both can be seen as an extension of the ordinal binning of the current grading system into A, B, and C. This shall enable a more precise characterization of battery cells, regardless of making or buying batteries. In the past, OEMs have mostly bought batteries for their HEVs or BEVs from 3rd party battery cell or module manufacturers. However, many OEMs strategically shift towards investing into in-house battery production56, which often is run as a subsidiary company. Regardless in both situations, make-or-buy, data exchange needs to be standardized given large organizations and complex processes.
Variant 1 in Fig. 6 is an ordinal grading system, but more precise due to the subgroups. For example, B1 could contain cells with a storage time \({t}_{{{\rm{storage}}}}\) smaller than 1 month (\({t}_{{{\rm{storage}}}}\le 1\,{{\rm{month}}}\)), an average storage temperature of 15–20 °C (\({\varnothing T}_{{{\rm{storage}}}}\in (15\,^{\circ}\, {{\rm{C}}},\,20\,^{\circ}\, {{\rm{C}}}]\)), and an average storage SOC of 45%–55% (\({\varnothing {SOC}}_{{{\rm{storage}}}}\in (45\% ,\,55\% ]\)). Accordingly, another group is named B2 with \(1\,{{\rm{month}}}\, < {t}_{{{\rm{storage}}}}\le 6\,{{\rm{month}}}\) and the other two conditions in the same range.
Variant 2 in Fig. 6 is a continuous grading system based on several battery health measures. As discussed in the previous section, potentially more characteristics of a battery cell than capacity and internal resistance are necessary to describe the SOH of a battery. This also applies to new battery cells after production. Figure 6 uses the display style “parallel coordinates”, but also a radar chart is possible to compare battery cells and batches of battery cells with each other relatively or with an absolute required specification.
This leads to the question how data from the battery production process can be used to characterize a battery cell. Currently, we observe silo-thinking/analysis: data from battery cell production processes are used to optimize and analyze the battery cell production process steps. Data from battery operation in the laboratory and real-world applications are used in the context of battery operation. We imagine that data from battery cell production can be used to characterize a battery cell (for more information on the battery production steps consult52). Data from battery cell production could originate from end-of-line tests (end-of-line is sometimes abbreviated as EoL or EOL. However, it is not abbreviated in this work due to the ambiguity with the acronym EOL for end-of-life). Contemplable end-of-line tests are pulse tests (e.g., direct current internal resistance test), open-circuit voltage tests, and self-discharge tests. But also data about the raw material batch and deviations from the desired process parameter (time, temperature, pressure, material, etc.) could indicate inadequate production batches regarding aging. Nevertheless, such data can also be advantageously used in the operation of the same battery cells as prior knowledge on the cells. For example, Baumhofer et al.42 found the initial pulse resistance to correlate with the cell life. Schindler et al.48 point out the applicability of differential voltage analysis and differential capacity analysis (DCA) to determine differences in the active material composition correlating with the trajectory of the cells’ capacities. Of course, this requires the sharing of such more detailed data on the cells and their state at the end-of-line by the cell manufacturer to the battery buyer, like academic institutions or BEV manufacturers. Regarding the use of battery cells in 2nd life applications, Rufino Júnior57 mentions an obstacle to intermediary companies who are trading 2nd life batteries with variations of battery cells. The mentioned data could reduce this obstacle.
Challenges for state of health estimation and forecasting, remaining useful lifetime prediction in automotive applications
The tasks of SOH estimation and forecasting as well as RUL prediction, just as other algorithms and battery simulation models in the context of BMS and battery cloud solutions, face two major challenging domain shifts (DS): from laboratory or test bench operation to real-world BEV operation (DS1) and from LIB cell-level to LIB system-level (DS2) including the combination of both limitations (Cell & Lab → System & BEV).
First, battery aging in laboratory conditions differs significantly from battery aging in real-world BEV operation (DS1) as argued by43,49,58,59,60,61. In our previous work18,19, we argue that there is a continuous path towards laboratory data closer to real-world battery operation. Real-world battery operation includes idling, external charging, and driving as described in the taxonomy of battery operation in automotive applications in Supplementary Note 3. The main problem is that laboratory operation conditions are never equivalent to real-world BEV operation because they are meant to accelerate aging. Thus, it is necessary to get a deeper understanding of how real-world usage conditions affect battery aging and outline opportunities to approximate them better in the laboratory62. There is an ongoing effort on specifying more realistic laboratory tests, for example with variable discharge currents63,64 and combining cycling and calendar aging, i.e., both \(I\,\ne\,0{A}\) and \(I=0{A}\), in the same battery cell under test65. Calendar aging should also be examined in the laboratory with non-constant environment temperature like in real-world operation. This further extends to the importance of path dependency65.
Second, most of the batteries operated in laboratory are cells, but not modules or packs which are installed in BEVs (DS2). Aging of battery modules and systems is more complex than of a single cell because of the interactions of all cells. These interactions cause the degradation process of the battery pack and of the cells in that pack to depend on each other66. It is influenced by inconsistencies of cell characteristics, also known as intrinsic cell-to-cell variability47, electrical imbalance, and temperature gradients between cells that cause heat transfer to adjacent cells45,46. For data-driven methods, e.g., using machine learning, it is important to figure out how to apply them not only on cell level, but also on module-, pack- and system- level. This challenge becomes evident in two examples:
First, so far, electrochemical impedance spectroscopy (EIS) studies mostly examined individual cells or low voltage modules, with only a few studies dealing with HV battery modules and packs as required for BEVs67,68. Overall, EIS seems applicable to modules and packs with careful calibration of the model because the signals are harder to interpret for modules and packs69. Second, a virtual incremental capacity analysis (ICA) using a digital battery twin failed to yield conclusions regarding the SOH on system level, but was successful on cell level38.
Operational battery data from vehicles: logging and aggregation
Logging operational data in mobile applications faces the challenge of limited and costly bandwidth and connectivity, when sending the logged data to a central backend like a cloud. Thus, the question of what data to log, as well as what and when to send to the cloud needs to be answered for BEVs as well. This concerns the measured controller area network (CAN) signals and their sampling time.
Recommendations on the measured signals and their sampling time to capture battery operation in BEVs are given in Table 1 depending on the dynamic of the respective signals. Voltage and current require the highest sampling rate because they are the most dynamic signals in automotive applications. Especially in driving mode in the wide sense, according to the taxonomy introduced in Supplementary Note 3, the current and voltage are not constant, but highly variable and non-linear due to the possibility of huge, but brief power demands59,70,71,72. For the temperature measurement during parking mode, it may be important to take regular measurements, e.g., every hour. This ensures capturing the dynamic of the environment temperature in the course of day and night. The capacity and storable energy content only change slowly with aging over time. Thus, a daily or weekly sampling should be sufficient. For all of the signals in Table 1, not only the sampling time, but also the measurement resolution and accuracy are relevant. They also may limit the benefit of a high sampling time. These benefits may be countered by the increasing cost for data transmission and data storage61.
However, data aggregation before sending operational data to the cloud may be advisable to reduce the data transfer cost. But also battery simulation models benefit from compact and representative load profiles as efficient inputs43. This especially makes sense, when battery data of long time periods, like weeks or months of operation, regarding battery aging is stored. Overall, there is a trade-off of data aggregation between information compression, data representativeness, human-interpretability, saving data-related costs for storage, processing, and transmission. The data aggregation method must align with the specific task and objective for which the data will be used. Currently, there are a few methods on encoding operational data densely in an aggregated form:
First, one-dimensional (1D) and multi-dimensional (2D, 3D) residence time histograms of the operational parameters current, SOC, and temperature have been used19,73,74. Color-coded, i.e., heatmap style, 1D and 2D histograms are still interpretable by engineers with some practice. Also, the order of the operational states is not encoded in the histograms but would matter due to path dependency of battery aging. However, the bin width and range of the histograms’ bins have to be chosen adequately and in advance, if the aggregation is executed in the vehicle.
Second, statistical properties of histograms like range, maximum, minimum, mean, variance, skewness, and kurtosis have also been used to aggregate the histograms further75. Zhang et al.75 argue that these statistical properties of histograms may encode the correlation of battery operation and aging better than the histograms. However, the interpretability for humans may be more complex and require a understanding of these descriptive statistical properties (e.g., see Fig. 2 in ref. 75). For example, an increase of the kurtosis of the SOC-histogram corresponds to an increase of residence time in the SOC states in the distribution tails (less peakedness, i.e., light-tailed), i.e., more time is spend SOCs distant to the mean SOC which usually means in low and high SOCs. Especially, the statistical properties of multi-dimensional histograms will likely become complex to interpret because of the multidimensionality of the statistical properties. Furthermore, they are not uniquely defined76.
Third, these properties could also be directly derived from the battery operational time-series data as in Table 5 in ref. 77. Here, likewise understanding of the descriptive statistical properties is necessary to interpret them correctly.
Fourth, operational patterns of the battery could be clustered unsupervised to aggregate data. Therefore two variants regarding the temporal size of clusters exist: one cluster's weekly operational patterns78 and another cluster's driving trips79. For both, if the aggregation is executed in the vehicle, the clusters need to be determined beforehand. In general, the task of determining suitable clusters has probably many feasible results. Beneficially, the clusters have characteristics that are easy to interpret, also for BEV users, like “charging on Thursday night”78 or “charge-every-night”78. Also, the representativeness of the identified patterns needs to be ensured.
The temporal splitting into weeks or trips of the last category can be combined with the histograms from the first category: temporal sub-sequences can be generated before the aggregation by histograms. This has the advantage to maintain some information about the sequence of operation compared to pure histograms: first, the most simple approach is to aggregate the battery operation monthly, weekly, daily, or per trip as suggested in our previous work13 (see Chapter 10.1 there). For example, the battery operation of each week could be described by separate histograms. Second, the rainflow algorithm can alternatively be used to separate operational sequences, combined with aggregation by histograms80,81. Rainflow counting enables the consideration of micro-cycles.
Even though the SOC will not likely change significantly during parking, the temperature of the battery certainly will depend on day–night, seasonal, shade–sunlight, and indoor–outdoor temperature differences which influence calendar battery aging. The temperature can either be approximated based on temperature data from public weather services or logged during regular wake-ups by the vehicle. It is likely that there are times during parking where it is not economically viable to log all data.
Apart from unsuitable and limited logged data, the progress on research tasks like SOC estimation, SOH estimation, RUL prediction, and SOH forecasting is also hindered by lacking comparability of the models’ performance18,49,82: exemplarily for SOH forecasting different metrics, different output values, and different forecast horizons are used13,18. A standardization of these criteria is probably difficult to achieve given the absence of a worldwide institution and different requirements of different battery applications, most likely, if at all, United Nations Economic Commission for Europe (UNECE) or International Standards Organization. Thus currently, comparability can only be ensured by using multiple metrics, output values, and forecast horizons so that a common overlap over several methods in this regard can be achieved.
Currently, no standard data set from real-world operation exists for battery SOH forecasting models like ImageNet, MNIST, or CIFAR for image classification models (see overview Table 12 in ref. 19). Such open-source datasets give different researchers the opportunity to compare the performance results, e.g., of their data augmentation techniques83. A publicly available data set not from laboratory battery operation, but from real-world vehicle operation of battery systems would enable benchmarking of SOH forecasting models applicable in real-world vehicle operation battery systems. Thus, it would accelerate research progress.
Conclusion
This comment re-evaluates the current state of battery data and the SOH. The following suggestions for the development of the field, especially for automotive applications, were formulated:
-
An SOH definition merging several scalars to a vector format was motivated to fulfill the Markov property. It may lead to more accuracy of SOH estimation and SOH forecasting methods, while the relative capacity (\({{SOH}}_{{{\rm{C}}}}\)) may still be practically relevant customers and fleet managers.
-
Battery-cell classification after cell production might be diversified by extending the current ordinal grading system of battery cells into groups A, B, and C, potentially related to the previously proposed vector-based SOH. Also, the benefits of using data from battery manufacturing beyond cell production have been discussed.
-
To extend research on SOH estimation, SOH forecasting, and RUL prediction from battery cell & laboratory operation to battery systems & BEV operation, battery operational data is differentiated and examples are provided.
-
Battery data logging in automotive applications was discussed and methods for time-series data aggregation from battery operation like histograms, and statistical features including combinations with the generation of temporal sub-sequences were presented.
This paper focuses on the perspective on automotive applications, as mentioned in the title. Other applications like LIBs for grid storage are not so much in focus. Furthermore, passenger vehicles as automotive application are more in the foreground in this paper than trucks. Battery electric trucks (BETs) may have different characteristics regarding the battery compared to passenger vehicle: different usage, i.e., different battery load, different battery system configuration and different charging system, i.e., megawatt charging system (MCS) in the case of BETs.
A detailed discussion of the problem of transferring and scaling research results from universities to the scale of the automotive industry is still open84. Most universities conduct research on cell materials in small lab scale (often coin format) but automotive cell formats and larger cell sizes behave differently (often big prismatic or cylindrical cells). Furthermore, the scale-up of cell chemistry is expensive and difficult due to the altering of the thermal and mechanical behavior of battery cells when scaling. Additionally, automotive cell production needs to based on mass-scalable manufacturing technology, whereas cells from laboratory production can be handcrafted and individually optimized.
Overall, we observe an increasing research focus from battery cells operated in the laboratory towards battery systems in real-world applications that are only scratching the tip of the iceberg of the inherent complexity in battery systems in real-world applications. Especially, once more operational battery data from BEVs become available in the near future because of the rising adoption of BEVs in the vehicle markets research progress should accelerate.
Data availability
All data analyzed during this study are included in this published article and in the references.
References
International Energy Agency (IEA). Global Electric Vehicle Outlook 2024. Moving towards increased affordability. https://iea.blob.core.windows.net/assets/72e8cec8-c4d3-445f-a6df-28b827a31666/GlobalEVOutlook2024.pdf (2024).
Verband der Automobilindustrie e.V. (VDA). International market data. International passenger car markets mostly positive after three quarters. https://www.vda.de/en/press/press-releases/2023/231020_PM_International-market-data (2023).
United States Advanced Battery Consortium (USABC). Goals for Advanced Batteries for EVs. CY 2020 Commercialization. https://uscar.org/download/246/energy-storage-system-goals/12837/goals-for-advanced-high-performance-batteries-for-electric-vehicle-ev-applications.pdf (2019).
European Council for Automotive R&D (EUCAR). Battery requirements for future automotive applications. https://eucar.be/wp-content/uploads/2019/08/20190710-EG-BEV-FCEV-Battery-requirements-FINAL.pdf (2019).
New Energy and Industrial Technology Development Organization (NEDO). The Japanese policy and NEDO activity for future mobility. https://www.tresor.economie.gouv.fr/Articles/70dfe615-29bf-4ee4-a232-2fd2b954c025/files/59a6914a-57e1-495f-8108-abdecb7d0715 (2017).
Masias, A., Marcicki, J. & Paxton, W. A. Opportunities and challenges of lithium ion batteries in automotive applications. ACS Energy Lett. 6, 621–630 (2021).
Rachid, A. et al. Electric vehicle charging systems: comprehensive review. Energies 16, 255 (2023).
Warner, J. The Handbook of Lithium-Ion Battery Pack Design: Chemistry, Components, Types and Terminology (Elsevier, Amsterdam, The Netherlands, 2015).
Dorn, R., Schwartz, R. & Steurich, B. Battery management system. In Lithium-Ion Batteries: Basics and Applications (ed. Korthauer, R.) 165–175 (Springer, Berlin, Heidelberg, Germany, 2018).
Waag, W. & Sauer, D. U. Secondary batteries – lead–acid systems | State-of-Charge/Health. In Encyclopedia of Electrochemical Power Sources 1st edn (ed. Garche, J.) 793–804 (Elsevier, Amsterdam, The Netherlands, 2009).
Verband der Elektrotechnik Elektronik Informationstechnik e. V. (VDE). Battery Systems for Multiple Units. Emission-free drives powered by lithium-ion cells. https://www.vde.com/resource/blob/2068324/a35ebed9833dae59d8cb1451368203bd/vde-study-battery-systems-data.pdf (2018).
Dubarry, M., Tun, M., Baure, G., Matsuura, M. & Rocheleau, R. E. Battery durability and reliability under electric utility grid operations: analysis of on-site reference tests. Electronics 10, A19191-A2001 (2021).
von Bülow, F. A Data-Driven Fleet Service: State of Health Forecasting of Lithium-Ion Batteries 1st edn (Springer Fachmedien Wiesbaden GmbH, Wiesbaden, Germany, 2024).
Rezvanizaniani, S. M., Liu, Z., Chen, Y. & Lee, J. Review and recent advances in battery health monitoring and prognostics technologies for electric vehicle (EV) safety and mobility. J. Power Sources 256, 110–124 (2014).
Meissner, E. & Richter, G. The challenge to the automotive battery industry. The battery has to become an increasingly integrated component within the vehicle electric power system. J. Power Sources 144, 438–460 (2005).
Lu, L., Han, X., Li, J., Hua, J. & Ouyang, M. A review on the key issues for lithium-ion battery management in electric vehicles. J. Power Sources 226, 272–288 (2013).
Saxena, S., Xing, Y. & Pecht, M. G. PHM of Li-ion batteries. in Prognostics and Health Management of Electronics (eds Pecht, M. G. & Kang, M.) 349–375 (John Wiley and Sons Ltd, Chichester, UK, 2018).
von Bülow, F. & Meisen, T. A review on methods for state of health forecasting of lithium-ion batteries applicable in real-world operational conditions. J. Energy Storage 57, 105978 (2023).
von Bülow, F., Wassermann, M. & Meisen, T. State of Health Forecasting of Lithium-Ion Batteries Operated in a Mobility on Demand Fleet (Poster, Batterieforum Deutschland 2023, Berlin, Germany, 2023).
United Nations Economic Commission for Europe (UNECE). Addendum 22: United Nations Global Technical Regulation (GTR) No. 22. United Nations Global Technical Regulation on In-vehicle Battery Durability for Electrified Vehicles https://unece.org/sites/default/files/2022-04/ECE_TRANS_180a22e.pdf (2022).
Weng, A., Dufek, E. J. & Stefanopoulou, A. Battery passports for promoting electric vehicle resale and repurposing. Joule 7, 837–842 (2023).
Kohs, A. Batteriemodell zur Prädiktion des Gesundheitszustands von Lithium-Ionen-Batterien 1st edn (Springer Fachmedien Wiesbaden, Wiesbaden, Germany, 2022).
Farmann, A., Waag, W., Marongiu, A. & Sauer, D. U. Critical review of on-board capacity estimation techniques for lithium-ion batteries in electric and hybrid electric vehicles. J Power Sources 281, 114–130 (2015).
Singh, K., Tjahjowidodo, T., Boulon, L. & Feroskhan, M. Framework for measurement of battery state-of-health (resistance) integrating overpotential effects and entropy changes using energy equilibrium. Energy 239, 121942 (2022).
Hartmann, L. et al. Depletion of electrolyte salt upon calendaric aging of lithium-ion batteries and its effect on cell performance. J. Electrochem. Soc. 171, 60506 (2024).
Lewerenz, M., Dechent, P. A. & Sauer, D. U. Investigation of capacity recovery during rest period at different states-of-charge after cycle life test for prismatic Li(Ni1/3Mn1/3Co1/3)O2-graphite cells. J. Energy Storage 21, 680–690 (2019).
Epding, B., Rumberg, B., Jahnke, H., Stradtmann, I. & Kwade, A. Investigation of significant capacity recovery effects due to long rest periods during high current cyclic aging tests in automotive lithium ion cells and their influence on lifetime. J. Energy Storage 22, 249–256 (2019).
Wilhelm, J. et al. Cycling capacity recovery effect: a coulombic efficiency and post-mortem study. J. Power Sources 365, 327–338 (2017).
Baure, G., Devie, A. & Dubarry, M. Battery durability and reliability under electric utility grid operations: path dependence of battery degradation. J. Electrochem. Soc. 166, A1991–A2001 (2019).
Rogge, M. & Jossen, A. Path‐dependent ageing of lithium‐ion batteries and implications on the ageing assessment of accelerated ageing tests. Batter. Supercaps 7 (2024).
Karnopp, D. C., Margolis, D. L. & Rosenberg, R. C. System Dynamics: Modeling and Simulation and Control of Mechatronic Systems 5th edn (Wiley, Hoboken, NJ, 2012).
Palm, W. J. System Dynamics 2nd edn (McGraw-Hill Higher Education, Boston, MA, USA, 2010).
Dehli, M., Doering, E. & Schedwill, H. Fundamentals of Technical Thermodynamics (Springer Fachmedien Wiesbaden, Wiesbaden, 2023).
Sutton, R. S. & Barto, A. G. Reinforcement Learning: An Introduction 2nd edn (The MIT Press, Cambridge, MA, USA, London, UK, 2018).
Wagner, W. & Pruß, A. The IAPWS formulation 1995 for the thermodynamic properties of ordinary water substance for general and scientific use. J. Phys. Chem. Ref. Data 31, 387–535 (2002).
DeVoe, H. Thermodynamics and Chemistry https://www2.chem.umd.edu/thermobook/v10-screen.pdf (2020).
The Battery Pass Consortium. Battery Passport Content Guidance. Achieving compliance with the EU Battery Regulation and increasing sustainability and circularity. https://thebatterypass.eu/assets/images/content-guidance/pdf/2023_Battery_Passport_Content_Guidance.pdf (2023).
Heinrich, F. Battery State Estimation of Electric Vehicles Using Neural Networks. PhD Thesis, University of Erlangen. (2022).
von Bülow, F., Heinrich, F. & Meisen, T. Fleet management approach for manufacturers displayed at the use case of battery electric vehicles. In IEEE International Conference on Systems, Man, and Cybernetics (SMC) 3218–3225 (IEEE, 2021).
von Bülow, F., Heinrich, F. & Meisen, T. A new perspective for manufacturers: cloud-based utilization of operational fleet data. in ELIV 2021 1st edn (ed. VDI Wissensforum GmbH) 117–130 (VDI Verlag, Düsseldorf, Germany, 2021).
Beck, D., Dechent, P. A., Junker, M., Sauer, D. U. & Dubarry, M. Inhomogeneities and cell-to-cell variations in lithium-ion batteries, a review. Energies 14, 3276 (2021).
Baumhöfer, T., Brühl, M., Rothgang, S. & Sauer, D. U. Production caused variation in capacity aging trend and correlation to initial cell performance. J. Power Sources 247, 332–338 (2014).
Katzschke, K., Wendland, R., Kurczveil, T. & Rausch, A. Deriving representative load profiles for battery simulation: the use case of system imbalances induced by cell-to-cell variations. J Energy Storage 90, 111830 (2024).
Baumann, M., Wildfeuer, L., Rohr, S. & Lienkamp, M. Parameter variations within Li-Ion battery packs—theoretical investigations and experimental quantification. J. Energy Storage 18, 295–307 (2018).
Li, Y. et al. Data-driven health estimation and lifetime prediction of lithium-ion batteries: a review. Renew. Sustain. Energy Rev. 113, 109254 (2019).
Dubarry, M. et al. Battery energy storage system modeling: a combined comprehensive approach. J. Energy Storage 21, 172–185 (2019).
Schuster, S. F., Brand, M. J., Berg, P., Gleissenberger, M. & Jossen, A. Lithium-ion cell-to-cell variation during battery electric vehicle operation. J. Power Sources 297, 242–251 (2015).
Schulz, A., Schindler, V. & Marker, S. Elektrifizierung von Fahrzeugflotten. Automob. Z. 114, 86–92 (2012).
Kumar, R. & Das, K. Lithium battery prognostics and health management for electric vehicle application—a perspective review. Sustain. Energy Technol. Assess. 65, 103766 (2024).
Lam, D. H. C., Lim, Y. S., Wong, J., Allahham, A. & Patsios, C. A novel characteristic-based degradation model of Li-ion batteries for maximum financial benefits of energy storage system during peak demand reductions. Appl. Energy 343, 121206 (2023).
Ank, M., Kröger, T., Schreiber, M. & Lienkamp, M. Experimental analysis of lithium-ion cell procurement: quality differences, correlations, and importance of cell characterization. J. Energy Storage 66, 107430 (2023).
Heimes, H. H. et al. Lithium-Ion Battery Cell Production Process (PEM der RWTH Aachen University; VDMA, Aachen, Frankfurt am Main, 2018).
Gewald, T. et al. Accelerated aging characterization of lithium-ion cells: using sensitivity analysis to identify the stress factors relevant to cyclic aging. Batteries 6, 6 (2020).
Barré, A. et al. A review on lithium-ion battery ageing mechanisms and estimations for automotive applications. J. Power Sources 241, 680–689 (2013).
Sauer, D. U. Time-series cyclic aging data on 48 commercial NMC/Graphite Sanyo/Panasonic UR18650R cylindrical cells. https://doi.org/10.18154/RWTH-2021-04545 (2021).
International Energy Agency (IEA). Global Electric Vehicle Outlook 2023. Catching up with Climate Ambitions. https://iea.blob.core.windows.net/assets/dacf14d2-eabc-498a-8263-9f97fd5dc327/GEVO2023.pdf (2023).
Rufino Júnior, C. A. et al. Towards a business model for second-life batteries: barriers, opportunities, uncertainties, and technologies. J Energy Chem. 78, 507–525 (2023).
Gasper, P. & Smith, K. Predictive Battery Lifetime Modeling at the National Renewable Energy Laboratory https://www.osti.gov/biblio/1838001 (2021).
Ansari, S., Ayob, A., Lipu, M. S. H., Hussain, A. & Saad, M. H. M. Remaining useful life prediction for lithium-ion battery storage system: a comprehensive review of methods, key factors, issues and future outlook. Energy Rep. 8, 12153–12185 (2022).
Mutagekar, S. & Jhunjhunwala, A. Understanding the Li-ion battery pack degradation in the field using field-test and lab-test data. J. Energy Storage 53, 105216 (2022).
Zhao, J. & Burke, A. F. Electric vehicle batteries: status and perspectives of data-driven diagnosis and prognosis. Batteries 8, 142 (2022).
Pozzato, G. et al. Analysis and key findings from real-world electric vehicle field data. Joule 7, 2035–2053 (2023).
Bole, B., Kulkarni, C. S. & Daigle, M. Randomized battery usage data set. https://www.nasa.gov/content/prognostics-center-of-excellence-data-set-repository (2012).
Jones, P. K., Stimming, U. & Lee, A. A. Impedance-based forecasting of lithium-ion battery performance amid uneven usage. Nat. Commun. 13, 4806 (2022).
Raj, T., Wang, A. A., Monroe, C. W. & Howey, D. A. Investigation of path‐dependent degradation in lithium‐ion batteries. Batter. Supercaps 3, 1377–1385 (2020).
Wang, X., Wang, Z., Wang, L., Wang, Z. & Guo, H. Dependency analysis and degradation process-dependent modeling of lithium-ion battery packs. J. Power Sources 414, 318–326 (2019).
Andre, D. et al. Characterization of high-power lithium-ion batteries by electrochemical impedance spectroscopy. I. Experimental investigation. J. Power Sources 196, 5334–5341 (2011).
Choi, W., Shin, H.-C., Kim, J. M., Choi, J.-Y. & Yoon, W.-S. Modeling and applications of electrochemical impedance spectroscopy (EIS) for lithium-ion batteries. J. Electrochem. Sci. Technol. 11, 1–13 (2020).
Kasper, M. et al. Calibrated electrochemical impedance spectroscopy and time‐domain measurements of a 7 kWh automotive lithium‐ion battery module with 396 cylindrical cells. Batter. Supercaps 6, 978 (2023).
Ibraheem, R., Strange, C. & dos Reis, G. Capacity and internal resistance of lithium-ion batteries: full degradation curve prediction from voltage response at constant current at discharge. J. Power Sources 556, 232477 (2023).
Yang, J., Beatty, M., Strickland, D., Abedi-Varnosfaderani, M. & Warren, J. Second-life battery capacity estimation and method comparison. Energies 16, 3244 (2023).
Badey, Q., Cherouvrier, G., Reynier, Y., Duffault, J.-M. & Franger, S. Ageing forecast of lithium-ion batteries for electric and hybrid vehicles. Curr. Top. Electrochem. 16, 65–79 (2011).
Richardson, R. R., Osborne, M. A. & Howey, D. A. Battery health prediction under generalized conditions using a Gaussian process transition model. J. Energy Storage 23, 320–328 (2019).
von Bülow, F., Mentz, J. & Meisen, T. State of health forecasting of lithium-ion batteries applicable in real-world operational conditions. J. Energy Storage 44, 103439 (2021).
Zhang, Y., Wik, T., Bergström, J., Pecht, M. G. & Zou, C. A machine learning-based framework for online prediction of battery ageing trajectory and lifetime using histogram data. J. Power Sources 526, 231110 (2022).
Jammalamadaka, S. R., Taufer, E. & Terdik, G. H. On multivariate skewness and kurtosis. Sankhya A 83, 607–644 (2021).
von Bülow, F., Wassermann, M. & Meisen, T. State of health forecasting of lithium-ion batteries operated in a battery electric vehicle fleet. J. Energy Storage 72, 108271 (2023).
Balaram, D., Dufford, B., Negoita, G. A., Paxton, W. A. & Yen, M. Exploring Battery Usage in Electric Vehicles through Graph-based Cascaded Clustering https://openreview.net/forum?id=BUxuW0f6EE (2024).
Heinrich, F., Noering, F. K.-D., Pruckner, M. & Jonas, K. Unsupervised data-preprocessing for long short-term memory based battery model under electric vehicle operation. J. Energy Storage 38, 102598 (2021).
Nuhic, A., Bergdolt, J., Spier, B., Buchholz, M. & Dietmayer, K. Battery health monitoring and degradation prognosis in fleet management systems. World Electr. Veh. J. 9, 39 (2018).
Nuhic, A., Terzimehic, T., Soczka-Guth, T., Buchholz, M. & Dietmayer, K. Health diagnosis and remaining useful life prognostics of lithium-ion batteries using data-driven methods. J. Power Sources 239, 680–688 (2013).
Ng, M.-F., Zhao, J., Yan, Q., Conduit, G. J. & Seh, Z. W. Predicting the state of charge and health of batteries using data-driven machine learning. Nat. Mach. Intell. 2, 161–170 (2020).
Shorten, C. & Khoshgoftaar, T. M. A survey on image data augmentation for deep learning. J. Big Data 6, 1106 (2019).
Kwade, A. et al. Current status and challenges for automotive battery production technologies. Nat. Energy 3, 290–300 (2018).
von Bülow, F. & Meisen, T. State of health forecasting of heterogeneous lithium-ion battery types and operation enabled by transfer learning. PHM Soc. Eur. Conf. 7, 490–508 (2022).
von Bülow, F., Hahn, Y., Meyes, R. & Meisen, T. Transparent and interpretable state of health forecasting of lithium-ion batteries with deep learning and saliency maps. Int. J. Energy Res. 2023, 1–23 (2023).
Author information
Authors and Affiliations
Contributions
Friedrich von Bülow: conceptualization, visualization, writing—original draft, writing—review and editing. Felix Heinrich: conceptualization, writing—review and editing. William Arthur Paxton: conceptualization, writing—review and editing.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Peer review
Peer review information
Communications Engineering thanks the anonymous reviewers for their contribution to the peer review of this work. Primary handling editors: [Jiangong Zhu, Billy Wu, and Weihan Li] and [Mengying Su, Saleem Denholme, and Rosamund Daw].
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
von Bülow, F., Heinrich, F. & Paxton, W.A. The future of battery data and the state of health of lithium-ion batteries in automotive applications. Commun Eng 3, 173 (2024). https://doi.org/10.1038/s44172-024-00299-w
Received:
Accepted:
Published:
Version of record:
DOI: https://doi.org/10.1038/s44172-024-00299-w
This article is cited by
-
Battery management systems for vehicle electrification
Communications Engineering (2026)
-
Why we need a standardized state of health measurement procedure for electric vehicle battery packs—a proposal for energy- and capacity-based metrics
npj Clean Energy (2025)





