Main

The market for home storage systems has been growing strongly over the past years1. To make the investment of around 10,000 € per system1 more appealing, manufacturers give warranty periods of 10 years. However, the industry lacks standardized state of health (SOH) evaluations2, and customers have to rely on the given values of battery management systems (BMSs), which have unknown calculation schemes2 and inaccuracies3. To establish transparency, the batteries regulation of the European Union requires reliable and transparent SOH estimations4. Therefore, methods to determine the SOH of battery storage systems in field operation are urgently needed.

In the laboratory, methods to determine the SOH are defined to a large extent. Laboratory measurements are based on monitoring individual cells in a controlled environment, which are aged by either cycling5,6,7,8,9,10,11,12,13,14 or storing them5,8,15,16 at specific conditions. The ageing within these tests is tracked via regular capacity measurements, and results vary depending on the investigated chemistry and testing conditions17. However, these tests are time-consuming18, and real-world operation is always subject to unpredictable changes19, which is why simulations of field operation are applied in the laboratory20,21,22.

Field capacity tests can be found for grid storage23,24,25, photovoltaic (PV) integration19,26,27, telecommunication28 and electric vehicles (EVs)29,30. While most of these use on-site capacity tests to monitor battery ageing19,23,24,25,26,28, others remove the battery for laboratory measurements24,27,29. Such capacity tests require a certain system downtime, leading to increasing work time and decreasing revenue.

Operational data analysis is a promising way to overcome the shortcomings of conducted field capacity tests30,31,32,33,34,35,36,37,38. The capacity-based SOH estimation is widely used to track the decreasing capacity39. Coulomb counting is an established approach for this but comes with several challenges, such as the required knowledge of an initial state of charge (SOC) and its sensitivity to error accumulation40. Therefore, different methods have been developed to estimate the SOH, which can be categorized into model-based and data-driven approaches41. Model-based methods42 use either equivalent circuit models40,43,44,45,46 or electrochemical models describing the internal electrochemical processes of the battery47,48,49,50. Model-based SOH estimators are mainly limited by their high computational costs, and their accuracy depends highly on the parameterization39. Machine learning methods do not model the internal working principles of the battery51. Instead, they map ageing-related input features of the battery to its SOH42,51, using algorithms such as Gaussian process regression52,53, support vector machines54,55 and different types of neural network approach39,56,57,58.

The lack of publicly available field measurement datasets is a problem that many studies face or explicitly identify as a general shortcoming today2,59,60. For example, thematical close publications of Dubarry et al.60,61 analyse synthetical home storage system (HSS) battery data derived from measured irradiance to develop diagnostic methods using machine learning and incremental capacity analysis. The developed methods show promising results and could be validated with the dataset of this paper. Several battery datasets are accessible to the public, originating from laboratory measurements or synthetic cell simulations. Constant cycling data for varying battery chemistries and conditions can be found in refs. 11,12,13,14,62,63,64,65,66. Specific discharging conditions are used in refs. 67,68,69,70,71, where driving cycles are used to simulate the usage of the tested batteries in EV applications. Synthetic datasets simulating many degradation paths and corresponding battery data can be found in refs. 72,73. Real-world EV operational data with lithium-ion batteries was recently published2,74 with valuable datasets71,75. Lead-acid solar home batteries in Africa are evaluated in ref. 76. Nevertheless, if such large field datasets are published in rare cases, there are usually no reference measurements to validate the developed algorithms. A detailed overview of public datasets is given in ref. 59.

On the basis of the conducted literature research, we conclude that first, most research focuses on EVs2,74,77,78,79,80,81,82,83 and not on stationary storage systems3,60,76. Second, the measurement periods are usually limited to 1–2 years of operation2,74,78,79,80,81,83, mostly without conducted validation measurements, and the datasets are typically not shared. Third, the internal resistance is regularly chosen2,76 as a health metric, as the capacity78 is difficult to examine during field operation and the exact origins of the stated BMS values are unknown2,3. However, the capacity is the most important metric for the battery lifetime and is subject to given warranties.

To contribute to battery research, this paper analyses field data of 21 privately operated HSSs of the first product generation over up to 8 years. The main scientific contributions of this paper are the development of a method to estimate the usable battery capacity of home storage systems and the publication of the large dataset. The key findings are that the measured HSSs in field operation lose about 2–3 percentage points (pp) of capacity per year. Compared with other publications, the long measurement period and periodic field capacity tests allow for method validation. Alongside the paper, we publish the dataset consisting of 106 system years, 14 billion data points and 146 gigabytes in 1,270 monthly files. To the best of our knowledge, no comparable public dataset for various lithium-ion batteries of HSSs has been used to date (year 2024) for scientific capacity estimation. We expect the dataset to enable researchers worldwide to develop new SOH estimation methods.

Dataset of 21 home storage systems over 8 years

The ISEA/CARL of RWTH Aachen University measured 21 private HSSs in Germany over up to 8 years from 2015 to 2022. All these HSSs are combined with residential PV systems to increase self-consumption. The measured quantities relevant to this paper are system-level battery current, voltage, power, battery pack housing temperature and room temperature, while the sample rate is 1 second. Table 1 contains an overview of the measured HSS batteries and their main parameters, and Supplementary Note 1 gives detailed information on the measurements in general and the high-resolution measurement systems used.

Table 1 Aggregated overview of the measured HSSs

The operational behaviour of the systems is determined by two main parameters, which are the system design and the used cell chemistry. While some metrics such as the current rate (C-rate) or the number of equivalent full cycles (EFCs) depend on the system design and the ratio of battery energy to inverter power, the cell chemistries have different open circuit voltage (OCV) curves and specific ageing characteristics. Supplementary Tables 1 and 2 give more information on the system-specific quantities and the used measurement hardware. Figure 1a shows the C-rate and the cell voltage for an exemplary HSS and summer day, while Fig. 1b,c presents an overview of data availability and data gaps, which is explained in more detail in Supplementary Note 2.

Fig. 1: Dataset overview.
figure 1

a, Voltage and C-rate over one exemplary day. b, Data availability. c, Data gap analysis. d, Voltage distribution. e, Current distribution. f, Battery housing temperature. g, EFCs per year. h, DOD distribution with a bin width of 5 pp and depicted at the midpoint DOD, so DODs close to 0% and 100% are not shown, although they occur. i, Mean voltage seasonality.

Source data

The different cathode materials are lithium nickel manganese cobalt oxide (NMC), a blend of lithium manganese oxide (LMO) and NMC (simply referred to as ‘LMO’ in this paper), and lithium iron phosphate (LFP). Within the NMC systems, there are differences in cell chemistry. Two of the NMC systems have a high nickel share.

To account for system design and cell chemistry, the system nomenclature gives information on both parameters to interpret the results. Therefore, this paper refers to ‘SmallLMO’ (or ‘SLMO’), ‘MediumNMC’ (or ‘MNMC’) and ‘MediumLFP’ (or ‘MLFP’) HSSs. As there are already HSSs in the energy range above 15 kWh on the market, none of the systems are classified as large. SLMO and MNMC systems share similar battery chemistries, but their system design is not comparable. While the MLFP and the MNMC systems show a similar system design, their battery chemistries are not comparable.

The 1,270 monthly CSV files can be downloaded from a public repository84 (https://doi.org/10.5281/zenodo.12091223).

Field operation of home storage systems

The large dataset allows the information extraction on actual home storage operation (Supplementary Notes 35). In the following, the most important findings for method development are presented.

The HSSs are often fully charged or fully discharged. This can be seen by the wider parts of the violin plots at the top and the bottom of the voltage distribution of the SLMO and MNMC systems in Fig. 1d. The SLMO systems have a clear end of charge (EOC) voltage of around 4.15 V. Their end of discharge (EOD) voltage shows two peaks, as the BMS decreases it over the lifetime to compensate for ageing. The distribution of the MNMC systems shows three peaks each, representing the different EOD and EOC voltages of specific systems with different NMC types and BMS settings. The MLFP distribution does not show the characteristic EOD and EOD peaks owing to the flat OCV of LFP.

The C-rate distributions in Fig. 1e indicate that systems are charged at higher C-rates (positive values) than discharged (negative values). Charging often occurs at higher C-rates owing to the relatively high PV power compared with the battery inverter. By contrast, discharge occurs largely during the night, in which household standby consumption is also met with a few hundred watts, resulting in low discharge rates. The maximum C-rate of an HSS is typically limited by the system design. As the SLMO systems have the highest relative inverter power, their C-rates are higher. However, being a ‘small’ system is not a general feature of LMO chemistry. There was simply one manufacturer that offered these small battery systems and used LMO batteries by chance.

It is important to note that the parameter distributions of identical systems vary between households as the HSS operation is highly dependent on the PV generation and the electrical house consumption. In addition, the systems show different operational strategies (Supplementary Fig. 1) and different temperature distributions in dependence on the room and the system (Fig. 1f and Supplementary Fig. 2).

Over a year, around 200 EFCs occur for the measured MNMC and MLFP systems, while SLMO systems show yearly values of around 250 EFCs (Fig. 1g). The cycles show clear seasonality with regular full cycles during spring and fall, fewer full cycles during summer and barely any full cycles during winter (Supplementary Figs. 3 and 4). In the summer, some systems are not discharged frequently, and in the winter, some systems do not get fully charged. Figure 1h shows the depth of discharge (DOD) distribution calculated via the rainflow method85 for the whole operational period. Most DODs are relatively small, with values below 20%. In addition to the small DODs, there are increased values at the upper end of the distribution, with DODs near 100%.

When analysing field data, operational strategies and software updates need to be taken into account. Some of these changes are shown in Supplementary Fig. 5, where the derating behaviour is changed, and the EOD voltage is decreased to counteract battery ageing.

Field capacity tests for method validation

Regular field capacity tests were conducted during the whole measurement period. For this, the HSSs were first fully charged and then fully discharged. The discharge was conducted at full power to have a reproducible test scheme in the field. The capacity tests performed have great value. In relation to the research, they validate developed methods such as these within this paper. Such validation is not straightforwardly possible in other studies. In addition, the sheer amount of time required is worth emphasizing. The test preparation, conduction and follow-up take about 2 days on average. The 60 successfully conducted capacity tests alone correspond to a working time of around 120 days.

Figure 2 shows the development of the SOHC based on the usable capacity over (a) system age, (b) EFCs and (c) both system age and EFCs. Each point corresponds to a manual capacity test. Figure 2d depicts the test current for exemplary HSSs, normalized to its maximum value. It shows the difference between the HSSs’ control strategies. While some HSSs have a constant current discharge, others apply a constant power discharge, leading to a rising current. Towards the end of discharge, control strategies from a sharp drop in current to a steady decrease exist, which varies from product to product and even changes with software updates.

Fig. 2: Results of the periodic capacity tests for method validation.
figure 2

a, SOHC over system age. b, SOHC over EFCs. c, SOHC according to age and EFCs. d, Different discharge behaviour of HSSs. SOHC normalized to the nominal capacity stated on the modules.

Source data

The usable capacity decreases over the years. While SLMO HSSs show an average capacity decrease of 2.1 pp per year (a−1), MNMC systems show an average decrease of 3.2 pp a−1, and MLFP systems of 2.2 pp a−1 based on the capacity tests. After 7 years, the first end of life (EOL) cases can be identified following the test results, although some tests show slightly lower values than 80% after already 4.5 years. However, the EOL shown in Fig. 2 does not define a warranty case, as datasheets often contain an ageing reserve in their stated value to meet the warranty (Supplementary Note 6).

The EFCs increase approximately linearly over time for the different HSS classifications. The SLMO systems show the most cycles, followed by the medium systems. Nevertheless, one MLFP system reaches a similar number of EFCs as the SLMO systems after more than 7 years (Fig. 2c). Regarding ageing, a capacity fade can be observed for all HSSs. The SOHC values depend on the specific HSS within a system type. Especially for the older SLMO systems, varying SOHC results can be observed at a similar age. In addition, the ageing behaviour differs among the system types. The SLMO systems do not show lower SOHC values compared with the MNMC systems despite more EFCs at a comparable age.

Supplementary Note 6 provides all the necessary information to understand the conducted capacity tests and warranty conditions. Supplementary Fig. 6 explains the capacity reserves that manufacturers apply to meet warranty conditions. The reserves are implemented by stating less capacity on the datasheet than the HSSs have. Supplementary Fig. 7 shows the capacity test scheme, and Supplementary Fig. 8 shows the influence of the ageing reserve and the normalization quantity on the warranty condition.

Capacity estimation based on operational data

HSSs regularly reach EOC and EOD voltage, and full cycles occur. The developed method uses this behaviour to estimate the capacity (Methods and Supplementary Methods). First, it identifies relaxation phases around the EOC and EOD voltages. Second, it estimates the relaxation OCV using a second-order equivalent circuit model with a two-step fitting procedure. Third, it estimates the capacity using an offset-current-corrected coulomb counting between a fully charged (EOC voltage) and a fully discharged (EOD voltage) state.

Figure 3 shows the storage operation of two exemplary days. During this period, the HSS shows a full cycle: It is empty at the beginning of day 1, gets fully charged until noon, stays at a high SOC during the day and is fully discharged overnight. The next day, it gets fully charged again. On the basis of this storage operation, four integration possibilities exist to estimate the capacity while integrating the current from empty-to-empty (E2E), full-to-full (F2F), empty-to-full (E2F) and full-to-empty (F2E).

Fig. 3: Methodology of the capacity estimation based on field measurements of an exemplary SLMO HSS (15 Ah cell).
figure 3

a, Voltage and relaxation points. To identify full cycles, EOC and EOD relaxations can be combined. Full cycles are represented between E2E and F2F. A full charge is represented by E2F and a full discharge by F2E. b, C-rate and capacity estimate. The current can be integrated between the identified relaxation points to compute the capacity.

Source data

When integrating full cycles in the form of E2E and F2F operations, it becomes apparent that the integrated current is not exactly equal to zero. First, the respective start and end points in the form of EOC and EOD voltage are not identical, as these vary during operation. Second, the BMS is also partly supplied by the HSS itself. Next to the BMS power supply, balancing different cell voltages requires energy as well. Therefore, the current is corrected, as presented in Supplementary Methods. In addition, a coulomb-counting SOC estimation that is recalibrated with the regularly occurring EOC or EOD relaxation phases is implemented.

Figure 4 shows the results of the capacity estimate for three exemplary HSSs: (a) SLMO, (b) MNMC and (c) MLFP. While the orange dots represent the manual field capacity tests, the blue dots show the algorithmic capacity estimates. The confidence interval (CI) in light blue contains 75% of all estimates.

Fig. 4: SOHC estimate (normalized to nominal capacity) for three exemplary HSSs.
figure 4

a, SLMO. b, MNMC. c, MLFP. Estimates validated by field capacity tests. The darker the blue points are, the more estimates exist. The trend is a linear fit of the estimates. The CI of 75% of all estimates is shown in light blue. The measurements started delayed after system commissioning (start of x-axis). Beginning of life (BOL) defined as 100%, and EOL defined as 80% of the nominal capacity.

Source data

As this paper does not focus on ageing fitting methods, a linear fit is applied to the values of the algorithmic capacity estimate, and its gradient is defined as the ageing rate. A linear fit is in line with some observations in literature86, although both linear and nonlinear ageing behaviour are possible87. However, the so-called knee point88,89 cannot yet be observed clearly for the systems. While the field capacity tests of the SLMO and the MNMC systems vary around the ageing trend, the capacity test values of the MLFP systems are always at the lower end of the estimates and below the trend. The reason for this is that the system does not reduce the current towards the EOD voltage. This results in an earlier reach of EOD owing to overvoltage and a smaller capacity value, because the capacity tests are conducted at maximum C-rate.

Figure 5 provides all ageing rates for the three system types identified by the linear fit for the HSSs both for capacity and energy on an annual basis and per 100 EFCs.

Fig. 5: Ageing trends of all 21 systems.
figure 5

Decrease in SOHC (usable capacity decrease (CD)) and SOHE (usable energy decrease (ED)) per year and per 100 EFCs. The box plot contains the gradient of the linear ageing fit from Fig. 4. Each circle is one system. One MLFP system has too many data outages to estimate the EFCs, which is why there are only seven systems depicted. Box plots are calculated by ‘box plot’ function of MATLAB, where boxes contain 50% of all values of the assumed distribution.

Source data

The mean values for the decrease in SOHC are 3.1 pp a−1 for SLMO, 1.9 pp a−1 for MNMC and 2 pp a−1 for MLFP. Concerning 100 EFCs, the ageing rates are 1.2 pp per 100 EFCs for SLMO, 0.9 pp per 100 EFCs for MNMC and 0.7 pp per 100 EFCs for MLFP. The high SOHC decrease of SLMO systems can possibly be explained by their higher number of EFCs, higher DODs and higher C-rates compared with the medium systems. However, for the yearly values, a range of approximately 2 pp a−1 can be observed for all system types, showing differences in ageing behaviour between similar HSSs. Purely from the gradients, the physical EOL can be estimated after 5 years (for 4 pp a−1) and after 20 years (for 1 pp a−1). While the shorter durations can already be confirmed for some HSSs, linear extrapolation is not reasonable for the HSSs with low ageing rates based on the nonlinear ageing towards the EOL.

The SLMO systems show the lowest mean capacity confidence width with a value of 4.4 pp, followed by the MNMC systems with 6.8 pp and the MLFP systems with 8.0 pp. The reason for the high accuracy of SLMO systems can be explained by their high cycle number and an accurate identification of EOC and EOD voltage owing to the LMO OCV curve. The MNMC systems with NMC cells share this voltage characteristic but show fewer full cycles, leading to fewer estimates and, consequently, wider confidence ranges. In addition, a higher connection of cells in parallel leads to higher inaccuracy in OCV estimation for some systems while others can be estimated with higher accuracy. LFP has a substantially flatter OCV curve than LMO and NMC, leading to less accurate EOC and EOD detection. Combined with a medium number of full cycles, their average usable capacity and energy estimates are worse, although some systems reach comparably good CI values below 5%.

The results for the usable energy decrease look similar to the capacity analysis, leading to the conclusion that the loss of capacity is the dominant ageing effect. A possible increase in internal resistance appears secondary. Otherwise, the decrease in energy-related state of health (SOHE) would be significantly higher owing to resistance losses.

An offset-corrected coulomb counting is described as an established method in literature with known shortcomings of error accumulation and unknown start SOC. In addition, manufacturers have already applied the method. However, we think that the following arguments highlight the importance of the presented method, which is tailored to HSS operation. First, the regular and timely close reach of EOD and EOC voltage of HSSs ensures that error accumulation is limited, and the SOC can be recalibrated frequently. This focus on regularly occurring full cycles is not possible for many other applications such as EVs (which typically do not get fully discharged) or battery storage systems for frequency restoration (which have a mean SOC of around 50%). Further, the applied current correction makes the errors smaller. Second, the capacity estimation method does not require an initial SOC, as it starts and ends at fully charged or fully discharged states close to the identified EOD and EOC voltage. Third, literature describes the method as state-of-the-art. However, we still see industry field data analyses with inaccurate SOH estimates not being the exception. To us, there is a gap between literature statements and especially smaller manufacturers’ implementation state. Our method shows how to adapt an established method to the specifics of HSS operation, offering an approach that needs to store only the data of full cycles occurring in normal operation. We consider the method robust, as it works for system-level field data of three relevant lithium-ion technologies without knowing all exact battery cells or having manufacturer OCV curves. Thus, it can also be used by external companies to help customers with warranty claims. The rather simple methodology makes the method transparent to all parties involved.

Conclusion

The batteries regulation of the European Union4 requires reliable SOH estimation based on field data. However, so far, neither standardized methods nor enough datasets exist to develop these. This paper contributes to both by analysing field measurements of 21 HSSs over a measurement period of up to 8 years. The dataset is, so far, valuable for a scientific dataset in terms of measurement duration and sample rate. It consists of 106 system years represented by 14 billion data points. Its 146 gigabytes cover three important lithium-ion battery technologies: LFP, NMC and a blend of LMO and NMC.

The developed SOH estimation automatically detects the electrochemical processes of overvoltage relaxation at low currents and uses their characteristics for the parameter estimation of a second-order equivalent circuit battery model. With these parameters, the exact SOC at both full charge and full discharge is calculated. These values compute the remaining capacity, energy and SOH while analysing current and voltage using coulomb counting and current correction. The analysed storage systems show average decreases in usable capacity of around two to three percentage points per year. From a technical perspective, single systems with a higher capacity fade reach their EOL after 5–7 years, defined as 80% of their nominal capacity. Others still show reasonable SOH values after the same operational period, indicating a longer lifetime. Nevertheless, the given warranty periods can be reached in most cases by including ageing reserves. Considering that the measured HSSs were from the first product generation, this is a positive sign for the industry.

Methods

Battery model and relaxation phase detection

A battery model is needed to estimate the OCV of the battery at identified EOC and EOD relaxation phases. The used second-order battery model is inspired by literature90,91,92,93, and its parametrization is described in Supplementary Methods (Supplementary Figs. 911, with Supplementary equations (1)–(5)).

Equation (1) is used for the final fit. The parameters include the measured battery voltage Vbat, the open circuit voltage VOCV, the voltage Vfast over the first resistor-capacitor (RC) element for the fast processes like charge transfer with the time constant τfast, and the voltage Vslow over the second RC element responsible for slow diffusion effects with the time constant τslow. The distribution of the relaxation voltages and their durations can be seen in Supplementary Fig. 12.

$$\begin{array}{l}{V}_{\rm{bat}}(t)-{V}_{\rm{OCV}}={V}_{\rm{fast}}\times {\mathrm{e}}^{\frac{-{t}^{+}}{{\tau }_{\rm{fast}}}}+{V}_{\rm{slow}}\times {\mathrm{e}}^{\frac{-{t}^{+}}{{\tau }_{\rm{slow}}}}\,\\{\rm{with}}\,{\tau }_{\rm{fas{t}}_{min}}\le {\tau }_{\rm{fast}}\le {\tau }_{\rm{fas{t}}_{max}}\,{\rm{and}}\,{\tau }_{\rm{slo{w}}_{min}}\le {\tau }_{\rm{slow}}\le {\tau }_{\rm{slo{w}}_{max}}\end{array}$$
(1)

SOC calculation and current correction

The SOC is calculated using equation (2). The reasonably constant energy supply of the battery to the BMS and regular balancing activities lead to an error in SOC estimation. The reason for this is that the measurement system is attached to the DC poles of the whole HSS’s battery. Thus, the internal energy supply of the BMS and balancing activities are not measured directly, possibly leading to offsets. The whole current correction is explained in Supplementary Methods (Supplementary Figs. 13 and 14). The parameters include the battery current Ibat, the offset current Ioffset and the usable capacity Cusable:

$${\mathrm{SOC}}\left(t\right)={\mathrm{SOC}}\left({t}_{0}\right)+\frac{\displaystyle\mathop{\int}\nolimits_{{t}_{0}}^{{t}_{0}+t}({I}_{{\rm{bat}}}\left(t\right)-{I}_{{\rm{offset}}}(t)){dt}}{{C}_{{\rm{usable}}}(t)}$$
(2)

SOH calculation

The SOHC serves as an indicator of a battery’s ageing conditions and limitations set by the BMS. It is derived from the ratio between the remaining usable capacity Cusable and its nominal capacity Cnominal, as represented in equation (3)94:

$${{\rm{SOH}}}_{\rm{C}}(t)=\frac{{C}_{{\rm{usable}}}(t)}{{C}_{{\rm{nominal}}}}$$
(3)