Introduction

Computational materials design1,2,3 is complicated by the complexity of materials and by interaction effects. This hampers both calculations and understanding. The fundamental problem lies in the fact that the effects of the Coulomb interaction and of the specific material cannot be separated. Otherwise, one could calculate the interaction contributions once and for all, store them and add them every time a new material is calculated. This is an unreachable dream, but it still indicates an intriguing direction of thinking: in some model systems the effects of the Coulomb interaction can be treated exactly, or at least to a much better extent than in real systems, and attempts to use model results in order to approximately simulate interaction effects in real materials are widespread. The most prominent example is the local density approximation (LDA) to the exchange-correlation (xc) energy whose functional derivative is the Kohn–Sham (KS) xc potential vxc(r; [n]) of density functional theory (DFT)4. The exact potential is unknown in most real materials. The LDA replaces vxc(r; [n]) at a point r by its value in a homogeneous electron gas (HEG) that is calculated at the density n(r) of the real system in the same point r. In this way, DFT profits from the existence of tabulated and interpolated Quantum Monte Carlo (QMC) results5. Similarly, dynamical mean field theory in the single site approximation takes the effective local self-energy from the Anderson impurity model, and although in this case results have not been tabulated, the procedure has enabled a realistic description of correlated materials6.

However, in spite of numerous studies and attempts4,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44 it is very difficult to go beyond these simple schemes. One reason is that the very fact of using results of the models is considered as an approximation from the start. This, however, is not inevitable. In the present work, we pose the following questions: Can one exactify the idea of re-using results from one system, for example a model, to describe another system? If yes, under which conditions? And does this suggest strategies for systematic approximations? Our answer, termed connector theory (COT), is an in principle exact connection between different systems, which is used as starting point to find good approximations in practice. As we will illustrate, it is a promising tool to design functionals, for example within DFT. It is, however, much more general than just density functionals and can be used to design other functionals, and also to speed up calculations.

Results

From the LDA to an exact xc potential

In order to face this challenge, let us first analyze the LDA from the connector point of view. This seemingly simple approximation is probably one of the main reasons for the early success of DFT. Thanks to the LDA, many thousands of ground state calculations could be performed in the KS scheme without ever recomputing the exchange-correlation energy: indeed, the really involved calculation was done once and forever using QMC, in the HEG. In the LDA, these results are then used over and over again, for all materials. This constitutes an invaluable gain. Of course, the idea requires a prescription of how to use the HEG results.

The simple prescription given by the LDA reads \({v}_{{{{\rm{xc}}}}}^{{{{\rm{LDA}}}}}({{{\bf{r}}}};[n])={v}_{{{{\rm{xc}}}}}^{h}({n}_{{{{\bf{r}}}}}^{h})\), where \({n}_{{{{\bf{r}}}}}^{h}=n({{{\bf{r}}}})\). In other words, one supposes that the xc potential of a material in a point r takes the same value as \({v}_{{{{\rm{xc}}}}}^{h}\) in a HEG with a density \({n}_{{{{\bf{r}}}}}^{h}\). Note the subscript r, which indicates that for every point in which the potential is to be determined, one may choose a different homogeneous density nh. The LDA makes the guess that a reasonable choice for the homogeneous density that represents the material in r is n(r), its local density.

Probably, both the power and the limitation of the LDA stem from the fact that it is based on Kohn’s ingenious intuition of nearsightedness4,45,46, where the xc potential in a given point doesn’t know about the density elsewhere. This hypothesis not only leads to a very simple prescription, it also justifies why results for the HEG can be used at all for systems that are extremely inhomogeneous, such as atoms: differences away from the point of interest are supposed to have no influence at all. The shortcoming, however, is that the hypothesis of nearsightedness gives no a priori clue of how to do better than the LDA, and straightforward attempts such as the first gradient corrections were not conclusive. Successful suggestions for better xc potentials were rather based on the fitting of parameters to known results47,48, or, most importantly, on the use of exact constraints18,19,20,21,22. One such attempt, which is may be the closest to our work, is the Weighted Density Approximation (WDA), where the sum rule on the xc hole was imposed in order to replace the local density by a suitably averaged one11,12,13,14,15. Along the same lines, one also finds the Average Density Approximation11,14 (ADA). However, while capable to overcome the failure of the LDA to describe image potential effects49, also the WDA and ADA did not lead to more systematic developments and are rarely used today.

Still, these approaches based on averaged densities were pioneering and suggest an interesting question: which homogeneous density \({n}_{{{{\bf{r}}}}}^{h}\) would represent the real system best in each point r, and could one even find a prescription where \({v}_{{{{\rm{xc}}}}}^{h}({n}_{{{{\bf{r}}}}}^{h})\) exactly equals the xc potential of the real material? We know that \({v}_{{{{\rm{xc}}}}}^{h}({n}^{h})\) as a function of nh takes values spanning a range of all negative real numbers. This means that as long as the xc potential of the real system is negative in a given point, it equals the xc potential of the HEG with a specific, though unknown, homogeneous density \({n}_{{{{\bf{r}}}}}^{c}\):

$${v}_{{{{\rm{xc}}}}}({{{\bf{r}}}};[n])={v}_{{{{\rm{xc}}}}}^{h}({n}_{{{{\bf{r}}}}}^{c})\,.$$
(1)

In other words, the task to find vxc(r; [n]) can be reformulated as the task to find \({n}_{{{{\bf{r}}}}}^{c}[n]\). This quantity, which is itself a functional of the density, is the prescription of how to use the HEG in order to represent the real system: we call it the connector. The exact connector reads

$${n}_{{{{\bf{r}}}}}^{c}[n]={({v}_{{{{\rm{xc}}}}}^{h})}^{-1}({v}_{{{{\rm{xc}}}}}({{{\bf{r}}}};[n]))\,.$$
(2)

Since \({v}_{{{{\rm{xc}}}}}^{h}\) is a monotonic function the inversion poses no problem. There are, however, two issues with these equations: first, Eq. (1) has no solution when the xc potential of the real system is positive. This is not the usual case, but it can happen in certain points in some systems; for example the xc potential of some linear molecules builds up a positive bump upon dissociation, see, e.g.,50,51,52,53,54,55. We will discuss in Sec. Generalization how to potentially face the problem. Second, Eq. (2) is our exact starting point, but it cannot be used in practice, because it expresses the connector in terms of the xc potential of the real system, which is the unknown that we are looking for. However, having an in principle exact relation is extremely precious, since it allows one to make controlled approximations.

For this, let us formulate the problem differently: the function \({v}_{{{{\rm{xc}}}}}^{h}({n}^{h})\) is a more compact representation of \({v}_{{{{\rm{xc}}}}}{({{{\bf{r}}}};[n])}_{| n(\tilde{{{{\bf{r}}}}}) = {n}^{h}}\), the functional vxc restricted to the domain of homogeneous densities. In other words, we could have equally expressed Eq. (1) as

$${v}_{{{{\rm{xc}}}}}{({{{\bf{r}}}};[\tilde{n}])}_{| \tilde{n}(\tilde{{{{\bf{r}}}}}) = n(\tilde{{{{\bf{r}}}}})}={v}_{{{{\rm{xc}}}}}{({{{\bf{r}}}};[\tilde{n}])}_{| \tilde{n}(\tilde{{{{\bf{r}}}}}) = {n}_{{{{\bf{r}}}}}^{c}}\,:$$
(3)

the equation requires that the functional vxc takes in each point r the same value for at least two different density distributions, namely, for the \(n(\tilde{{{{\bf{r}}}}})\) of interest and for some homogeneous density of value \({n}_{{{{\bf{r}}}}}^{c}\). The same functional appears on both sides of the equation, we only restrict the domain of allowed densities on the right side. Therefore, one can expect that by making the same approximation on both sides, a similar error is made, which means that we can benefit from massive error canceling if we determine an approximate connector

$${n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}}[n]={({v}_{{{{\rm{xc}}}}}^{h,{{{\rm{approx}}}}})}^{-1}({v}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}({{{\bf{r}}}};[n]))\,.$$
(4)

With the approximate connector being close to the exact one, the connector approximation for the xc potential defined as

$${v}_{{{{\rm{xc}}}}}^{c}({{{\bf{r}}}};[n])={v}_{{{{\rm{xc}}}}}^{h}\left({n}^{h}={n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}}([n])\right)$$
(5)

can yield good results since \({v}_{{{{\rm{xc}}}}}^{h}({n}^{h})\) is a smooth function. Note that in Eq. (5) the HEG potential \({v}_{{{{\rm{xc}}}}}^{h}\) appears without approximation, which is a crucial point, as it allows us to benefit from the knowledge encoded in the model.

Of course, such an approximate xc potential is not, in general, the functional derivative of an energy functional Exc. There are various ways to reconstruct an energy functional starting from some \({v}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}\), leading to different results that may have problems such as a lack of translational or rotational invariance, see, e.g., refs. 56,57,58. Using COT, one might think to explore \({E}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}\equiv \int d{{{\bf{r}}}}\,n({{{\bf{r}}}}){\epsilon }_{{{{\rm{xc}}}}}^{h}({n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}})\) with \({\epsilon }_{{{{\rm{xc}}}}}^{h}\) the energy density per particle of the HEG. Again, \({v}_{{{{\rm{xc}}}}}^{h}({n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}})\) will in general not be the functional derivative of this \({E}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}\). Moreover, the exact connector for ϵxc is not the same as the exact connector for vxc: the connector is in general different for every object of interest (see also Eq. (11) below). Hence, it is not obvious that a \({n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}}\) which yields a good approximation for vxc also yields a good approximation for Exc. Exploring total energies using COT is therefore a topic on its own, which goes beyond the scope of the present work.

Let us now come back to vxc and formulate a concrete proposal for a connector approximation. A potentially useful approximate connector functional for vxc involves an approximation for which both \({v}_{{{{\rm{xc}}}}}^{h}\) and \({v}_{{{{\rm{xc}}}}}^{h,{{{\rm{approx}}}}}\) are known. Here we explore the choice of a first-order expansion around a given homogeneous density4,59n0 as approximation in Eq. (4). The approximated potentials in the real and in the homogeneous system read, respectively:

$$\begin{array}{ll}{v}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}({{{\bf{r}}}};[n])=\,{v}_{{{{\rm{xc}}}}}^{h}({n}_{0})+\int d{{{\bf{r}}}}^{\prime} (n({{{\bf{r}}}}^{\prime})-{n}_{0}){f}_{{{{\rm{xc}}}}}(| {{{\bf{r}}}}-{{{\bf{r}}}}^{\prime} | ;{n}_{0})\\ {v}_{{{{\rm{xc}}}}}^{h,{{{\rm{approx}}}}}({n}^{h})\,\,=\,{v}_{{{{\rm{xc}}}}}^{h}({n}_{0})+({n}^{h}-{n}_{0}){f}_{{{{\rm{xc}}}}}^{h}({n}_{0})\,,\end{array}$$
(6)

where \({f}_{{{{\rm{xc}}}}}(| {{{\bf{r}}}}-{{{\bf{r}}}}^{\prime} | ;{n}_{0})={\left.\delta {v}_{{{{\rm{xc}}}}}({{{\bf{r}}}},[n])/\delta n({{{\bf{r}}}}^{\prime})\right|}_{n = {n}_{0}}\) is the static nonlocal xc kernel of the HEG with density n0, and \({f}_{{{{\rm{xc}}}}}^{h}({n}_{0})={\left.d{v}_{{{{\rm{xc}}}}}^{h}({n}_{h})/d{n}_{h}\right|}_{{n}_{h} = {n}_{0}}\) is its limit of zero wavevector, which corresponds to the case where variations are restricted such that the density remains homogeneous, i.e., we remain within the parameter space given by the model. The approximate connector of Eq. (4) is obtained by equating the two expressions, and solving for nh. The solution is

$${n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}}([n])=\frac{1}{{f}_{{{{\rm{xc}}}}}^{h}({n}_{0})}\int d{{{\bf{r}}}}^{\prime} \,n({{{\bf{r}}}}^{\prime}){f}_{{{{\rm{xc}}}}}(| {{{\bf{r}}}}-{{{\bf{r}}}}^{\prime} | ;{n}_{0})\,.$$
(7)

This is then used to obtain \({v}_{xc}^{c}\) according to the prescription of Eq. (5). For a density of the real system that varies slowly on the scale of spatial decay of \({f}_{{{{\rm{xc}}}}}(| {{{\bf{r}}}}-{{{\bf{r}}}}^{\prime} |)\) the connector tends to \({n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}}([n])=n({{{\bf{r}}}})\), the LDA, which is exact in the limit of slowly varying density4. This is therefore a systematic way to derive the LDA. For a very quickly varying periodic density the connector nc,approx tends to the mean density, as one would expect. This limit is missed by the LDA. The approximate connector interpolates between the two limits: \({n}_{{{{\bf{r}}}}}^{c}\) displays the degree of nearsightedness of the xc potential.

For the fxc of the HEG, parametrized data are available60,61. Equation (7) is one of our major results: \({v}_{{{{\rm{xc}}}}}^{c}({{{\bf{r}}}};[n])={v}_{{{{\rm{xc}}}}}^{h}({n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}}[n])\) is a truly non-local density functional. It benefits from the knowledge of both vxc and fxc in the HEG, i.e., from high-level calculations that were performed once and for all in this model. It is therefore perfectly in line with our goal to build functionals containing pre-calculated complex ingredients, similarly to constructions using complex Lego pieces instead of simple bricks.

The linear approximation depends on the homogeneous density n0 around which we expand. This density may be chosen to be a different one for every point r where we calculate the potential. To focus on the improvement with respect to the LDA, in the following we choose for n0 the local density n(r), i.e., the direct approximation is a first-order expansion around the LDA. In principle, n0 can be further optimized, but this simple choice suits the purpose of the present paper.

We will now examine to which extent our functional can correctly describe non-local effects. The exact xc potential is of course not known, and the search for a simple but reliable KS exchange-correlation potential vxc(r; [n]) beyond the LDA is still a subject of intense research (see e.g.,11,12,13,14,15,16,17,18,19,20,21,22,62,63,64,65,66). This makes a direct benchmarking of the improvement of our functional with respect to the LDA difficult. We therefore take an established non-local functional as target, and we will do all approximations and calculations consistently with respect to this functional. It is important to note that the aim is not to elucidate the quality of the target functional, but to use it as benchmark for the connector approach. In particular, we will show how the connector yields the LDA in the simplest approximation, and how well it can capture non-local effects beyond the LDA in a systematic way.

We choose as target functional an expression based on the WDA of the xc hole nxc11,12,13,14, with the weight function proposed in ref. 15. The xc energy reads

$$E_{\rm xc}^{\rm WDA}[n] = \int d \mathbf{r} d\mathbf{r'} \frac{n(\mathbf{r})n(\mathbf{r'})}{2\mathbf{|r-r'|}} C(\tilde{n}(\mathbf{r,r'})) \left( 1- \rm{e}^{\frac{- \lambda(\tilde{n}(\mathbf{r,r'}))}{\mathbf{|r-r'|^5}}} \right),$$
(8)

with \(\tilde{n}({{{\bf{r}}}},{{{\bf{r}}}}^{\prime})=[n({{{\bf{r}}}})+n({{{\bf{r}}}}^{\prime})]/2\) as in ref. 49 and the Coulomb interaction \(1/| {{{\bf{r}}}}-{{{\bf{r}}}}^{\prime} |\) appearing at two places, as factor in the integral and in the exponent to the fifth power. The functions C and λ are set by the sum rule \(\int d{{{\bf{r}}}}^{\prime} {n}_{{{{\rm{xc}}}}}({{{\bf{r}}}},{{{\bf{r}}}}^{\prime} -{{{\bf{r}}}})=-1\), and by making Exc exact in the HEG. The functional derivative of \({E}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}\) with respect to the density yields the exchange-correlation potential \({v}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}\) that is given in the Supplementary Note 1, together with the functions C and λ. Our aim is to approximate this potential for a given density.

We take as density of the ‘real system’ the periodic density \(n({{{\bf{r}}}})=A\cos ({{{\bf{a}}}}\cdot {{{\bf{r}}}})+B\), with reciprocal lattice vector a and parameters A and B. Here a and the maximum density correspond to the case of solid argon. This yields the target potential \({v}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}\) shown in Fig. 1. It is negative for all systems examined in our work, so it can in principle be reproduced by \({v}_{{{{\rm{xc}}}}}^{h}\). We suppose now that this target potential is unknown (as it is the case for the exact xc potential), but that we are able to evaluate it in some approximation.

Fig. 1: Target WDA xc potential, and approximations, for a system with periodic density.
figure 1

Target \({v}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}({{{\bf{r}}}};[n])\) (red line). Coulomb cutoff at short-range (s.-r.): direct approximation (dotted green) and used via the connector (dotted green with filled circles); linear expansion: direct approximation (blue dashed) and used via the connector (blue dashed with filled circles); LDA (yellow line). Minimum and maximum densities are \(0.0402\,{a}_{0}^{-3}\) and \(0.3776\,{a}_{0}^{-3}\), reciprocal lattice vector \(a=0.93\,{a}_{0}^{-1}\).

Before we come to the approximation proposed in Eq. (7), let us make a more drastic approximation, namely, we cut the Coulomb potential \(1/| {{{\bf{r}}}}-{{{\bf{r}}}}^{\prime} |\) in Eq. (8) to zero beyond a distance rc. This cutoff now appears as limit of the integral boundaries in Eq. (8) and in the corresponding \({v}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}\). For rc → 0 the integral vanishes. For illustration we take rc = 0.1 a0, <2% of the unit cell length. This rc yields an almost vanishing \({v}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}\), which is of course very different from the exact result. Now we follow instead the connector scheme, using the same approximation for both \({v}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}\) and \({v}_{{{{\rm{xc}}}}}^{h,{{{\rm{approx}}}}}\) in Eq. (4). The resulting connector density \({n}_{{{{\bf{r}}}}}^{c,{{{\rm{approx}}}}}\) is very close to the local density and the connector result for the xc potential, \({v}_{{{{\rm{xc}}}}}^{c}\) obtained from Eq. (5) and shown in Fig. 1, tends to the LDA. This is a huge gain. The physical reason is that we do not apply the approximation to the potential, but only to the way the potential in a given point ‘sees’ the environment, limiting it, for rc → 0, to the environment close to the point r where the potential is calculated. We therefore obtain a much more meaningful result. The huge improvement of the final connector result with respect to the approximation applied directly to vxc makes the LDA a prototype illustration for the power of the connector approach.

Still, the LDA has significant errors especially in regions of low density, because the approximation is too crude. In principle one could remedy by increasing the range of the Coulomb interaction, going systematically toward the exact result. However, this approximation would be of limited practical interest. Instead, we will now test our approximation Eq. (7). Although the numerically exact static fxc is known in the HEG, in order to be consistent within our benchmark framework, here we use the WDA \({f}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}\) for homogeneous densities67, which is derived from \({v}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}\) and given in the Supplementary Note 1.

Let us first look at the performance of the linear approximation itself, used directly on the potential (first line of Eq. (6)). The comparison in Fig. 1 shows that the WDA potential is well described by the direct linear approximation \({v}_{{{{\rm{xc}}}}}^{{{{\rm{approx}}}}}\) in regions of high density (large vxc), but not where the density is low. We now move on to the connector scheme, and first examine the connector density \({n}_{{{{\bf{r}}}}}^{c}\) given by Eq. (7) and shown in Fig. 2 for the same periodic system as above. The connector density that yields the LDA is simply \({n}_{{{{\bf{r}}}}}^{c}=n({{{\bf{r}}}})\), the density itself, which has its minimum in the middle of the unit cell. The exact WDA connector \({n}_{{{{\bf{r}}}}}^{c,{{{\rm{WDA}}}}}={({v}_{{{{\rm{xc}}}}}^{h,{{{\rm{WDA}}}}})}^{-1}({v}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}({{{\bf{r}}}};[n]))\) spans a smaller range of values than the local density, confirming that it represents an effective density that is suitably averaged over a range around the local density. It is far from trivial, even in this simple system: where the density is very low, the exact connector WDA shows a bump, which is a feature that would not be easy to guess. One may explain it with the fact that in regions of low density the system is more far-sighted, which, in the present system, mixes in more of the higher densities, and with the non-monotonic distance dependence of effective interactions, such as \({f}_{{{{\rm{xc}}}}}^{{{{\rm{WDA}}}}}\). The approximate connector density, evaluated according to Eq. (7) with WDA ingredients, captures this effect very well, with a remaining discrepancy that is much smaller than the one of the local density.

Fig. 2: Comparison of connector densities.
figure 2

Exact WDA connector \({n}_{{{{\bf{r}}}}}^{{{{\rm{c,WDA}}}}}\) (red line), connector based on a linear expansion in the density as given by Eq. (7) (blue line), and the density itself, corresponding to the local density approximation connector (yellow line), for the periodic system of Fig. 1.

Finally, we use the approximate connector density in Eq. (5) together with the exact WDA xc potential \({v}_{{{{\rm{xc}}}}}^{h,{{{\rm{WDA}}}}}\) of the HEG to obtain the approximate connector xc potential \({v}_{{{{\rm{xc}}}}}^{c}\). Figure 1 shows that it gives by far the best result of all approximations, illustrating the fact that the connector Eq. (7) takes into account a significant amount of non-locality. It demonstrates that, by using the very same linear expansion through the connector instead of applying it directly, strong improvement is obtained, without additional cost.

This benchmark shows that the connector strategy is very promising. Note that also in the case of the true many-body problem, the connector defined in Eq. (7) can be used to construct an approximate density functional for vxc. In this case, the xc kernel, \({f}_{{{{\rm{xc}}}}}(| {{{\bf{r}}}}-{{{\bf{r}}}}^{\prime} | ;{n}_{0})\), should be the one of the HEG obtained from a QMC calculation60,61. The implementation of this vxc will be left to future work. Our aim in the present work is to elucidate to which extent the approach can be formulated and applied as a very general concept. In the following, we will propose the corresponding formalism, which we term COT.

Generalization

The general idea can be summarized as follows: suppose one wishes to calculate an observable or quantity O that is a functional of a function Q, and that can depend on additional arguments x, so O = O(x; [Q]). Above, O was the xc potential vxc(r; [n]), the argument x was the spatial coordinate r and Q a function, the density \(n(\tilde{{{{\bf{r}}}}})\), but there are many other possibilities. For clarity, Table 1 summarizes the general approach in strict parallel to the specific realization of the KS potential, as well as the additional examples given later.

Table 1 Summary of the general COT approach and its declinations.

In most cases O(x; [Q]) is difficult or impossible to calculate in a real material without approximations, or even unknown. However, it may be possible to calculate O(x; [Q]) for some Q in a restricted domain: this restricted domain defines a model. Above, where Q was the electron density, the restricted domain of homogeneous densities defined the HEG as model. Connector theory (COT) aims at using the model results in order to simulate systems where Q lies outside the model domain. The goal is therefore to find, for a given real system described by Q, at each point x another function \({Q}_{x}^{c}\) that lies in the model domain, such that

$$O{(x,[\tilde{Q}])}_{\tilde{Q}(\tilde{y}) = Q(\tilde{y})}=O{(x,[\tilde{Q}])}_{\tilde{Q}(\tilde{y}) = {Q}_{x}^{c}(\tilde{y})}\,.$$
(9)

This is the straightforward generalization of Eq. (3) above. Note also here the subscript x of \({Q}_{x}^{c}\), which indicates that Qc is allowed to be different for every value of the argument x. For the sake of generality, we have kept the explicit argument x on the right side, but as we have seen above, in some cases \(O(x,{Q}_{x}^{c})\) may depend on x only through \({Q}_{x}^{c}\).

To arrive at the generalization of Eq. (1), in the following we suppose that, because the model is simple, it can be described by only one effective parameter \({{{\mathcal{Q}}}}\) or, when there are more parameters, we can choose one that will be used to fulfill Eq. (9). This restriction can be dropped, but it is often useful and we keep it here for clarity. In this case, in the model the functional or multi-dimensional function can be represented by a simple function, \(O(x;[{Q}_{x}^{c}])\to {{{{\mathcal{O}}}}}_{x}({{{{\mathcal{Q}}}}}_{x}^{c})\). For example, in the HEG the one parameter \({{{\mathcal{Q}}}}\) is its number density nh, so \({v}_{{{{\rm{xc}}}}}{({{{\bf{r}}}};[\tilde{n}])}_{\tilde{n}(\tilde{{{{\bf{r}}}}}) = {n}^{h}}\to {v}_{{{{\rm{xc}}}}}^{h}({n}^{h})\), where \({v}_{{{{\rm{xc}}}}}^{h}\) corresponds to \({{{\mathcal{O}}}}\) and nh to \({{{\mathcal{Q}}}}\). The equivalent of Eq. (1) reads then

$$O(x;[Q])={{{{\mathcal{O}}}}}_{x}({{{{\mathcal{Q}}}}}_{x}^{c})\,.$$
(10)

Of course, even allowing \({{{{\mathcal{Q}}}}}_{x}^{c}\) to be different for every x it could be impossible to fulfill Eq. (10) if one restricts the model domain too severely. However, if the model is flexible enough such that the equation can be satisfied in principle, one can try to find the one or more \({{{{\mathcal{Q}}}}}_{x}^{c}\) for which the equality holds. In other words, for the idea to be applicable the following condition must be satisfied:

  • [A] On its domain of definition, \({{{{\mathcal{O}}}}}_{x}({{{{\mathcal{Q}}}}}_{x}^{c})\) must yield all values that O(x; [Q]) can take on its domain, i.e., the functions Q of interest.

The Q which define the domain of interest depend on the range of physical systems one wants to explore; this range does not necessarily include all possible physical systems. The domain of \({{{\mathcal{O}}}}\), on the other hand, defines the model system. If for certain Q and/or x Eq. (10) cannot be fulfilled, we have to change model by changing its domain, i.e., the range of allowed \({{{{\mathcal{Q}}}}}_{x}^{c}\).

Equation (10) is then formally solved,

$${{{{\mathcal{Q}}}}}_{x}^{c}={{{{\mathcal{O}}}}}_{x}^{-1}(O(x;[Q])):$$
(11)

this is the exact connector for an observable O in a system described by a function Q, the generalization of Eq. (2). This operation requires inversion of \({{{{\mathcal{O}}}}}_{x}\), which brings us to a second condition. Indeed, [A] is the only necessary condition, but there is also a question of uniqueness in Eq. (11): \({{{{\mathcal{O}}}}}^{-1}\) may require boundary conditions in order to be well defined. This is not a problem of principle, but may create difficulties for the design of approximations. We therefore require:

  • [B] When the inverse \({{{{\mathcal{O}}}}}^{-1}\) of \({{{\mathcal{O}}}}\) is not unique, it should at least be possible to specify a unique choice among the possible \({{{{\mathcal{O}}}}}_{i}^{-1}\).

In turn, when \({{{{\mathcal{Q}}}}}_{x}^{c}\) is known it can be used to get the observable in the real system using the model, through Eq. (10). As in the case of DFT and the LDA, the general scheme suggests to store \({{{{\mathcal{O}}}}}_{x}\) as numerical data, interpolated analytic expression or, in some cases, analytic result. Once a connector \({{{{\mathcal{Q}}}}}_{x}^{c}\) is given, for any real system one can then simply use these data instead of calculating O(x; [Q]), which will make calculations extremely efficient. This concept of re-using data has made the LDA a breakthrough, since numerous calculations could be performed without ever re-evaluating the interaction effects in the HEG, and it would be highly desirable to extend it not only well beyond the LDA, but also beyond DFT itself.

To make the idea useful in practice, we also have to generalize the approximation scheme Eq. (2) and Eq. (5), which straightforwardly yields

$${{{{\mathcal{Q}}}}}_{x}^{c,{{{\rm{approx}}}}}={({{{{\mathcal{O}}}}}_{x}^{{{{\rm{approx}}}}})}^{-1}({O}^{{{{\rm{approx}}}}}(x;[Q])).$$
(12)

The final approximate connector result is obtained as

$$O(x;[Q])\approx {O}_{x}^{c}\equiv {{{{\mathcal{O}}}}}_{x}({{{{\mathcal{Q}}}}}_{x}^{c,{{{\rm{approx}}}}}).$$
(13)

Note that the model observable \({{{\mathcal{O}}}}\) is supposed to be well known, but it is important to approximate it in Eq. (12) in the same way as O of the real system, whereas the exact model function \({{{\mathcal{O}}}}\) is used in Eq. (13). This guarantees that by choosing a model that approaches the real system, the connector potential approaches the exact one, even when a quite rough approximation is used. The exact result is also approached with an increasingly good approximation, for any, even rough, model.

Away from these limits, using the equivalent approximation in Oapprox(x; [Q]) and \({{{{\mathcal{O}}}}}_{x}^{{{{\rm{approx}}}}}({{{\mathcal{Q}}}})\) still leads to error cancelling, and the limiting behavior indicates that results can be improved in a controllable way. How far the model system can be chosen from the real system depends on the quality of the approximation, and vice versa, how rough the approximation is that one can tolerate depends on the closeness of the model and the real system. This double dependence is a source of the power of the connector approach. It implies that one can expect the approximate connector result Eq. (13) to be superior to the direct approximation Oapprox(x; [Q]), for similar computational cost. This benefit will be larger when the model contains important features of the real system, like in the example of the xc potential, the Coulomb interaction. Therefore, in COT the main effort and intuition will go into the choice of a suitable model.

Illustration

For a simple illustration of COT outside DFT-LDA, we first look at energy levels in a toy system: we take one electron in a one-dimensional potential of shape \(V(z)=\frac{1}{2}m{\omega }_{0}^{2}{\left(\left|z\right|-\frac{L}{2}\right)}^{2}\theta \left(\left|z\right|-\frac{L}{2}\right)\), with ω0 and L real positive parameters68. For L → 0 this is the oscillator potential, and for ω0 → , the infinite potential well \({V}_{\infty }(z)={V}_{0}\theta \left(\left|z\right|-\frac{L}{2}\right)\), with V0 → . These potentials and their energy levels are schematized in panel a of Fig. 3 (see also the Supplementary Note 2 and Supplementary Fig. 1 for further details). The jth energy level E(j; ω0, L) is our observable of interest corresponding to O(x; [Q]). The third column in Tab. 1 summarizes this and the subsequent steps for this example. The numerically exact E(j; ω0, L) are shown in panel b of Fig. 3 as function of α = 1/(L2ω0). This parameter is a measure of the difference between the real system and the infinite potential well, which vanishes for α → 0. Next, we choose an approximation. Here, we take a first-order perturbation expansion around the infinite well, i.e., α = 0, with width L. The expansion parameter is then α. Figure 3 shows the resulting approximate energy levels Eapprox(j; ω0, L). As expected from perturbation theory, they are close to the exact E(j; ω0, L) for small α and deviate for larger values, becoming unphysically negative above a critical threshold. To apply COT, we now define the model domain to cover all possible infinite square wells, characterized by a width \(\tilde{L}\) and yielding energies \({{{{\mathcal{E}}}}}_{j}(\tilde{L})={\pi}^{2}{j}^{2}\!/(2{\tilde{L}}^{2})\). It is intuitive to suppose that the effect of α can be simulated by varying the width \(\tilde{L}\) of the model potential, i.e., to suppose that each level sees a surrounding with some effective width. COT tells us to search for the effective width that best represents the real system. Condition [A] is fulfilled for any potential with positive eigenvalues if we accept to have a different connector, i.e., a different effective width, for every energy level, since the energy levels in an infinite potential well can take any positive value. The exact solution would be the connector width \(\tilde{L}={L}^{c}\) for which \({{{{\mathcal{E}}}}}_{j}({L}_{j}^{c}({\omega }_{0},L))=E(j;{\omega }_{0},L)\). Here, we have chosen to connect the energy levels one by one. This would be a reasonable choice also to connect, for example, poles of Green’s functions.69 Since the energies \({{{{\mathcal{E}}}}}_{j}(\tilde{L})\) of an infinite potential well with width \(\tilde{L}\) go as \(1/{\tilde{L}}^{2}\), there is no unique inverse \({{{{\mathcal{E}}}}}_{j}^{-1}(E)\). However, one of the two solutions of the resulting quadratic equation for \({L}_{j}^{c}\) is negative and can be discarded by using positivity as exact constraint, so our example also fulfills condition [B]. Our approximation strategy, which avoids the calculation of \(E(j;{\omega }_{0},L)\), is to expand also the model around the infinite square well (α = 0) with width L. Hence, \({L}_{j}^{c,{{{\rm{approx}}}}}={({{{{\mathcal{E}}}}}_{j}^{{{{\rm{approx}}}}})}^{-1}({E}^{{{{\rm{approx}}}}}(j))\), and the final result is \({E}_{j}^{c}={{{{\mathcal{E}}}}}_{j}({L}_{j}^{c,{{{\rm{approx}}}}})\). Note that in the last step the exact model function \({{{{\mathcal{E}}}}}_{j}\) is used, as prescribed by Eq. (13). While obtained with a similar computational cost, the connector results \({E}_{j}^{c}\) in Fig. 3 are much better than the direct approximation Eapprox(j), and they remain physical over the whole parameter range, which reflects an implicit approximate resummation of perturbation theory to infinite order through the use of the exact model \({{{{\mathcal{E}}}}}_{j}\). The connector \({L}_{j}^{c}\) formalizes the intuitive interpretation of an effective width that determines the energy levels in the real system and shows that this effective width is approximately independent of j (see Supplementary Note 2). This result indicates a promising route to obtain quick estimates for energy levels.

Fig. 3: Energy levels in a one-dimensional potential.
figure 3

a In gray, sketch of the potential V(z) (see text), as a function of z for L = 1, and different values of α = 1/(L2ω0). In red, the lowest energy levels E(j). The first six of these are shown in the bottom (b), as a function of α: the continuous red line is the exact result. Blue dashed, \({E}_{j}^{{{{\rm{approx}}}}}\) of first-order perturbation theory. Blue dashed line with symbols, connector results \({{{{\mathcal{E}}}}}_{j}({L}_{j}^{c,{{{\rm{approx}}}}})\). On the left side, the levels \({{{{\mathcal{E}}}}}_{j}(L)\) of an infinite potential well of width L = 1, which is the α → 0 result.

Finally, we will look at a more realistic system and, most importantly, at a completely different approximation, since besides the Coulomb cutoff, the above examples used perturbation theory as approximation. The observable will be a band structure obtained from the poles of the KS Green’s function G, the real material is solid cubic helium70 with given KS potential V(r), and as model system we choose the HEG, as summarized in the last column of Table 1. The KS band structure of cubic He is shown in panel c of Fig. 4: the occupied state is very flat, whereas the empty bands strongly disperse with k in the BZ. Our approximation is to calculate the exact energy levels at Γ but to neglect dispersion, which, if applied directly to the observable, would yield the flat bands in panel a of the figure, obviously far from reality.

Fig. 4: Band structure of cubic helium.
figure 4

The energies are aligned to the Fermi energy. a White lines are the bands resulting from the direct approximation, where the energies results at all k-points equal those at k0. b Connector result, plain (intensity plot) and constrained (blue dots), following the prescription in the text. c Comparison of first principles results (red line) with constrained connector (blue dots). In white, HEG folded into the Brillouin zone of helium and attached to the second band at Γ. For comparison of the lowest band, which corresponds to a localized state, the lowest HEG band is used.

In order to apply COT, we connect G to the Green’s function of the HEG

$${G}_{{{{\bf{K}}}}{{{\bf{K}}}}}({{{\bf{k}}}},\omega ;[V])={{{{\mathcal{G}}}}}_{{{{\bf{k}}}}\omega {{{\bf{K}}}}}({{{{\mathcal{V}}}}}_{{{{\bf{k}}}}\omega {{{\bf{K}}}}}^{c}),$$
(14)

where K are reciprocal lattice vectors. The Green’s function of the HEG is diagonal in K, so we can represent it by \({{{{\mathcal{G}}}}}_{{{{\bf{k}}}}\omega {{{\bf{K}}}}}\), which is a function of k + K and of the homogeneous potential \({{{{\mathcal{V}}}}}^{c}\),

$${{{{\mathcal{G}}}}}_{{{{\bf{k}}}}\omega {{{\bf{K}}}}}({{{{\mathcal{V}}}}}^{c})=\frac{1}{\omega -| {{{\bf{k}}}}+{{{\bf{K}}}}{| }^{2}/2-{{{{\mathcal{V}}}}}^{c}}\,,$$
(15)

where the frequency ω includes an infinitesimal imaginary part. Equation (14) is the equivalent of Eq. (10), and the potential \({{{{\mathcal{V}}}}}^{c}\) plays the role of the connector. To determine the approximate connector we apply the approximation G(k) ≈ G(k0) to both sides of Eq. (14), which is equivalent to supposing \({{{{\mathcal{V}}}}}_{{{{\bf{k}}}}\omega {{{\bf{K}}}}}^{c,{{{\rm{approx}}}}}={{{{\mathcal{V}}}}}_{{{{{\bf{k}}}}}_{0}\omega {{{\bf{K}}}}}^{c}\), i.e., transferability of the connector itself. It is interesting to note that this corresponds to a diagonal approximation of the in principle exact Dyson equation that relates Gk and \({G}_{{{{{\bf{k}}}}}_{0}}\). The effort required is the calculation of GKK(k0, ω; [V]) in only one point k0. Here, we take k0 to be the Γ-point. Following Eq. (13), this yields the band structure at all k-points as

$${G}_{{{{\bf{K}}}}{{{\bf{K}}}}}^{c,{{{\rm{approx}}}}}({{{\bf{k}}}},\omega ;[V])=\frac{1}{-\frac{| {{{\bf{k}}}}+{{{\bf{K}}}}{| }^{2}}{2}+\frac{| {{{{\bf{k}}}}}_{0}+{{{\bf{K}}}}{| }^{2}}{2}+\frac{1}{{G}_{{{{\bf{K}}}}{{{\bf{K}}}}}({{{{\bf{k}}}}}_{0},\omega ;[V])}}\,.$$
(16)

The result of this COT approximation, shown in panel b of Fig. 4, is clearly much improved with respect to the panel a, thanks to the information imported from the HEG. At the same time it is also better than a simple alignment of the HEG bands at k0 (which would also require the full calculation at k0), which can be seen most clearly, but not only, for the occupied state (HEG bands are white lines in panel c).

In principle, the poles of GKK should not depend on K. Our approximation violates this exact constraint. Therefore, when the spectral function of G is summed over all K the band structure comes out slightly blurred, as one can see in panel b, and even spurious energy levels appear. We can, however, impose the exact constraint by choosing in each group of poles the one with the largest oscillator strength: since in the HEG only one of these poles would have non-zero weight, this reflects the situation where the real system and the HEG are most similar, which means, where COT should work best. Details of how this choice is implemented in practice are given in the Supplementary Note 3. This constraint yields the result in panel c of Fig. 4: the constrained connector makes the bands sharp and eliminates spurious non-dispersing structures. COT therefore leads to a simple approach to get a first, very quick, idea of the band structure of materials.

Discussion

In conclusion, inspired by a historical strategy to approximate density functionals using the HEG, we propose an in principle exact and very general approach. Its aim is to calculate once and for all, and store, a given observable or other object in a model system with high precision. These results are then used to determine the same object in real systems, via a procedure termed connector. The connector is different for every target object, and must be approximated. We suggest a strategy for a systematic connector approach, which, for a given model, makes use of an approximation in a way that leads to strong error cancelling. Indeed, we show that, and why, a given approximation is often much more powerful when used within COT than when directly applied to the object of interest. In the present work we have used three different kinds of approximations, namely, a modification of the Coulomb interaction, perturbation theory, and the hypothesis of transferability, and in all cases obtained significant improvement compared to simply applying the approximation to the observable of interest. Of course, the choice of the model and of the approximation still determine the quality of the results and require care and physical insight. The approach opens the way for fast computational methods, which we have illustrated with a quick estimate of energy levels and bands. It can also be used to design functionals. As example, we have derived an exchange-correlation potential for DFT, using as model system the HEG. By using the WDA as benchmark functional we have demonstrated that our connector functional is able to capture non-local effects in a very efficient way. Beyond the WDA, the connector approximation to the exact functional only requires data from the HEG that are already available in the literature as interpolated expressions. More complex models may require new calculations to be done, with results that can be either interpolated with modern machine learning algorithms or stored thanks to today’s storage capacities. Such more flexible models could potentially further improve the results71 and are envisaged for future applications. The present work sets the framework, elucidates the fundamentals and suggests directions for practical application, with a potentially huge impact on computational materials design.

Methods

The reference band structure of cubic He with experimental lattice parameters70 has been obtained from the diagonalization of the KS-LDA Hamiltonian of size 5.1 Ha in reciprocal space. The Hamiltonian has been calculated using local Troullier-Martins72 pseudopotentials in the Abinit code73 with a Γ-centered 5 × 5 × 5 k-point grid.