Continued fractions and the Thomson problem

Moscato, Pablo; Haque, Mohammad Nazmul; Moscato, Anna

doi:10.1038/s41598-023-33744-5

Download PDF

Article
Open access
Published: 04 May 2023

Continued fractions and the Thomson problem

Pablo Moscato¹,
Mohammad Nazmul Haque¹ &
Anna Moscato²

Scientific Reports volume 13, Article number: 7272 (2023) Cite this article

3891 Accesses
11 Citations
Metrics details

Subjects

Abstract

We introduce new analytical approximations of the minimum electrostatic energy configuration of n electrons, E(n), when they are constrained to be on the surface of a unit sphere. Using 453 putative optimal configurations, we searched for approximations of the form $E(n) = (n^2/2) \, e^{g(n)}$ where g(n) was obtained via a memetic algorithm that searched for truncated analytic continued fractions finally obtaining one with Mean Squared Error equal to ${5.5549 \times 10^{-8}}$ for the model of the normalized energy ($E_n(n) \equiv e^{g(n)} \equiv 2E(n)/n^2$). Using the Online Encyclopedia of Integer Sequences, we searched over 350,000 sequences and, for small values of n, we identified a strong correlation of the highest residual of our best approximations with the sequence of integers n defined by the condition that $n^2+12$ is a prime. We also observed an interesting correlation with the behavior of the smallest angle $\alpha (n)$, measured in radians, subtended by the vectors associated with the nearest pair of electrons in the optimal configuration. When using both $\sqrt{n}$ and $\alpha (n)$ as variables a very simple approximation formula for $E_n(n)$ was obtained with MSE= $7.9963 \times 10^{-8}$ and MSE= 73.2349 for E(n). When expanded as a power series in infinity, we observe that an unknown constant of an expansion as a function of $n^{-1/2}$ of E(n) first proposed by Glasser and Every in 1992 as $-1.1039$, and later refined by Morris, Deaven and Ho as $-1.104616$ in 1996, may actually be very close to −1.10462553440167 when the assumed optima for $n\le 200$ are used.

Emergence of competing electronic states from non-integer nuclear charges

Article Open access 08 September 2023

Electron g-factor in nanostructures: continuum media and atomistic approach

Article Open access 15 December 2020

Measurement of the neutron charge radius and the role of its constituents

Article Open access 19 March 2021

Introduction

Very recently, a number of papers have been related to uncovering equations that model physical systems with high accuracy. The denomination of ‘precision machine learning’ has been suggested for this research field¹. An increased interest now exists in the research and development of new methods for symbolic regression, a technique for multivariate regression problems that also has a history as a hub between the machine learning, computational intelligence and evolutionary algorithms communities.

While many pioneer works exist in the area, since the publication of the work of Schmidt and Lipson in 2009 in the journal Science², it is without question that many researchers have found its outcomes as a source of inspiration in Physics. Their work then led to the development of a commercial software (called Eureqa)³, and thanks to their free academic license, the power of the genetic programming heuristic running as a search engine, and the easy-to-use graphical user interface, this spawns the interest of many newcomers into the field.

In the original paper by Schmidt and Lipson, the objective of the authors was to show how symbolic regression can be used to uncover natural laws of Physics from experimental data. Far from being a mathematical curiosity in the field of classical mechanics, identifying invariants in the dynamics of systems and uncovering new empirical laws is still pretty much a constant need in science. Like in the case of Kepler, who established three laws for the movement of the celestial bodies, those laws may not stand forever, but they may inspire the work of theoreticians, particularly when the fitting of the observed data by the empirical law is relatively simple and also pretty accurate. In that case, it was Newton who simplified the three laws and later established his simpler theory. Today we have a similar process going on; sometimes, many features are measured from experimental settings, and since symbolic regression methods are generally coupled with model complexity reduction heuristics, a subset of the variables is generally chosen together with a good model. This allows theoreticians to concentrate perhaps on those variables in the quest to find a simpler explanation to the phenomenon of interest.

More recently, an MIT group has published another machine-assisted method to uncover relationships in Physics⁴. The approach is properly called AI-Feynman because its claim is to have been able to “uncover” 100 equations from the three-book series called The Feynman Lectures on Physics by Feynman, Leighton, and Sands as well 90 percent of another set of 20 “more challenging” equations coming from other Physics collections. In a new preprint, the new version called AI-Feynman 2.0 is shown to be superior to the original approach⁵ including robustness to experimental noise.

One of the fields that well illustrates on the benefits from symbolic regression is Astronomy, an area of science with a long tradition in data-driven discovery. In 2014, Krone-Martins et al. reported “the first analytical expression to estimate photometric redshifts” suggested by a computer algorithm⁶, and a year before a group at Caltech illustrated on the use of symbolic regression to uncover classical astronomical relationships like the Hertzsprung-Russell diagram, the fundamental plane of elliptical galaxies, and the period-amplitude plane of RR Lyrae stars⁷. These two groups have also used the academic version of Eureqa, and for many years, the company website collected information on several new applications of symbolic regression, including our own using their software, in areas as diverse as consumer behavior and business analytics^8,9,10 and drug response in cancer cell lines¹¹.

This paper is structured as follows: at first, we present the problem and previous work, and then we present novel approximations based on our work using symbolic regression and analytic continued fractions. That introduces a very simple formula that is based on a particular feature of the known optimal configurations so far found. Finally, we present our conclusions from this study.

The Thomson problem

The basic Thomson problem is simple to state but difficult to solve constrained non-linear global optimization; find the equilibrium configuration of n identical charges subjected to be on a surface of a sphere. This means that for each value of n the task is to find the global optimum. This has proved to be a challenge in itself, the problem is used as a classical benchmark to evaluate the performance of optimization packages¹².

Although its long history, the problem is far from being solved for an arbitrary value of n, and it has the ongoing interest of chemists, physicists and mathematicians. In fact, it has a renewed interest lately since it is a special case of one of the “Eighteen unsolved mathematical challenges” proposed for the 21st Century by Steve Smale¹³.

This problem was posed in 1904 by J.J. Thomson and, in some sense, it has become a classical problem of ongoing interest both in Mathematics and Physics. His interest was to solve which is the 3-dimensional configuration of n electrons that minimizes the electrostatic potential if they are constrained to be on a surface of a unit sphere. The electrons repel each other with a force that is given by Coulomb’s Law.

Part of the renewed interest also lies in the generalizations and some of the present open conjectures. For instance, if p(a) and p(b) are the positions of two points on the sphere where two electrons a and b are located, and if we assume the Riesz potential, the energy of a set S of $|S|=n$ “electrons” will have potential energy given by

$$\begin{aligned} E(n) = \sum _{a,b \in S} \frac{1}{|p(a)-p(b)|^{s}} \end{aligned}$$

(1)

where $s>0$ and the case $s=1$ corresponds to the Coulomb potential; thus, in that case, we have the Thomson problem. It has been conjectured¹⁴ that for the minimum configuration potential, the Riesz potential, the value of the energy is given by:

$$\begin{aligned} E(n,s) = \frac{(\sqrt{3}/2)^{s/2} \, \xi _{\Lambda _2}(s)}{(4\pi )^{s/2}} n^{1+s/2} + V_s n^2 + o(n^2) \end{aligned}$$

(2)

where

$$\begin{aligned} V_s=\frac{2^{1-s}}{2-s}, \end{aligned}$$

(3)

$\Lambda _2$ is the regular planar triangular lattice generated by basis vectors (1, 0) and $(1/2, \sqrt{3}/2)$, and $\xi _{\Lambda _2}(s)$ is the Epstein-zeta function¹⁵ for the lattice $\Lambda _2$¹⁴ (see also^16,17 for other seminal papers).

Having presented the relevance and interest of this problem, we now look at the history of the Thomson problem and how researchers have tried to obtain an analytical expression from existing data.

Configurations with energy lower than the trend

Glasser and Every suggested that “minimum energy accurately follows a simple half-integral power law in 1/n”¹⁸. They first noticed how “it is striking how close the configuration energies lie to a smooth curve. On the scale that has been used, scatter in the energies can barely be discerned”. In fact, a number of optimal configurations seem to be characterized by a “significant lower (energy value) than the neighbours”. In¹⁹, the authors comment that the configurations for $n=12, 32, 72, 122, 132,$ and 192 have all in common the fact that they have icosahedral symmetry, and that “the icosahedral structures for $n=212, 272, 282,$ and 312 also have icosahedral symmetry and have low energy”. While they argue that this has been predicted, they also point out that the cases of $n=42, 92,$ and 162 are also icosahedral structures, but these have high energies relative to the trend of the equation we are fitting. D.J. Wales suggested that the energy lies at the tail of the distribution for the systems with high symmetry^20,21. This explains that the structures with higher symmetry measures tend to have either high or low energy minima; hence the icosahedral structure was found to exhibit both high and low energy for the aforementioned cases.

While the question is outside the scope of this work, we point out, however, that the integer sequence 12, 32, 42, 72, 92, 122, 132, 162, 192, 212, 252, 272,282, 312, 362, 372, 392, 432, 482, 492, 522, 572, 612, 632, 642, 672, 732, 752, 762, 792, 812, $\ldots$ corresponds to the number of vertices of Goldberg-Casper-Klug pseudo-icosahedra in the curated Online Encyclopedia of Integer Sequences (OEIS) A071336. We underline those values of n, which were discussed before by Glasser and Every¹⁹. We also note that the numbers we write in boldface belong to another sequence curated at the OEIS (A005901), and it is one corresponding to the “Number of points on surface of cuboctahedron (or icosahedron): $a(0) = 1;$ for $n > 0,\, a(n) = 10 \, n^2 + 2$. Also coordination sequence for f.c.c. or $A_3$ or $D_3$ lattice.” Since the sequence A071336 contains the one defined by their recurrence relation of A005901, it may then be an interesting open problem to determine if other icosahedral configurations with “avoided” low energy values are exactly those defined by that specific subsequence of the number of vertices of Goldberg-Casper-Klug pseudo-icosahedra or if others exist. Very recently, Timothy Michaels used equidistributed icosahedral configurations based on combining the (m, n) icosahedral nodes of Casper and Klug and an azimuthal projection method to define a low deformation equal area mapping²². This allowed him to analyze the Riesz potential energies numerically up to $n < 50000$. The OEIS seems to be an interesting tool to analyze sequences of values of n on which we observed some property, and we introduced it here since it will be again used later in the manuscript as an exploratory tool.

Approximations to the energy in Thomson problem as a function of the number of electrons

We briefly review here how several approximations have been given; generally, all of them are linked to an extension on the availability of conjectured or proved exact solutions for increasingly larger values of n. We will start with the already cited work of Glasser and Every and their derivation of a functional form.

Glasser and Every¹⁸

There has been huge progress in this problem in the last thirty years. Before 1992, optimal configurations for the problem were known only for $n\le 50$. In¹⁸, Glasser and Every finally extended the number of problems solved for up to 101 electrons. They observe that the values of the “minimum energy accurately follows a simple half-integral power law in 1/n”. They fit a function to the data by using the known minimum energies for all cases with $n\le 50$ and another 20 optimal configurations with $51 \le n \le 101$.

They propose a functional form for the energy as

$$\begin{aligned} E(n) = \frac{n^2}{2} \left( 1 + \frac{a}{n^{\alpha }} + \frac{b}{n^{\beta }} + \cdots \right) \end{aligned}$$

(4)

with $0< \alpha< \beta < \ldots \,$. Using only the values of the energies for $n=70$ ($E(70)=2127.100902$) and $n=80$ ($E(80)=2805.355876$), and leaving the first term of the expansion, they fit the free parameters as follows: $\alpha = 0.496$ and $a = -1.084$, concluding from the good fit to the first configurations with $n\le 30$ that $\alpha$ should be equal to 1/2 instead. Then they observe the need to include the second term of the expansion and, after another round of fitting and approximation of an exponent to a ratio of two integers they ended, finally, setting for a formula of the form:

$$\begin{aligned} E(n) = \frac{n^2}{2} \left( 1 - \frac{1.1028 }{n^{1/2}} + \frac{0.096}{n^{3/2}} \cdots \right) . \end{aligned}$$

(5)

A note added in proof indicates a better fitting with $a =-1.1039$ and $b = 0.105$ for $n\le 65$, so we will call the “Glasser and Every model” the one coming from the use of the formula with these values.

Morris, Deaven, Ho¹⁹

In 1996, Morris, Deaven and Ho presented results of what we consider a memetic algorithm^23,24, a population-based strategy for finding the minimum energy configurations that use conjugate gradient optimization as a local search strategy. Not only have they been able to independently reproduce all the known optimal results for $10<n<132$, but they have been able to provide optimal configurations up to $n \le 200$. They use the same formula employed by the Glasser and Every model but with slightly different values of the parameters (i.e. $a = - 1.104616 \pm 0.000 01$ and $b=0.1376 \pm 0.001$), concluding that the fitted formula for the new information available still is in reasonable agreement with the previous fits and calculations by Every and Glasser.

Approximations using symbolic and analytic continued fraction regression

Given the reported successes of symbolic regression in other problem domains, we first aimed at finding a fit to all the existing data that Morris, Deaven, Ho used.

An initial model of interest found with symbolic regression

Again, using all results for $n \le 200$, and using the commercial package Eureqa (together with a post-optimization procedure based on non-linear programming), we have been able to find an approximation model for the total energy of the form:

$$\begin{aligned} E(n) {\approx } \frac{n^2}{2} \, k^{{-} \gamma (n)} \end{aligned}$$

(6)

where Eureqa obtained

$$\begin{aligned} \gamma (n) = {\frac{4}{11 \sqrt{n} - 6}}, \end{aligned}$$

(7)

and $k=11591/547$. Once again, the simplicity of this model for E(n), defined by five integer constants, is quite impressive.

A Puiseux series expansion at $n=\infty$ for $\gamma (n)$ reveals an interesting role of the powers of the integers 12 and 11 in this approximation since it has the following form:

$$\begin{aligned} \begin{aligned} \gamma (n) =&4 \frac{1}{11} n^{-1/2} + 2 \frac{12}{11^2} n + 1 \frac{12^2}{11^3} n^{-3/2} + 6 \frac{12^2}{11^4} n^2 + 3 \frac{12^3}{11^5} n^{-5/2} + 18 \frac{12^3}{11^6} n^3 + 9 \frac{12^4}{11^7} n^{-7/2} + 54 \frac{12^4}{11^8} n^4 + 27 \frac{12^5}{11^9} n^{-9/2} + O\left( n^{-5}\right) . \end{aligned} \end{aligned}$$

And a Puisaux series at $n=\infty$ of $k^{ - \gamma (n)}$ is given by:

$$\begin{aligned} k^{ - \gamma (n)} \, = 1 - \frac{4}{11} \, log(k) n^{-1/2} + \frac{8 \, (log(k) - 3) \, log(k)}{121} n^{-1} + O(n^{-3/2}) \end{aligned}$$

(8)

with log(k) being the natural logarithm of k. We can observe that the second term of the expansion, $- 4/11 \, log(k)=-1.11037651326\ldots \,$, is relatively close to Morris, Deaven and Ho’s estimation of $a=-1.104616$, but we note that their original proposal did not include a term in $n^{-1}$. By comparison, the Puisaux series expansion of our model of Eq. (6) explains why perhaps a better approximation was not found until now.

Behaviour of the residual error and the relative error of the model of equation (6) for $n \le 200$

This motivated us to look at the behavior of both the residual error and the relative error for the values that we have used to find this approximation (i.e. $2 \le n \le 200$). For simplicity, we will refer to the normalized energy, i.e. $E_n(n) = 2 \, E(n) / n^2$. The difference between the exact value of the normalized energy and the one provided by our model for $n=2$ is $0.25 - k^{ - \gamma (2)} = {-} 0.028561067$. Figure 1 shows the difference between the exact value of the normalized energy and the value given by $k^{ - \gamma (n)}$ for $13 \le n \le 204$. For the configuration with the largest number of electrons in the figure, the difference is $0.922699961 - k^{ - \gamma (204)} =$ $\, {3.4829385 \times 10^{-4}}$.

Approximations with continued fraction regression

We have then used symbolic regression to find a more fitting function than Eq. (7) to see if we can find a better approximation of very low complexity using the software Eureqa on all the known conjectured or proven optimal solutions to Thomson problems that are currently known. Unfortunately, no better solution than Eqs. (6) and (7) was found. We then turned our attention to a new method we have recently developed based on analytic continued fractions.

The analytic continued fraction regression method was introduced in²⁵ as a general approach to model an unknown target function of multivariate independent variables, ${\textbf{x}}$, as a generalized continued fraction. For a multivariate unknown target function $f : \mathbb {R}^d \rightarrow \mathbb {R}$, we then write:

$$\begin{aligned} f({\textbf{x}}) = b_0({\textbf{x}})\, + \, \frac{a_1({\textbf{x}})}{b_1({\textbf{x}}) + \frac{a_2({\textbf{x}})}{b_2({\textbf{x}}) + \frac{a_3({\textbf{x}})}{b_3({\textbf{x}}) + \ddots }}} \end{aligned}$$

(9)

where $a_i({\textbf{x}}), b_i({\textbf{x}}) \in \mathbb {R}$ for all integer ($i\ge 1$, respectively $i\ge 0$). For each function $a_i({\textbf{x}}): \mathbb {R}^d \rightarrow \mathbb {R}$ is associated with a d-dimensional vector $\mathbf {a_i} \in \mathbb {R}^d$ and a constant $\alpha _i \in \mathbb {R}$:

$$\begin{aligned} a_i({\textbf{x}}) = \mathbf {a_i}^{\textrm{T}} {\textbf{x}} + \alpha _i \end{aligned}$$

(10)

Analogously, each function $b_i({\textbf{x}}) : \mathbb {R}^d \rightarrow \mathbb {R}$ is associated with a vector $\mathbf {b_i} \in \mathbb {R}^d$ and a constant $\beta _i \in \mathbb {R}$:

$$\begin{aligned} b_i({\textbf{x}}) = \mathbf {b_i}^{\textrm{T}} {\textbf{x}} + \beta _i. \end{aligned}$$

(11)

As an example, given a function f(x) we can use Mathematica to find its analytic continued fraction expansion. For instance, given $f(x)=(\sqrt{4x+1}-1)/2$ we can approximate it with arbitrarily precision. Truncating the expansion at $depth=6$ we have:

$$\begin{aligned} \frac{\sqrt{4x +1}-1}{2} \approx x + \frac{-x^2}{1+\frac{2x}{1+\frac{\frac{x}{2}}{1+\frac{\frac{3x}{2}}{1+\frac{\frac{2x}{3}}{1+\frac{\frac{4x}{3}}{1}}}}}} \end{aligned}$$

This function and a continued fraction expansion is discussed in²⁶ (this expansion we present was provided by Mathematica and it has $b_i(x,x^2) = 1, \forall i\ge 0$ and $b_0(x,x^2) = x$), but many others could have been used to illustrate the expansion. The authors of²⁶ point to the interesting “Handbook on continued fractions” contributed by Annie Cuyt et al.²⁷ in which many expansions of interests are which may be of interest for some of the readers.

For the application of these ideas for regression, the methodology proposed in²⁵ implies that, given a dataset, we face both a combinatorial and a non-linear optimization problem to solve. The first one comes from the selection of the variables that need to be used in the expansion (in the example given, both x and $x^2$ are used). Associated to each of the models that need to be iteratively created, we need to estimate the associated coefficients via a non-linear optimization algorithm or heuristic. Since²⁵ all the subsequent papers^28,29,30 have been employing a memetic algorithm, i.e. a population-based search technique, is used as the method of choice to find both the variables and coefficients.

A Nelder-Mead (NM) based local search optimizes the current solutions at each generation in the memetic algorithm and several heuristics exist for the inclusion and rejection of variables in the models. Details of the method can be found in²⁹. In²⁸, the authors present results on 452 datasets of physical relevance; the method showed a very good performance, and in²⁹ another comprehensive review of comparative performance against many state-of-the-art regression algorithms is also given on a large multivariate dataset used for testing.

A first approximation using continued fraction regression

We used a method based on multivariate regression using an analytic continued fraction representation of our target function of interest^{25,28,29,30,31}. We looked at approximating E(n) as follows:

$$\begin{aligned} E(n) \approx \frac{n^2}{2} \, e^{g(n)} \end{aligned}$$

(12)

We executed our memetic algorithm based continued fraction regression (CFR) method to approximate g(n) for $depths=\{0,1,2\}$²⁹. The CFR method uses a continued fraction-based representation of the unknown function that we are aiming to fit.

We found the following $depth=2$ model for g(n):

$$\begin{aligned} g(n)={\left( a\, u +\frac{b\, u+ c}{(d\, u + e)+\frac{f\,u + g}{h\, u + i}}\right) } \end{aligned}$$

(13)

as the best approximation (with a with $MSE={6.1080 \times 10^{-8}}$) for normalized energy where $u=n^{-1/2}$ and the coefficients are: a = − 0.880367889446965535, b = 0.793386942751073909, c = 0.184694236906771669, d = 1.27484122748516482, e = − 0.488981361176515139, f = 0.0528983408465706698, g = 0.298731221071015351, h = − 0.341376009238492262, i = − 0.000516314596688742609.

All the residuals presented in this paper are plotted on the same range, between $-12 \times 10^{-4}$ and $2 \times 10^{-4}$ and for $13< n < 204$. In particular, this helps to compare the results with the previous one given by Eqs. (6) and (7).

From Figs. 1 and 2 we can observe that for increasing values of n, we now have an approximation that converges faster to a desired zero residual, with noticeable differences already in the range of $n\le 204$ in Fig. 2.

An approximation using $n^{-1/2}, n^{-1}$ and, $n^{-3/2}$ as independent variables

We then look at the possibilities that we can have by explicitly employing $n^{-1/2}, n^{-1}$ and, $n^{-3/2}$ as independent variables since our memetic algorithm for analytic continued fraction regression can handle well multi-dimensional problems. In this case, we have managed to obtain a new approximation for E(n) of the form:

$$\begin{aligned} E(n) \approx \frac{n^2}{2} \, e^{g(n)} \end{aligned}$$

(14)

where g(n) is now an explicit function of these variables:

$$\begin{aligned} g(n) = - n^{-1/2} + \frac{7 n^{-1/2} + 35 n^{-1} - 14 n^{-3/2}}{58 n^{-1/2} - 67}, \end{aligned}$$

(15)

here, the mean square error of normalized energy (i.e. $2 E(n) / n^2$) is ${8.4161 \times 10^{-8}}$ (difference between the exact normalized energy data and the approximation are showed in Fig. 3 for $13 \le n \le 204$ with highlighting the n where $n^2 + 12$ is prime). Most notably, the Puisaux series at $n=\infty$ of $e^{g(n)}$ is given by:

$$\begin{aligned} e^{g(n)} = 1 - \frac{74}{67} n^{-1/2} - \frac{13}{4489} n{^{-1}} + \frac{117974}{902289} n^{-3/2} + O(n^{-2}), \end{aligned}$$

(16)

so the second term of the expansion is $-74/67 = - 1.10447761\ldots \,$, again very close to Morris, Deaven and Ho’s previous estimate of $-1.104616$.

A new solution with ratios of integers as coefficients

With Eq. (13), we used Mathematica to simplify it as a ratio of two polynomials. Then the coefficients were optimized further and presented as a ratio of two integers, as it is common in many analytic continued fractions. We used a high-precision non-linear optimization routine of Matlab. Consequently, we obtained this other expression with a reduced MSE value (${5.5549 \times 10^{-8}}$) to calculate the normalized energy:

$$\begin{aligned} { g(n) = \left( \frac{ \frac{12}{20467}\, n + \frac{4421}{1523}\, \sqrt{n} - \frac{998}{495\, \sqrt{n}} + \frac{8675}{6156} }{ -\frac{8998}{3401}\, n + \frac{1271}{3564}\, \sqrt{n} + \frac{9641}{5261} } \right) } \end{aligned}$$

(17)

We showed the residual of the continued fraction regression model with only ratio of integer coefficients for approximating the Thomson values in the range of $n\le 204$ in Fig. 4. This is a remarkably good approximation in all the ranges, even for the case of two electrons (giving a value of 0.2504976, when the exact value of the normalized energy is 0.25). For the largest configuration in the dataset (with $n=4352$ electrons^32,33, lowest minima located for the Thomson problem: http://www.junlilab.org/database/TP2.html), the approximation is 0.98313334728 for the known solution of value equal to 0.983244295.

A simple equation based on minimum angles

We have shown before that, at least for optimal configurations with small n, all approximations seem to err somewhat more than the trend when $n^2+12$ is prime. We have not yet found an explanation for this fact. We then turned our attention to some other proven and putative optimal known solutions for $n\le 200$³² (Global Minima for the Thomson Problem: https://www-wales.ch.cam.ac.uk/~wales/CCD/Thomson/table.html, https://www-wales.ch.cam.ac.uk/~wales/CCD/Thomson/xyz/) and we found a pattern worth exploring further.

A new sequence of integers

In fact, a new sequence of integers can be defined, which at the time of preparing this manuscript does not seem to be currently listed in the Online Encyclopedia of Integer Sequences, while other sequences related to the Thomson problem are. It is: $11, 13, 16, 19, 23, 26, 29, 31,\ldots \,$. This sequence is defined in the following way. Let $\alpha (n)$ is the smallest angle, measured in radians, which is subtended by the vectors associated with the nearest charge pair in the optimal configuration for n electrons. We then say that an integer n is in this sequence if and only if $\alpha (n) < \alpha (n+1)$. This means that in the optimal configuration for n electrons there is at least one pair which is near than the nearest pair in a configuration with $n+1$ electrons.

We have observed that there was an overlap between this integer sequence and those n such that $n^2 + 12$ is prime (e.g. 19, 23, 26, 29) (OEIS A114275). Motivated by this fact, we looked at the possibility that $\alpha (n)$, together with the inverse of the square root of the number of electrons, could provide a low complexity but good approximation formula for the energy of the optimal configuration.

Using symbolic regression (in this case the commercial package TuringBot), we have also found the following simple formula, which is a good approximation for E(n):

$$\begin{aligned} E(n) \approx \frac{n^2}{2} \left( \frac{\sqrt{n} -1}{\sqrt{n} + \frac{\alpha (n) + \pi }{30} } \right) . \end{aligned}$$

(18)

The formula was obtained by only accepting integer coefficients in the expressions generated by the symbolic regression method TuringBot. The constant $\pi$ was incorporated ad hoc in the formula after interpreting that ‘$\alpha (n) + 3$’ may actually be representing an angle measured in radians ‘$\alpha (n) + \pi$’. We noted that the ‘3’ in that formula is a consequence of the symbolic regression approach (that searched for integer coefficients), and we present this formula after observing that the MSE improved with the substitution.

This is very interesting, in the limit for $\alpha (n)$ tending to zero, we have that

$$\begin{aligned} \lim _{\alpha (n) \rightarrow 0} E(n) \approx \frac{n^2}{2} \left( \frac{ 30 \, (\sqrt{n} - 1)}{ 30 \, \sqrt{n} + \pi } \right) \end{aligned}$$

(19)

and if we take a look at the series expansion at $n=\infty$ we have

$$\begin{aligned} \lim _{\alpha (n) \rightarrow 0} E(n) \approx \frac{n^2}{2} \left[ 1 + \left( -1 - \frac{\pi }{30} \right) \, n^{-1/2} + \cdots \right] \end{aligned}$$

(20)

an expression that includes the transcendental number

$$\begin{aligned} -1 - \frac{\pi }{30} = -1.10471975511965977\ldots \end{aligned}$$

(21)

again, a result that is in reasonable agreement with the previous estimate by Morris, Deaven and Ho $( -1.104616\ldots \,)$ which, it is useful to recall, was obtained by analyzing only electron configurations with $n \le 200$. Using a non-linear optimization approach and all the data we obtained $-1 - \pi /29.96627907927524\ldots \approx -1.104837\ldots \,$. When we used the 254 configurations not used by Morris, Deaven and Ho, i.e. those with $200 \le n \le 4352$, we obtained $-1-\pi /30.0270165553\ldots \,$, leading to a value of $-1.10462553440167\ldots \,$.

We present our previous model in Eq. (17) which highlights the n for the angle-based new sequence (11, 13, 16, 19, 23, 26, 29, 31,$\ldots \,$ here defined for the first time) of $10 \le n \le 50$ in Fig. 5a and $50 \le n \le 100$ in Fig. 5b. We present the residual calculated between exact and approximation by Eq. (18) in Fig. 6. Also, Fig. 7a ($10 \le n \le 50$ ) and 7b ($50 \le n \le 100$ ) show that, for the new integer sequence, the points in this sequence do not look as outliers of the general trend, indicating that the geometric information introduced by just including the minimum angle between pairs of electrons leads to a model of low complexity and also good approximation capability.

As Supplementary Information, we have provided the approximations by the models presented in this work along with the MSE score for the prediction of normalized energy in Table S1 and Energy of Thomson in Table S2 for the reader’s convenience.

Conclusions

In this paper, we have shown that the minimum potential energy of the Thomson problem, E(n), can be well approximated by a formula for the form: $E(n) = (n^2/2) \, e^{g(n)}$ where g(n) is obtained using a truncated analytic continued fraction expansion. The methodology used here is very general and could eventually be used for other cases such as Reisz potential and other variants of this problem of interest^35,36,36. Investigation of the sequence of values of n for which the residual error of our approximations was a bit higher than the trend lead to the introduction of a new sequence of integers. This, in turn, suggested that perhaps another expression could be found involved $\alpha (n)$ is the smallest angle, measured in radians, which is subtended by the vectors associated with the nearest charge pair in the optimal configuration for n electrons. These values are only known after a very hard non-linear optimization is solved. Future work involves finding an approximation of $\alpha (n)$ using only the value of n which could be potentially something of interest for building constructive heuristics for the challenging non-linear optimization problem for the general case of n electrons.

Data availability

The datasets generated during and/or analyzed during the current study are available in the public domain. The Global Minima for the Thomson Problem for $n\le 400$ are available³² at: https://www-wales.ch.cam.ac.uk/~wales/CCD/Thomson/table.html and³⁷ https://en.wikipedia.org/wiki/Thomson_problem. The files containing (x, y, z) coordinate values are available³² at: https://www-wales.ch.cam.ac.uk/~wales/CCD/Thomson/xyz/. The lowest found minima for the Thomson problem for some larger values in the range of $400 \le n\le 4352$ (and associated (x, y, z) coordinates of the configurations) are available³³ at: http://www.junlilab.org/database/TP2.html.

References

Michaud, E. J., Liu, Z. & Tegmark, M. Precision machine learning. Entropy 25, 175 (2023).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Schmidt, M. & Lipson, H. Distilling free-form natural laws from experimental data. Science 324, 81–85 (2009).
Article ADS CAS PubMed Google Scholar
Schmidt, M. & Lipson, H. Eureqa (version 1.24.0). Software (2018).
Udrescu, S.-M. & Tegmark, M. AI Feynman: A physics-inspired method for symbolic regression. Sci. Adv. 6, eaay2631 (2020).
Article ADS PubMed PubMed Central Google Scholar
Udrescu, S. et al. AI Feynman 2.0: Pareto-optimal symbolic regression exploiting graph modularity. In: Larochelle, H., Ranzato, M., Hadsell, R., Balcan, M. & Lin, H. (eds.) Advances in Neural Information Processing Systems 33 (NeurIPS 2020) (2020).
Krone-Martins, A., Ishida, E. E. D. O. & De Souza, R. The first analytical expression to estimate photometric redshifts suggested by a machine. Mon. Not. R. Astron. Soc Lett. 443, L34–L38 (2014).
Article ADS Google Scholar
Graham, M., Djorgovski, S., Mahabal, A., Donalek, C. & Drake, A. Machine-assisted discovery of relationships in astronomy. Mon. Not. R. Astron. Soc. 431, 2371–2384 (2013).
Article ADS Google Scholar
de Vries, N. J., Carlson, J. & Moscato, P. A data-driven approach to reverse engineering customer engagement models: Towards functional constructs. PLOS ONE 9, e102768 (2014).
Article ADS PubMed PubMed Central Google Scholar
de Vries, N. J., Reis, R. & Moscato, P. Clustering consumers based on trust, confidence and giving behaviour: Data-driven model building for charitable involvement in the australian not-for-profit sector. PLOS ONE 10, e0122133 (2015).
Article PubMed PubMed Central Google Scholar
Moscato, P. & de Vries, N. J. Marketing meets data science: Bridging the gap. In: Business and Consumer Analytics: New Ideas, 3–117 (Springer, 2019).
Fitzsimmons, J. & Moscato, P. Symbolic regression modelling of drug responses. In: First IEEE Conference on Artificial Intelligence for Industries, Sep 26, 2018 - Sep 28, 2018 (Laguna Hills, CA, 2018).
Dolan, E. D., More, J. J. & Munson, T. S. Benchmarking optimization software with COPS 3.0. Tech. Rep. ANL/MCS-TM-273, Argonne National Laboratory, Mathematics and Computer Science Division, Illinois 60439, United States (2004).
Smale, S. Mathematical problems for the next century. Math. Intell. 20, 7–15 (1998).
Article MathSciNet MATH Google Scholar
Borodachov, S. V., Hardin, D. P. & Saff, E. B. Discrete Energy on Rectifiable Sets (Springer, New York, 2019).
Book MATH Google Scholar
Henn, A. in The Hexagonal Lattice and the Epstein Zeta Function, chap. Chapter 7, 127–140 (World Scientific, 2016).
Rakhmanov, E. A., Saff, E. B. & Zhou, Y. Minimal discrete energy on the sphere. Math. Res. Lett. 1, 647–662 (1994).
Article MathSciNet MATH Google Scholar
Kuijlaars, A. & Saff, E. Asymptotics for minimal discrete energy on the sphere. Transact. Am. Math. Soc. 350, 523–538 (1998).
Article MathSciNet MATH Google Scholar
Glasser, L. & Every, A. G. Energies and spacings of point charges on a sphere. J. Phys. A Math. Gen. 25, 2473–2482 (1992).
Article ADS Google Scholar
Morris, J. R., Deaven, D. M. & Ho, K. M. Genetic-algorithm energy minimization for point charges on a sphere. Phys. Rev. B 53, R1740–R1743 (1996).
Article ADS CAS Google Scholar
Wales, D. J. Symmetry, near-symmetry and energetics. Chem. Phys. Lett. 285, 330–336 (1998).
Article ADS CAS Google Scholar
Erratum. Chem. Phys. Lett. 294, 262 (1998).
Michaels, T. Equidistributed icosahedral configurations on the sphere. Comput. Math. Appl. 74, 605–612 (2017).
Article MathSciNet MATH Google Scholar
Moscato, P. On evolution, search, optimization, genetic algorithms and martial arts: Towards memetic algorithms. Caltech Concurr. Comput Program, C3P Report 826, 37 (1989).
Cotta, C., Mathieson, L. & Moscato, P. Memetic algorithms. In Martí, R., Pardalos, P. M. & Resende, M. G. C. (eds.) Handbook of Heuristics, 607–638 (Springer, 2018).
Sun, H. & Moscato, P. A memetic algorithm for symbolic regression. In: 2019 IEEE Congress on Evolutionary Computation (CEC) (IEEE, Wellington, New Zealand, 2019).
Cuyt, A., Petersen, V. B., Verdonk, B., Waadeland, H. & Jones, W. B. Continued Fraction Representation of Functions 29–43 (Springer, Netherlands, Dordrecht, 2008).
Google Scholar
Cuyt, A. et al. Continued fractions for special functions: Handbook and software. In: Cuyt, A., Krämer, W., Luther, W. & Markstein, P. (eds.) Numerical Validation in Current Hardware Architectures, 27–40 (Springer Berlin Heidelberg, 2009).
Moscato, P., Sun, H. & Haque, M. N. Analytic continued fractions for regression: Results on 352 datasets from the physical sciences. In: IEEE Congress on Evolutionary Computation, CEC 2020, July 19-24, 2020, 1–8 (IEEE, Glasgow, United Kingdom, 2020).
Moscato, P., Sun, H. & Haque, M. N. Analytic continued fractions for regression: A memetic algorithm approach. Expert Syst. Appl. 179, 115018 (2021).
Article Google Scholar
Moscato, P. et al. Multiple regression techniques for modelling dates of first performances of Shakespeare-era plays. Expert Syst. Appl. 200, 116903 (2022).
Article Google Scholar
Moscato, P., Haque, M. N., Huang, K., Sloan, J. & de Oliveira, J. C. Learning to extrapolate using continued fractions: Predicting the critical temperature of superconductor materials. arXiv e-prints (2020). arXiv:2012.03774.
Wales, D. J. & Ulker, S. Structure and dynamics of spherical crystals characterized for the Thomson problem. Phys. Rev. B 74, 212101 (2006).
Article ADS Google Scholar
Wales, D. J., McKay, H. & Altschuler, E. L. Defect motifs for spherical topologies. Phys. Rev. B 79, 224115 (2009).
Article ADS Google Scholar
Levin, Y. & Arenzon, J. J. Why charges go to the surface: A generalized Thomson problem. Europhys. Lett. (EPL) 63, 415–418 (2003).
Article ADS CAS Google Scholar
Vernizzi, G. & de la Cruz, M. O. Faceting ionic shells into icosahedra via electrostatics. Proc. Natl. Acad Sci. 104, 18382–18386 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Bowick, M. J., Cacciuto, A., Nelson, D. R. & Travesset, A. Crystalline particle packings on a sphere with long-range power-law potentials. Phys. Rev. B 73, 024115 (2006).
Article ADS Google Scholar
Wikipedia contributors. Thomson problem—Wikipedia, the free encyclopedia (2022). [Online; accessed 18-October-2022].

Download references

Acknowledgements

We thank Jeferson Arenson, Luke Mathieson, Mehdi Abedi, M. A. Hakim Newton and Markus Wagner for their comments on an earlier version of the manuscript. The authors would also like to thank M. Schmidt (from Nutonian) and Rafael Ruggiero (from TuringBot) for making available, in the past, their symbolic regression packages via an academic license agreement. This work was supported by the Australian Government through the Australian Research Council’s Discovery Projects funding scheme (project DP200102364). P.M. acknowledges a generous donation from the Maitland Cancer Appeal.

Author information

Authors and Affiliations

School of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW, 2308, Australia
Pablo Moscato & Mohammad Nazmul Haque
The Australian National University, Canberra, ACT, 2600, Australia
Anna Moscato

Authors

Pablo Moscato
View author publications
Search author on:PubMed Google Scholar
Mohammad Nazmul Haque
View author publications
Search author on:PubMed Google Scholar
Anna Moscato
View author publications
Search author on:PubMed Google Scholar

Contributions

P.M. and A.M. conceived the experiment(s), P.M. developed putative models with symbolic regression, M.N.H. wrote software code for analytic continued fraction regression with integer coefficients and run experiments, A.M. contributed with other supporting code and P.M, M.N.H. and A.M. analysed the results. All authors wrote, revised and reviewed the manuscript.

Corresponding author

Correspondence to Pablo Moscato.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Moscato, P., Haque, M.N. & Moscato, A. Continued fractions and the Thomson problem. Sci Rep 13, 7272 (2023). https://doi.org/10.1038/s41598-023-33744-5

Download citation

Received: 06 September 2022
Accepted: 18 April 2023
Published: 04 May 2023
DOI: https://doi.org/10.1038/s41598-023-33744-5

This article is cited by

Approximating the nuclear binding energy using analytic continued fractions
- Pablo Moscato
- Rafael Grebogi
Scientific Reports (2024)
New alternatives to the Lennard-Jones potential
- Pablo Moscato
- Mohammad Nazmul Haque
Scientific Reports (2024)
Mathematical Modelling of Peak and Residual Shear Strength of Rough Rock Discontinuities Using Continued Fractions
- Olivier Buzzi
- Michael Jeffery
- Mohammad Nazmul Haque
Rock Mechanics and Rock Engineering (2024)