Abstract
The 100-km ultra-marathon is one of the most popular ultra-marathon distances. While we have a lot of scientific knowledge, no data exist about the influence of race course characteristics and other geographical aspects, on race performance. Therefore, the aims of this study were (i) to investigate where the fastest 100-km races are held and where the fastest runners originate from, (ii) to evaluate a potential influence of specific race characteristics (i.e., influence of elevation and race course characteristics) on performance, and (iii) to assess the influence of individual athlete performance against the other investigated factors. A total of 858,544 race records (732,748 from men and 125,796 from women) from 317,312 unique runners originating from 103 different countries and participating in 2,648 100-km races held in 80 different countries worldwide between 1892 and 2022 were analyzed using several descriptive, inferential and predictive methods, including a machine learning XG Boost Regression model. We evaluated the influence on the average running speed (in km/h) of factors such as gender of the athlete, age group, country of origin of the athlete, country where the race was held, course characteristics (i.e. mountain, trail, road, or track race) and elevation (i.e. flat or hilly course). The relative effect of the individual athlete performance was also investigated through a Mixed Effects Linear model. Discounting the fact that individual athlete performance is between 3 and 4 times ahead in race speed influence compared to the other factors, the model rated elevation (0.85) as the most important variable ahead of the country where the race was held (0.07), gender (0.02), age group (0.02), the country of origin of the runner (0.02) and the course characteristics (0.02). Running on a track (9.32 km/h) was the fastest ahead of road running (8.11 km/h), trail running (6.21 km/h) and mountain running (5.74 km/h). Flat running (8.85 km/h) was faster than running on a hilly course (6.57 km/h). The fastest athletes originated from African and Eastern European countries, with Swaziland (13.15 ± 0.88 km/h), Botswana (11.61 ± 2.22 km/h), Belarus (11.10 ± 2.29 km/h), Kazakhstan (10.74 ± 3.78 km/h), and Cape Verde (10.49 ± 2.26 km/h) in the top five. Africa, the Middle East, and Europe hold the fastest 100 km races, with Botswana (12.23 ± 1.35 km/h), Qatar (12.10 ± 1.63 km/h), Belarus (11.24 ± 1.27 km/h), Jordania (11.05 ± 1.58 km/h), and Montenegro (10.63 ± 1.90 km/h) in the top five. In summary, elevation was the most important variable in 100-km ultra-marathon running ahead of the country where the race was held, gender, age group, country of origin of the runner and course characteristics. Running on a track was the fastest ahead of road, trail and mountain running. Flat running was faster than running on a hilly course. Africa, the Middle East, and Europe hold the fastest 100 km races. Common for the fastest 100-km race courses was the fact that they were mainly indoor races and/or Championships. The fastest runners originated mainly from former republics of the dissolved Soviet Union. Future studies might select the fastest 100-km race courses.
Similar content being viewed by others
Introduction
In ultra-marathon running, the 100-km race is among the most popular1 and most traditional events2. This is also reflected by the scientific research in recent years, where studies have been performed regarding different topics about the 100-km race. Several studies investigated the aspect of training3,4, pre-race preparation5, previous experience, prediction of race performance5,6, age-related performance decline7 and age of peak performance8. Other aspects were pacing9,10 and gender differences in performance11.
Apart from sports science, medical topics were also investigated such as the influence of 100-km running on the heart12,13, the kidneys14, the immune system15,16, the endocrine system17,18, the skeletal muscle19, the mood20,21, and the regulation of the acid–base balance22. Also, nutritional aspects and aspects of fluid metabolism were considered, especially during the race23,24. Regarding fluid metabolism, the influence of dehydration25, electrolyte regulation25, fluid intake during the race26,27, and exercise-associated hyponatremia28 were investigated.
The aspect of the origin of the fastest 100-km ultra-marathoners has also been examined29,30. Regarding the origin of the fastest 100-km ultra-marathoners, a study from 2014 concluded that the fastest runners originated from Japan31. A study from 2018 found, however, that runners from Russia were the fastest in 100-km running29. The disparate findings might be explained by the different analytical approaches (i.e., single, and multi-level regression analyses/regression analyses adjusted by gender, age, and year) and/or the different time frames (1998–2011 versus 1959–2016) of these two studies. Nevertheless, an up-to-date analysis is needed to confirm or reject the more recent finding of Russian dominance29.
Furthermore, the kind of race course has most probably also an influence on overall race time. Little is known about the effect on course characteristics on ultra-marathon performance. Knowledge about this influence on 100-km ultra-marathon performance might help athletes and coaches to select the most suitable race. For marathon running, topography has a high influence on race time32. For a very fast marathon, undulations and curves have a high impact on race time33. In marathon running, altitude above sea level is of importance where race times in marathons at sea level are considerably faster than race times of marathons held in altitude34,35. Since we have no specific knowledge about a potential influence of changes in elevation and race course characteristics in 100-km ultra-marathon running, more investigation is needed.
Therefore, the present study aimed, first, to re-evaluate the origin of the fastest 100-km ultra-marathoners using a different approach (i.e., machine learning). A second aim was to evaluate where the fastest 100-km races are held. And a third aim was to investigate a potential influence of race course characteristic such as elevation changes (i.e., hilly, or flat course) and the kind of race course (i.e., mountain, trail, road, or track). Based upon recent findings, we hypothesized that Russians were the fastest 100-km ultra-marathoners. Considering race course characteristics, we assume that they would have an influence where faster race times would be achieved on flatter race courses.
Method
Ethical approval
This study was approved by the Institutional Review Board of Kanton St. Gallen, Switzerland, with a waiver of the requirement for informed consent of the participants as the study involved the analysis of publicly available data (EKSG 01/06/2010). The study was conducted following recognized ethical standards according to the Declaration of Helsinki adopted in 1964 and revised in 2013.
Data set and data preparation
Official race records from all 100-km ultra-marathons held since 1892 were obtained from the official DUV (Deutsche Ultramarathon Vereinigung) website (https://statistik.d-u-v.org). Each record included the athlete´s first and last name, age group, gender, country of origin, race name, location and year, race distance, and the athlete’s race time. The raw dataset comprised a total of 858,722 race records. This data set was checked for consistency, removing any incomplete or impossible records. In order to minimize the presence of outliers, a top race running speed of 21 km/h was set. Likewise, any countries with less than 10 race records in the sample were removed. The resulting cleaned sample used for analysis and modeling consisted of 858,544 race records. The dataset was further augmented by adding elevation data (as per the DUV website) and the races were classified into hilly or flat courses (as per the DUV website) since only a few races indicated numerical values of changes in altitude. Furthermore, we checked for race course characteristics as indicated in the DUV website and classified the races into mountain, trail, road, and track races.
Statistical data analysis
Once the dataset was cleaned, we commenced the analysis creating two independent ranking tables, by aggregating the race records by country of origin and country of event, and then sorting each list of countries by average running speed, with the fastest countries at the top. The results are summarized in two large reference tables, and included the number of records, the number of unique runners and running speed and descriptive statistical values of the running speed target are given as mean, standard deviation (std), minimum values (min), and maximum values (max). Also, median values are used in the box plots. In addition to giving us a descriptive view of the performance and participation in each country of origin or event, these ranking tables served the purpose of sorting the countries and provide a numerical index that will later be used in their encoding. The association of running speed with the type of race course and the elevation were analyzed using a set of boxplots charts. We built and evaluated three different data models: (i) a Multivariate Linear Regression model, (ii) a Mixed Effects Linear Regression (MELR) model, and (iii) a non-linear ML (Machine Learning) predictive model based on the XG Boost Regression algorithm. In all cases, the full sample was used for training and evaluating the models (in sample tests). Not surprisingly, the XG Boost model obtained the highest predictive score, although each model provided some insights. We also looked into the XG Boost model logic through some explainability tools. The model features relative importances are a measure of how the model rates the predictors in their ability to split the sample data into groups of lower entropy, hence working optimally towards the objective of making an accurate prediction. This tongue-twister means the model rating of “relative importance” often, but not always, matches our human-perceived importance. In this respect, the use of other models to compare against is a good thig when evaluating the results.
Numerical encoding of categorical variables
Before the data models could be trained, the predictor’s data needed to be converted (encoded) into numerical data. The Athlete gender variable was encoded as female = 0 and male = 1. The Age group variable was numerically encoded in 5-year groups (except group 18, which represents runners of less than 20 years, and group 75, which represents 75 years and older). The Athlete country and Event country variables were encoded based on their position in the ranking tables, with the countries with the fastest average running speeds at the top. The Course and Elevation variables were encoded in increasing order of average running speed, that is, the first is hilly = 0, flat = 1, and the second is mountain = 0, trail = 1, road = 2, and track = 3.
Models training and evaluation strategy
No hold-out evaluation strategy was used to train and evaluate the models, as it was not our intent to do any out-of-sample predictive work. Our aim was just to train the models with the full sample and then use different applicable methods to obtain answers to our research questions. The MLR models, a multivariate extension of the common linear regressor, does not need any specific configuration. The model achieved a R2 = 0.379 and rated all factors as statistically significant (this likely due to the large sample size). The second of the models, the MELR model was built to evaluate the level of extent to which individual athlete performance influenced our target variable (the running speed), in the context of the other variables under study. The model rated all factors as statistically significant, and showed an individual athlete variance coefficient of ⁓2.231 (km/h)2 which is between three and four times larger than the next factors (gender, elevation and course). The third and most complex model, the non-linear XG Boost regressor, was run with different test splits and different combinations of estimators and learning rates. The optimal XG Boost model was finally built with n_estimators = 500 and trained with the full data sample and a learning_rate = 0.25, obtaining a predictive score of R2 = 0.51 well ahead of the MLR model (0.379).
XG Boost model interpretation
Beyond the R2 metrics aforementioned, we further exploited the interpretability possibilities of the ML model. We first computed and plotted the model relative features importances. These importances refer to a rather technical aspect of the ML model function that has to do with the effectiveness of using a specific feature to split the data, in order to obtain groups with a lower entropy level. It often coincides with the real-life importances but not always. We also calculated and plotted the model prediction distributions, based on the Partial Dependent Plots (PDP) library. These ML explainability tools allow us to look into that 51% of explained variability in the race speed for each predictor. The prediction distribution plots use boxplots to show the distribution of the model predictions of average race speed.
All computation and analysis were done using a Jupyter Notebook (Google Colab) and Python and associated libraries (pandas, numpy, xgboost, pdpbox, sklearn, matplotlib, sns).
Results
The clean dataset used in the analyses contained a significant sample of 858,544 race records (732,748 from men and 125,796 from women) from 317,312 unique runners from 103 different countries finishing in 2648 100-km races held in 80 different countries worldwide between 1892 and 2022.
Athlete country ranking
The country ranking is shown in Table 1 where the fastest athletes originated from African and Eastern-European countries with Swaziland, Botswana, Belarus, Kazakhstan, and Cape Verde as the top five. Note that, aside Belarus with 118 unique runners and 396 race records, all the countries among the top five have small samples. In the 6th and 8th positions are two other East European countries such as Lithuania and Russia with more representative samples, with Hungary, Latvia and Slovakia next. In the opposite end of the performance axis, we can find most of the South Asian countries including Hong-Kong, China, Philippines, Singapore and others.
Event country ranking
The ranking of countries by race events shown in Table 2. A combination of Africa, the Middle East, and Europe countries seem to hold the fastest 100 km races, with Botswana, Qatar, Belarus, Jordania, and Montenegro as the top five (albeit with very small samples all but Qatar). Netherlands scores next with a much more sizeable sample.
MLR model results
The linear model scored a R2 value of 0.379 serving as a baseline for comparison. All factors were assessed as being statistically significant.
MLER model results
The main result of the mixed effects model is that the effect of the individual athlete performance is between 3 and 4 times larger than the following factor under consideration (elevation, course and gender).
XG Boost regression model results
The ‘optimal’ XG Boost model (sample size 858,544, 500 estimators and learning rate 0.25) achieved a score value of R2 = 0.51 (MAE (km/h) 1.22) which indicates a moderate effect of the predicting variables in the model output, and not surprisingly higher than the linear model. Figure 1 shows the model features relative importances with elevation (0.85) being overwhelmingly relevant ahead of the country where the race was held (0.07), gender (0.02), age group (0.02), country of origin of the runner (0.02) and course characteristics (0.02).
Combined prediction distributions and target plots
The PDP library allows to look in more detail to the associations between predictors and target. The so-called target plots represent a descriptive visualization of the 100-km race dataset by predictor and show the group’s sizes and average running speeds. The prediction plots use boxplots to show the distribution of the XG Boost model output (the predicted running speed) by predictor. For gender (Fig. 2), age group (Fig. 3), course type (Fig. 4) and elevation (Fig. 5) all possible values are displayed. The charts show that men (7.42 km/h) were faster than women (6.68 km/h), runners in age groups 35–39 years and 40–44 years were the fastest, running on track (9.32 km/h) was the fastest ahead of road (8.11 km/h), trail (6.21 km/h) and mountain (5.74 km/h) running. Furthermore, flat running (8.85 km/h) was faster than running on a hilly course (6.57 km/h). Since the athlete (Fig. 6) and event (Fig. 7) countries have a very high cardinality, only the first 20 elements are shown. The bottom chart in the sets show the number of race records in each predictor group. The red line chart (in the middle) represents each group’s average race speed, while the boxplot at the top represents the predictive model output with the 2nd quartile (median value) in the box label. In general, the results replicate those obtained in the descriptive analysis. However, the prediction charts show some peaks for Kazakhstan and Kenia in the athlete country chart and for Qatar, the Netherlands and Slovakia in the event country chart. Given the high sensitivity of ML models, it is always wise to ponder the relevance of any observations with the specific group size.
Type of race course and elevation by gender
Figure 8 shows the running speed by gender regarding elevation where men were always faster than women and running on a flat course was faster than running on a hilly course for both genders. Figure 9 presents running speed by gender in respect of the race course characteristics. Men were always faster than women and the fastest running speeds were achieved in track running ahead for road, trail and mountain running.
Discussion
The aims of this study were threefold where we wanted to know (i) the origin of the fastest runners, and the countries where the fastest 100-km races are held, (ii) a potential influence of race characteristics on performance, (iii) the relative relevance of the predictors or factors under study against the individual athlete performance. The most important findings were that (i) elevation was the most important predictive variable, after discounting individual performance which weighed between 3 and 4 times the following factor (ii) running on a track was the fastest, (iii) flat running was faster than running on a hilly course, (iv) the fastest athletes originated from African and, most notably from East-European countries and (v) the fastest race courses were found in Africa, in the Middle East, and in Europe.
Change in elevation as the most important predictor
Our most important finding was that the model rated elevation as the most important variable ahead of the country where the race was held. A potential explanation for the high influence of elevation could be anthropometry where lower body mass might be helpful for ascents. A study investigating performance determinants in trail running races of different distances reported that body fat percentage was a predictor in a trail run of medium distance36. Furthermore, a loss in body mass during ultra-marathon trail-running seemed also to be of importance regarding ultra-running performance37. Also, physiological aspects such as running economy might have a considerable influence on running races with covering altitude38. Furthermore, alterations in neuromuscular function might have an impact on mountain ultra-marathon running39. Overall, more studies are needed to investigate the influence of changes in elevation on ultra-marathon performance.
Running on a track as the fastest race course
A further important finding was that running on a track was faster than road, trail and mountain running. Similar findings were found in 72-h ultra-marathon running where the fastest races are held on track, followed by road, and then trail40. We assume that running on a 400-m track is more efficient than road running or running on trails and in the mountains. Athletes can better pace in track running with a consistent speed and a predictable pace based in running distance per time unit (e.g. km per minute)41. In contrast, in trail running, the pacing is more effort-based and must be adapted to the terrain42. Future studies might evaluate the fastest track races in 100-km ultra-marathon running.
Race location as the second important predictor
A further important finding was that the country (event location) where the race is held was the second most important variable. Africa, Middle East, and Europe have the fastest 100 km races, with Botswana (23 runners with 47 race finishes), Qatar (143 runners with 319 finishes), Belarus (82 runners with 137 finishes), Jordania (17 runners with 17 finishes), and Montenegro (also 17 runners with 26 finishes) as the top five.
A very likely explanation is that only a few runners competed in these locations and the density was very high. Regarding Botswana, the ‘Salt Pans Ultra Marathon’ is a 100 km race with the fastest runner finishing in 9:11:40 h:min:s and the slowest in 19:08:03 h:min and a difference of around 10 h between the fastest and the slowest (https://saltpansultra.com/). Considering Qatar, the explanation was that in 2014, the IAU 100 km World Championship open race with 143 finishers was held in Doha, Qatar (https://worldathletics.org/news/report/max-king-ellie-greenwood-iau-100km-world-cham). Regarding the race times, 13 men were below 7 h in the World Championship. The best athletes likely competed in a World Championship and obtained one of their best performances in life43. However, a total of 319 runners were competing in a 100-km ultra-marathon held in Qatar, so the World Championship was only one opportunity to achieve a fast race time. These results should be considered in light of our limitations, which included the lack of information regarding the environmental factors or additional geographical information that could provide a qualitative explanation.
Apart from Qatar and Belarus, fast 100-km races were held in Jordan, the Netherlands, Montenegro, Slovakia, Egypt, Lithuania, and Croatia. It was common for these countries to hold Championships at the national or international level. In 2019, the IAU 100 km Asia and Oceania Championships were held in Jordan (https://iau-ultramarathon.org). In the Netherlands, the ‘Run Winschoten’ was held since 1976 (www.runwinschoten.nl). In that event, more than 4000 athletes have already competed, and by the end of 2022, 235 runners have achieved a race time below 7 h. The race organized several Championships, such as the National, the European, and the World Championships. In Montenegro, the ‘Podgorica 100 km Ultramarathon’ was held from 2005 to 2007 as a regular race; however, it was not a Championship. Moreover, Slovakia has a long tradition of running 100-km ultra-marathons. The ‘Medzinárodný cestný beh “Družba” 100 km Košice’ was held from 1974 until 1992 with 476 finishers, and the ‘100 km Self Transcendence Run Nitra’ was held from 2009 to 2019 (https://cs.srichinmoyraces.org/). In Croatia, the ‘100 km Varazdin’ was held from 1979 to 1988; since 2018, the ‘Polojska ultra 100 km’ was held. Furthermore, in 2018, the 30th IAU 100 km World Championship was held in Sveti Martin with 221 finishers, where 14 runners finished below 7 h. In Lithuania, for example, a 100 km race was held in 2023, with Aleksandr Sorokin winning in 06:05:35 h:min:s (www.ultramarathon.org/). In Egypt, the ‘100 km Pharaonic Race’ is iconic and one of the oldest races in Egypt (www.sportseventsegypt.com/event/100-km-pharaonic-race/). Future studies might investigate more deeply the influence of the particular races held in these countries.
The fastest runners are from African and East-European countries
We also found that the fastest athletes originated from African and East-European countries, with Swaziland (5 runners with 18 finished races), Botswana (11 runners with 51 finished races), Belarus (118 runners with 396 finished races), Kazakhstan (11 runners with 29 races), and Cape Verde (9 runners with 48 races) as the top five countries. We could, therefore, not confirm recent findings that Russians were the fastest 100-km ultra-marathoners. Most likely it is more appropriate to say that these six countries had lower average times, but not necessarily have the best or fastest athletes. We assume that the low numbers of athletes are due to a highly selected population and the high average running speed is due to the low difference between the slowest and the fastest race times.
Regarding Swaziland with 18 finishers, we assume that Swaziland only participates with its best athletes. Regarding Russia (5,180 finishers), Germany (74,162 finishers), France (113,986 finishers) or Japan (205,908 finishers), the average running speed is most likely compromised by the large number of slower participants. Interestingly, from these six fastest countries, only a very small number of athletes originated, considering the sample of more than 850′000 runners. We assume that these runners have a very high density in performance. Regarding Swaziland, the fastest male runner achieved 7:05:17 h:min:s while the slowest man had 9:05:35 h:min:s with a difference of 2 h between the fastest and the slowest runner (www.ultramarathon.org/). This could also be because there is no running boom, recreational activity, or money to stimulate the general public to travel and participate in races. Most runners competed in races held in Europe and Japan (www.ultramarathon.org/). Similarly, the fastest male runner from Botswana achieved 7:20:12 h:min:s while the slowest man finished at 09:59:08 h:min: with a difference of less than 3 h between the fastest and the slowest (www.ultramarathon.org/). Again, they competed mainly in European races held in the Netherlands, Great Britain, Italy, Spain, Belgium, and Russia (www.ultramarathon.org/). Also, runners from Belarus were among the fastest and also the second fastest races were held in that country. In contrast to the runners from Swaziland and Botswana, the fastest runner from Belarus finished a 100-km ultra-marathon in 6:33:56 h:min:s and the slowest in 22:55:12 h:min:s with a difference of more than 16 h between the fastest and the slowest (www.ultramarathon.org/). In Belarus, the ‘All-Union 100 km Run Grodno’ was held from 1988 to 1992 as a road-based race with 66 finishers (www.ultramarathon.org/). In 1995, the ‘100 km indoor Minsk’ was held with 6 finishers (www.ultramarathon.org/). From 2000 to 2014, the ‘Molodechno Int. 100 km ultramarathon indoor’ was held in Maladsetschna again with 66 finishers (www.ultramarathon.org/). Overall, a total of 138 runners were recorded, and 132 athletes competed indoors.
We might assume that a selected sample of runners competed in a 100-km ultra-marathon on an indoor track and most probably, a high percentage of the runners were from Belarus. Some studies found that local athletes preferably competed in their own country43,44. Further-more, in an indoor race, the influence of environmental conditions was eliminated. It has been shown that a high temperature or a high humidity could influence long-distance running speed and fluid loss more than a lower temperature and a lower humidity45. Apart from Belarus, fast 100-km ultra-marathoners originated from Kazakhstan, Cape Verde, and Lithuania. Regarding the density in performance, the fastest man from Kazakhstan finished a 100-km race in 6:31:41 h:min:s whereas the slowest needed 12:21:40 h:min:s with a difference of less than 6 h between the fastest and the slowest (www.ultramarathon.org/). For Cape Verde, 7:10:41 h:min:s for the fastest man and 12:17:32 for the slowest male runner, with a difference of around 5 h (www.ultramarathon.org/). Lithuania, with a higher number of runners, also showed a larger difference between the fastest and the slowest runner, where the fastest man finished in 6:05:35 h:min:s and the slowest in 18:11:52 h:min:s with a difference of more than 12 h (www.ultramarathon.org/). It was important to know that Belarus, Kazakhstan, and Lithuania were part of the former Soviet Union (www.history.com/topics/european-history/history-of-the-soviet-union). In a previous study, Russians were the fastest 100-km ultra-marathoners29. We should be aware that in 1991, Russia emerged from the dissolution of the Soviet Union as the independent Russian Federation, similar to the countries that composed the Soviet Union, which can impair our findings.
The age of peak performance
We also found that athletes aged 35–45 years were the fastest, which agreed with previous research46,47. A study investigating the age-related performance decline in 100 km ultra-marathoners competing in a single race (100 km Biel, Switzerland) reported that the best 100-km running times were observed for another age frame with 30–49 years for men and 30–54 years for women48. Another study investigating a large sample (148,017 finishes with 18,998 women and 129,019 men) and a longer time frame (1960–2012) showed that the age of the fastest female and male 100-km ultra-marathoners remained unchanged at ∼35 years46. In addition, our findings about the age of peak performance were in line with an analysis of the World athletics (formerly known as the International Association of Athletics Federations) database (1999–2015), which reported an age of 35.9 years in men and 36.6 years in women2.
Analytical considerations and limitations
A study from 2014 concluded that the fastest runners originated from Japan31. The result was based on the ten fastest races times by nationality of races held between 1998 and 201131. A study from 2018 that runners from Russia were the fastest in 100-km running29. In that analysis, finishes of races held between 1959 and with more than 14 h were removed (i.e. truncated data set)29. In the present analysis, data between 1892 and 2022 were considered (longer time frame) and no data were excluded, except incomplete data. The results summarized the observations across the descriptive charts (i.e., target plots) and the model interpretability charts (PDP and prediction plots). Some countries with small sample sizes (less than 10 records) in the 100 km sample within the 1892–2022 period but with faster runners may have been excluded from the analysis due to the methodology used. Athletes could change their country of residence/nationality over the years, which was not considered in the present study. Similarly, qualitative information regarding the event location was not available. This was an important limitation because it impaired the generalization of the findings regarding the environmental characteristics that had a positive impact on athletes’ performance. A further limitation was that we have not accounted for repeated measures since some athletes might have competed several times in the same or in another event. We had to include the change in elevation as a categorial variable (i.e. hilly versus flat) since exact data about changes in elevation in meters were not provided on the website. Other variables such as temperature, humidity, altitude, wind, etc. were also not available. These variables might also have an influence on race performance. Another limitation is that in one country more than one 100-km ultra-marathon could have been held. Future studies need to analyze each single 100-km race. On the other hand, the strength of the present study was its novel methodological approach since it was the first time that a machine learning model was used to predict 100 km running performance from age, gender, country of origin, and country of event. Furthermore, considering the popularity of 100-km ultra-marathon races, our findings would provide practical information for professionals working with ultra-marathon runners to set optimal performance goals depending on the event country. For athletes and coaches, these findings provide insight into ultra-marathon running, performance aspects and aspects relevant to performance. Athletes and coaches can now select a race for performance aspects selecting races flat race courses with little elevation, considering track race courses over road, trail or mountain races and races in particular countries, that may be more beneficial for a faster race performance. Future studies should consider factors such as training culture, access to coaching, genetic predisposition, and socioeconomic influences on athletic development.
Conclusion
In summary, elevation is the most important variable in 100-km running ahead of the country where the race was held, gender, age group, country of origin of the runner and course characteristics. Running on a track was the fastest ahead of road, trail and mountain running. Flat running was faster than running on a hilly course. Common for the fastest 100-km race courses was the fact that they were mainly indoor races and/or Championships. The fastest runners originated mainly from former republics of the dissolved Soviet Union. Africa, the Middle East, and Europe hold the fastest 100 km races. Future studies should consider investigating the culture of long-distance endurance events in these countries and explore how the natural environment can be used as an important characteristic of training, providing a safe, supportive context to training and participation in competitions.
Availability of data and materials
Availability of Data and Materials For this study, we have included official race results from the official DUV website (https://statistik.d-u-v.org). The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
Abbreviations
- DUV:
-
Deutsche ultramarathon vereinigung
- MAE:
-
Mean absolute error
- PDP:
-
Partial dependence plots
- XGBoost:
-
EXtreme gradient boosting
References
Stöhr, A. et al. An analysis of participation and performance of 2067 100-km ultra-marathons worldwide. Int. J. Environ. Res. Public Health 18(2), 362. https://doi.org/10.3390/ijerph18020362 (2021).
Knechtle, B., Scheer, V., Nikolaidis, P. T. & Sousa, C. V. Participation and performance trends in the oldest 100-km ultramarathon in the world. Int. J. Environ. Res. Public Health https://doi.org/10.3390/ijerph17051719 (2020).
Citarella, R. et al. Association between dietary practice, body composition, training volume and sport performance in 100-Km elite ultramarathon runners. Clin. Nutr. ESPEN. 42, 239–243. https://doi.org/10.1016/j.clnesp.2021.01.029 (2021).
Tanda, G. & Knechtle, B. Effects of training and anthropometric factors on marathon and 100 km ultramarathon race performance. Open Access J. Sports Med. 28(6), 129–136. https://doi.org/10.2147/OAJSM.S80637 (2015).
Knechtle, B., Knechtle, P., Rosemann, T. & Senn, O. What is associated with race performance in male 100-km ultra-marathoners–anthropometry, training or marathon best time?. J. Sports Sci. 29(6), 571–577. https://doi.org/10.1080/02640414.2010.541272 (2011).
Coquart, J. B. Prediction of performance in a 100-km run from a simple equation. PLoS ONE 18(3), e0279662. https://doi.org/10.1371/journal.pone.0279662 (2023).
Lepers, R., Knechtle, B. & Stapley, P. Trends in triathlon performance: Effects of sex and age. Sports Med. 43(9), 851–863. https://doi.org/10.1007/s40279-013-0067-4 (2013).
Nikolaidis, P. T. & Knechtle, B. Performance in 100-km ultramarathoners-at which age, it reaches its peak?. J. Strength Cond. Res. 34(5), 1409–1415. https://doi.org/10.1519/jsc.0000000000002539 (2020).
Lambert, M. I., Dugas, J. P., Kirkman, M. C., Mokone, G. G. & Waldeck, M. R. Changes in running speeds in a 100 KM ultra-marathon race. J. Sports Sci. Med. 3(3), 167–173 (2004).
Renfree, A., Crivoi do Carmo, E. & Martin, L. The influence of performance level, age and gender on pacing strategy during a 100-km ultramarathon. Eur. J. Sport Sci. 16(4), 409–415. https://doi.org/10.1080/17461391.2015.1041061 (2016).
Zingg, M. A., Rüst, C. A., Rosemann, T., Lepers, R. & Knechtle, B. Runners in their forties dominate ultra-marathons from 50 to 3,100 miles. Clinics (Sao Paulo). 69(3), 203–211. https://doi.org/10.6061/clinics/2014(03)11 (2014).
Rimensberger, C., Carlen, F., Brugger, N., Seiler, C. & Wilhelm, M. Right ventricular adaptations and arrhythmias in amateur ultra-endurance athletes. Br. J. Sports Med. 48(15), 1179–1184. https://doi.org/10.1136/bjsports-2013-092859 (2014).
Yoon, J. H., Park, Y., Ahn, J., Shin, K. A. & Kim, Y. J. Changes in the markers of cardiac damage in men following long-distance and ultra-long-distance running races. J. Sports Med. Phys. Fitness. 56(3), 295–301 (2016).
Kao, W. F. et al. Effects of 100-km ultramarathon on acute kidney injury. Clin. J. Sport Med. 25(1), 49–54. https://doi.org/10.1097/jsm.0000000000000116 (2015).
Kasprowicz, K. et al. Running a 100-km ultra-marathon induces an inflammatory response but does not raise the level of the plasma iron-regulatory protein hepcidin. J. Sports Med. Phys. Fitness. 53(5), 533–537 (2013).
Žákovská, A. et al. The effect of a 100-km ultra-marathon under freezing conditions on selected immunological and hematological parameters. Front Physiol. 8, 638. https://doi.org/10.3389/fphys.2017.00638 (2017).
Czajkowska, A. et al. The effect of the ultra-marathon run at a distance of 100 kilometers on the concentration of selected adipokines in adult men. Int. J. Environ. Res. Public Health. https://doi.org/10.3390/ijerph17124289 (2020).
Kasprowicz, K. et al. The effect of vitamin D(3) supplementation on hepcidin, iron, and IL-6 responses after a 100 km ultra-marathon. Int. J. Environ. Res. Public Health. https://doi.org/10.3390/ijerph17082962 (2020).
Jastrzębski, Z., Żychowska, M., Radzimiński, Ł, Konieczna, A. & Kortas, J. Damage to liver and skeletal muscles in marathon runners during a 100 km run with regard to age and running speed. J. Hum. Kinet. 45, 93–102. https://doi.org/10.1515/hukin-2015-0010 (2015).
Krokosz, D., Bidzan-Bluma, I., Ratkowski, W., Li, K. & Lipowski, M. Changes of mood and cognitive performance before and after a 100 km nighttime ultramarathon run. Int. J. Environ. Res. Public Health. https://doi.org/10.3390/ijerph17228400 (2020).
Krokosz, D., Lipowski, M., Aschenbrenner, P. & Ratkowski, W. Personality traits and vitamin D3 supplementation affect mood state 12 h before 100 km ultramarathon run. Front Psychol. 9, 980. https://doi.org/10.3389/fpsyg.2018.00980 (2018).
Jastrzębski, Z., Żychowska, M., Konieczna, A., Ratkowski, W. & Radzimiński, Ł. Changes in the acid-base balance and lactate concentration in the blood in amateur ultramarathon runners during a 100-km run. Biol. Sport. 32(3), 261–265. https://doi.org/10.5604/20831862.1163372 (2015).
Fallon, K. E., Broad, E., Thompson, M. W. & Reull, P. A. Nutritional and fluid intake in a 100-km ultramarathon. Int. J. Sport Nutr. 8(1), 24–35. https://doi.org/10.1123/ijsn.8.1.24 (1998).
Knechtle, B. et al. Branched-chain amino acid supplementation during a 100-km ultra-marathon–a randomized controlled trial. J. Nutr. Sci. Vitaminol. (Tokyo). 58(1), 36–44 (2012).
Bürge, J. et al. Maintained serum sodium in male ultra-marathoners–the role of fluid intake, vasopressin, and aldosterone in fluid and electrolyte regulation. Horm. Metab. Res. 43(9), 646–652. https://doi.org/10.1055/s-0031-1284352 (2011).
Cooper, R., Naclerio, F., Allgrove, J. & Jimenez, A. Creatine supplementation with specific view to exercise/sports performance: An update. J. Int. Soc. Sports Nutr. 9(1), 33. https://doi.org/10.1186/1550-2783-9-33 (2012).
Knechtle, B., Knechtle, P. & Rosemann, T. Do male 100-km ultra-marathoners overdrink?. Int. J. Sports Physiol. Perform. 6(2), 195–207. https://doi.org/10.1123/ijspp.6.2.195 (2011).
Knechtle, B., Knechtle, P. & Rosemann, T. Low prevalence of exercise-associated hyponatremia in male 100 km ultra-marathon runners in Switzerland. Eur. J. Appl. Physiol. 111(6), 1007–1016. https://doi.org/10.1007/s00421-010-1729-7 (2011).
Knechtle, B., Nikolaidis, P. T. & Valeri, F. Russians are the fastest 100-km ultra-marathoners in the world. PLoS ONE https://doi.org/10.1371/journal.pone.0199701 (2018).
Nikolaidis, P. T., Onywera, V. & Knechtle, B. Running performance, nationality, sex, and age in the 10-km, half-marathon, marathon, and the 100-km ultramarathon IAAF 1999–2015. J. Strength Cond. Res. 31(8), 2189–2207. https://doi.org/10.1519/JSC.0000000000001687 (2017).
Cejka, N. et al. Participation and performance trends in 100-km ultra-marathons worldwide. J. Sports Sci. 32(4), 354–366. https://doi.org/10.1080/02640414.2013.825729 (2014).
de Koning, J. J. World records: How much athlete? How much technology?. Int. J. Sports Physiol. Perform. 5(2), 262–267. https://doi.org/10.1123/ijspp.5.2.262 (2010).
Jamieson, J. P. The home field advantage in athletics: A meta-analysis. J. Appl. Soc. Psychol. 40(7), 1819–1848. https://doi.org/10.1111/j.1559-1816.2010.00641.x (2010).
Seffrin, A. et al. Origin of the Fastest 5 km, 10 km and 25 km Open-Water Swimmers-An Analysis from 20 Years and 9819 Swimmers. Int. J. Environ. Res. Public Health. 18, 21. https://doi.org/10.3390/ijerph182111369 (2021).
Berke, D. Performance comparison of long-distance running competitions in different meteorology and environment based influential conditions. Q. J. Hungar. Meteorol. Serv. 123(3), 313–328. https://doi.org/10.28974/idojaras.2019.3.4 (2019).
Cejka, N., Knechtle, B., Rüst, C. A., Rosemann, T. & Lepers, R. Performance and age of the fastest female and male 100-KM ultramarathoners worldwide from 1960 to 2012. J. Strength Cond. Res. 29(5), 1180–1190. https://doi.org/10.1519/jsc.0000000000000370 (2015).
Nikolaidis, P. T. & Knechtle, B. Do fast older runners pace differently from fast younger runners in the “New York City Marathon”?. J. Strength Cond. Res. 33(12), 3423–3430. https://doi.org/10.1519/jsc.0000000000002159 (2019).
Knechtle, B., Rüst, C. A., Knechtle, P., Rosemann, T. & Lepers, R. Age-related changes in ultra-triathlon performances. Extreme Physiol. Med. 1(1), 5. https://doi.org/10.1186/2046-7648-1-5 (2012).
Venturini, E. & Giallauria, F. Factors influencing running performance during a marathon: Breaking the 2-h barrier. Front Cardiovasc. Med. 2(9), 856875. https://doi.org/10.3389/fcvm.2022.856875 (2022).
Snyder, K. L. et al. Effects of course design (curves and elevation undulations) on marathon running performance: A comparison of Breaking 2 in Monza and the INEOS 1:59 Challenge in Vienna. J. Sports Sci. 39(7), 754–759. https://doi.org/10.1080/02640414.2020.1843820 (2021).
Lara, B., Salinero, J. J. & Del Coso, J. Altitude is positively correlated to race time during the marathon. High Alt. Med. Biol. 15(1), 64–69. https://doi.org/10.1089/ham.2013.1060 (2014).
Roi, G. S., Giacometti, M. & von Duvillard, S. P. Marathons in altitude. Med. Sci. Sports Exerc. 31(5), 723–728. https://doi.org/10.1097/00005768-199905000-00016 (1999).
Coates, A. M., Berard, J. A., King, T. J. & Burr, J. F. Physiological determinants of ultramarathon trail-running performance. Int. J. Sports Physiol. Perform. 16(10), 1454–1461. https://doi.org/10.1123/ijspp.2020-0766 (2021).
Pastor, F. S. et al. Performance determinants in trail-running races of different distances. Int. J. Sports Physiol. Perform. 17(6), 844–851. https://doi.org/10.1123/ijspp.2021-0362 (2022).
Bascuas, P. J. et al. Running economy in the vertical kilometer. Sensors (Basel). 23(23), 9349. https://doi.org/10.3390/s23239349 (2023).
Saugy, J. et al. Alterations of neuromuscular function after the world’s most challenging mountain ultra-marathon. PLoS ONE 8(6), e65596. https://doi.org/10.1371/journal.pone.0065596 (2013).
Knechtle, B. et al. Analysis of the 72-h ultramarathon using a predictive XG Boost model. Sport Sci. Health https://doi.org/10.1007/s11332-024-01243-3 (2024).
Casado, A., Hanley, B., Jiménez-Reyes, P. & Renfree, A. Pacing profiles and tactical behaviors of elite runners. J. Sport Health Sci. 10(5), 537–549. https://doi.org/10.1016/j.jshs.2020.06.011 (2021).
Genitrini, M., Fritz, J., Zimmermann, G. & Schwameder, H. Downhill sections are crucial for performance in trail running ultramarathons—a pacing strategy analysis. J. Funct. Morphol. Kinesiol. 7(4), 103. https://doi.org/10.3390/jfmk7040103 (2022).
Author information
Authors and Affiliations
Contributions
Author contributions Conceptualization: BK, Data curation: EV. Formal analysis: DV. Methodology: BK. Writing – original draft: BK, MT. Writing – editing: KW, VS, PTN, MA, IC, RG, TR.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This study was approved by the Institutional Review Board of Kanton St. Gallen, Switzerland, with a waiver of the requirement for informed consent of the participants as the study involved the analysis of publicly available data (EKSG 01/06/2010). The study was conducted following recognized ethical standards according to the Declaration of Helsinki adopted in 1964 and revised in 2013.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Knechtle, B., Weiss, K., Valero, D. et al. Change in elevation predicts 100 km ultra marathon performance. Sci Rep 15, 25176 (2025). https://doi.org/10.1038/s41598-025-09502-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-025-09502-0