An agent-based model reveals lost person behavior based on data from wilderness search and rescue

Hashimoto, Amanda; Heintzman, Larkin; Koester, Robert; Abaid, Nicole

doi:10.1038/s41598-022-09502-4

Download PDF

Article
Open access
Published: 07 April 2022

An agent-based model reveals lost person behavior based on data from wilderness search and rescue

Amanda Hashimoto¹,
Larkin Heintzman²,
Robert Koester^3,4 &
…
Nicole Abaid⁵

Scientific Reports volume 12, Article number: 5873 (2022) Cite this article

12k Accesses
23 Citations
18 Altmetric
Metrics details

Subjects

Abstract

Thousands of people are reported lost in the wilderness in the United States every year and locating these missing individuals as rapidly as possible depends on coordinated search and rescue (SAR) operations. As time passes, the search area grows, survival rate decreases, and searchers are faced with an increasingly daunting task of searching large areas in a short amount of time. To optimize the search process, mathematical models of lost person behavior with respect to landscape can be used in conjunction with current SAR practices. In this paper, we introduce an agent-based model of lost person behavior which allows agents to move on known landscapes with behavior defined as independent realizations of a random variable. The behavior random variable selects from a distribution of six known lost person reorientation strategies to simulate the agent’s trajectory. We systematically simulate a range of possible behavior distributions and find a best-fit behavioral profile for a hiker with the International Search and Rescue Incident Database. We validate these results with a leave-one-out analysis. This work represents the first time-discrete model of lost person dynamics validated with data from real SAR incidents and has the potential to improve current methods for wilderness SAR.

Location-based collective distress using large-scale biosignals in real life for walkable built environments

Article Open access 12 April 2023

A shift from human-directed to undirected wild land disturbances in the USA

Article 18 September 2025

Environmental risks to humans, the first database of valence and arousal ratings for images of natural hazards

Article Open access 14 June 2022

Introduction

Between 2004 and 2014, in the US National Parks alone, there were 46,609 individuals who became lost and required a search and rescue campaign, which cost about 51.4 million dollars in total¹. Wilderness search and rescue (SAR) not only consumes many hours and monetary expenses for searchers annually, but it also comes at a great cost to the searchers involved, both physically and mentally. These searches can involve great risks to human searchers due to the pressure from having to search large areas under strict time constraints, as time impacts chances of a lost person’s survival. However, coordinated SAR operations are the only way to help locate a missing individual alive. Therefore, it is of critical importance to make these searches as efficient as possible by utilizing methods to better understand lost person behavior.

A lost person (LP) is defined as a person unable to identify or orient themselves with respect to known locations and with no effective means or method for reorientation². In this definition, there are two parts: confusion in identifying current location and an inability to orient. This lack of ability to reorient oneself drives LPs to use a variety of different behaviors. In Lost Person Behavior, Robert Koester has defined behavior strategies that have been reported by LPs through the collection of incident data^3,4. Different types of LPs, differentiated by demographics such as age, cognitive or emotional state, and activity performed prior to being lost, are prone to specific reorientation behaviors². A hiker, for example, may rely on aids such as roads and trails for travel, while a person with dementia may travel in one direction regardless of the terrain^3,5. The psychological effect of being lost and behaviors associated with it have been studied extensively by environmental psychologists^6,7,8. In this work, we use these behavioral profiles, or lost person types (LPTs), which the SAR community refers to as Subject Categories, to inform our model of lost person dynamics in the wilderness.

As it is currently practiced, a SAR team will initially create a probability distribution map of likely locations of the LP based on terrain features, the profile of the LP, weather, and input from SAR experts³. Searchers must know the initial location of the LP, whether it is the point last seen by an eyewitness, or the last known point where there is substantial evidence to place the lost person. In either case, this point is called the initial planning point (IPP) and is used to measure the progress of the search. The incident commander will allocate resources and coordinate searchers based on the information available and the probability map⁹. Good planning and efficient task allocation in the first few hours of the search can make a significant difference to the success of the search. As time progresses, the survivability of the lost person decreases rapidly, especially so if the LP is not found within the first 51 hours; in other words, effective planning can mean life or death¹⁰. Thus, the quality of the predicted locations of the LP is critical to search operations. Currently, these predictions are based on heuristics employed by the incident commander, which may not be able to simultaneously take into account geophysical and transient landscape features, and demographic and activity information about the LP. The goal of this article is to introduce a novel dynamic model of lost person behavior which synthesizes information about the specific environment for a search, as well as characteristics of the LP drawn from a large database of search incidents.

The International Search and Rescue Incident Database (ISRID), created and curated by Koester and reported in Lost Person Behavior, has data from more than 145,000 searches from around the world and identifies strategies lost people may use^3,11,12. The book identifies more than 30 categories of LPs, based on activity or demographic information. The reported metrics include the horizontal and vertical distance found from the IPP, the time the lost person remained mobile while lost, and the status when found. The summarized metric statistics are distinct for each lost person category. In practice, searchers use these statistics to create ad hoc probability maps, like a distance-ring model⁵, in order to predict the location of the LP. However, these tactics assume that, by the time the search has started, the lost person has stopped moving¹³. As the community seeks to streamline SAR operations, search teams are deployed faster and the assumption that the LP is static no longer holds. Therefore, motion should be considered in the planning phase in order to create a probability map that evolves in time.

There are existing models in the literature on lost person dynamics in the wilderness. Many existing models of human behavior have been used mainly to study pedestrian dynamics, including force-based models showing collision avoidance, vision-based guidance, or goal-oriented behaviors^{14,15,16,17,18,19} and agent-based models based on behavioral heuristics^20,21,22,23. An LP model is fundamentally different from pedestrian dynamics, since the behavior of the LP depends heavily on the landscape and the LP type, as evidenced by the statistics in ISRID. Models of LP behavior in the literature are both deterministic and stochastic and may account for or neglect the local landscape. The watershed model analyzed by Doke²⁴ and described by Sava et al.¹³ embodies the idea that an LP will move on a path to minimize watershed crossings. The distance-ring model uses statistics from a database like ISRID to draw concentric circles to bound the LP’s possible position after a given time³, but neglects specific features of the landscape and only relies on the maximum distance an LP could potentially traverse. Also drawing from ISRID, McDaniel has created an agent-based model using behavior strategies and a detailed landscape environment²⁵, which would be strengthened by using real search incidents over summarized statistics. Morelle et al. developed a spatially explicit agent-based model to investigate the movement of individuals at the interface of urban and natural settings²⁶. While this model depends on the landscape, the model was only simulated in and around a midsized town environment and does not include time. A Bayesian model created by Lin and Goodrich only considers terrain, but not strategies or lost person types²⁷. Mohibullah has expanded the Bayesian model²⁷ and created an agent-based model using different strategies which also accounts for the fact that the LP has an internal state that evolves over time as it is moving, that is, the LP can become fatigued²⁸. In order to evaluate the model, the authors compared the simulated tracks to actual recordings of participant movement in a wilderness environment²⁹. Though motion tracks are desirable, a controlled experiment can lack the behaviors expressed from the true psychological effects of being lost, and an evaluation using genuine search incidents is preferable. Alanis et al. created a mechanistic model for a lost hiker that evaluates the influence of a simulated terrain in combination with lost person behavior using a finite time horizon Markov Decision Process³⁰. Though it incorporates both geographic information and behavioral analysis, the model lacks the use of real map data and actual search incidents. In contrast, Šerić et al. use real search incidents in a cellular automata-based algorithm to determine the search area for a lost person using walking speed as the main parameter³¹. Their results show that landscape features play a large role in LP movement and should always be considered when planning a search area. Another model by Metcalfe uses a probability-based movement within a grid and emphasizes the importance of topography and fatigue on lost person movement³². However, in both of these cases, the authors neglect lost person strategies. Considering previous literature, the gap in current knowledge becomes clear. In this paper, we propose a model that incorporates different lost person types, known behavior strategies, and a detailed landscape environment that is evaluated using a large database of lost person incidents.

We seek to use a different approach from the previous work on LP behavior which incorporates the idea that LPs can be of the different types defined in Lost Person Behavior³, and which generates specific routes taken by the LP on a known landscape. This approach gives an expectation for the LP location at an arbitrarily high spatiotemporal resolution. This work expands on a zeroth-generation version of the model³³ to include all salient behavior strategies and updated terrain features, and we evaluate the results using an LP incident dataset instead of the summarized statistics from ISRID. Through simulation of all possible distributions of behaviors, we are able to compare the lost person types (LPTs) taken from real-world data to the behaviors used in our model and validate the profiles against information from ISRID. We anticipate that this model, since it generates potential trajectories of an LP on the landscape, will facilitate SAR efforts as they currently are, as well as enable new SAR practices such as the use of UAV teams to search large areas of land more efficiently.

The organization of this paper is as follows. We begin by describing the model, including map generation, lost person behavior strategies, and behavioral profiles. Next, we outline the simulations, detailing the dataset used to fit the model, simulation parameters, and the metric to validate the model fitting. The results of the simulations are detailed next, followed by the discussion and conclusions.

Modeling

In this section, we describe how we generate the maps on which lost person dynamics are simulated, then we define the behavior strategies an LP may use and show how these strategies combine to make a particular lost person type.

Map

The LP is modeled as a self-propelled agent moving in discrete time on a two-dimensional square grid of fixed side length. The grid represents a specific map region and each cell is informed by the location’s geophysical characteristics. These specific characteristics are pulled from USGS geographic information system (GIS) data, which provides map layers at a given resolution, and are used in this work to define how the LP interacts with the environment.

Maps comprise two types of features: (1) linear features that an LP may follow and (2) inaccessible areas where an LP cannot traverse. When lost, it is common to follow predefined paths, like hiking trails, roads, railroads, powerline easements, and water features, and these one-dimensional routes are defined as linear features in the model. In addition to the structural linear features, we include paths defined by the elevation (with respect to sea level), like mountain crests and drainages. These features manifest as critical points in the magnitude of the gradient of elevation. After the gradient field is computed using the derivative of a Gaussian filter, the magnitude is computed and smoothed, and linear features are found using the Canny Edge Detection method by looking for local maxima and minima of a scalar field³⁴. The Supplementary Information contains details on implementing these data treatments.

In addition to linear features, the map contains inaccessible areas, specifically the interiors of lakes and wider rivers. By finding the boundaries of these water regions, we can separate the river and lake shorelines from their interiors and into the linear feature and inaccessible maps, respectively. In Fig. 1, an example of a map is shown with all linear feature and inaccessibility layers. As the agent moves on the grid, it will cross check its position with the feature map to make sure to avoid inaccessible areas and to use linear features depending upon its selected behavior strategy. Sufficiently steep terrain would also be considered an inaccessible area in theory, but this has not been implemented in the current version of the model.

Lost person behavior strategies

The model defines how the LP moves between cells on the spatially discrete map. At every time step, the agent can move from its position to any of the eight adjacent cells, or stay in its current cell, by using an algorithm that defines possible behavior strategies. In the current model, we have defined six behaviors that are derived from Lost Person Behavior³:

1.
Random Walking (RW) An agent moves randomly.
2.
Route Traveling (RT) An agent travels on a linear feature.
3.
Direction Traveling (DT) An agent moves cross-country in one compass direction, often ignoring trails and paths.
4.
Staying Put (SP) An agent actively stays in the same location.
5.
View Enhancing (VE) An agent attempts to gain a position of height.
6.
Backtracking (BT) An agent attempts to follow the exact route traveled previously.

Each lost person type or LPT is defined by a probability mass function (PMF) that captures the probability of the agent using a specific strategy at each time step. The PMF is a six-element vector of probabilities of each of the above strategies which sums to one. For example, an LPT with a probability of [RW, RT, DT, SP, VE, BT] = [$\frac{1}{2}$, 0, $\frac{1}{6}$, $\frac{1}{3}$, 0, 0] has a 50% chance of random walking, a 17% chance of direction traveling, and a 33% chance of staying put at a given time step. Independent realizations of this distribution of behaviors are generated at each time step and the agent’s position is updated by the randomly selected strategy.

To begin a simulation, the initial position of the agent $x(1)\in \mathbb {N}^2$ is selected and the second position $x(2)\in \mathbb {N}^2$ is randomly generated from the eight adjacent cells relative to x(1). The initial velocity is computed as $v(1) = x(2)-x(1)$. The difference between successive positions can be thought of as a velocity, which facilitates implementing some behaviors that rely on the direction of motion, like direction traveling. When the position is updated at each time step, we use a smoothing factor $\alpha$ to take into account the previous velocity and make the trajectories of similar smoothness to a walking individual by introducing one time step of memory. The updated smoothed position at each time step t is computed as

$$\begin{aligned} x(t+1) = (2-\alpha )x(t) + (\alpha -1)x(t-1)+\alpha v(t) \end{aligned}$$

(1)

where $v(t)=\hat{x}(t+1)-x(t)$ and $\hat{x}(t+1)$ is the provisional update for the behavior strategy selected. At each time step t, an independent realization of a selected PMF is generated, defining the strategy the agent will use for the updated position $x(t+1)$ and velocity $v(t+1)$. Placing the agent at the center of a $3\times 3$ grid that is a subset of the larger discrete map, in body coordinates local to the agent’s position, we generate motion by selection of one of the six strategies. At time step t, the $3\times 3$ grid (seen in Fig. 2), with x(t) at its center is positioned so that it aligns with the velocity v(t). On the square grid of the map, this is accomplished by appropriately rounding the coordinate values when this orientation is not orthogonal to the global axes.

The six strategies above define updates for the position of the agent with respect to the $3\times 3$ grid and $x(t+1)$ in global coordinates. The strategies and their respective PMFs are shown in the schematic in Fig. 2. When the individual walks randomly (RW), the chance of moving into any adjacent cells, including its own, is the same. When the agent is route traveling (RT), it checks each of the surrounding cells for a linear feature. Then the updated position is randomly selected (with a uniform probability) from the at most three possible positions in the direction of motion in body coordinates if a linear feature is present. This definition enforces persistence in the direction of motion along a linear feature. If a feature is not present, the agent performs a random walk. When the individual uses direction traveling (DT), the agent only moves forward in body coordinates. The previous direction of travel is taken into account by the orientation of body coordinates parallel to the velocity. When staying put (SP), the only possible update is the agent’s previous location. When view enhancing (VE), the agent checks the elevation for each of the adjacent cells against its own current elevation and selects the cell with the highest altitude. If it is currently at the highest elevation, the agent stays put. Lastly, when backtracking (BT), the previous behavior is first checked to see if it was also BT, and then, if it is not, the updated position is the previous location. If the previous behavior was backtracking, the agent uses the last non-BT position. In this strategy, the agent is following its path backwards to previous positions. In the case that the agent tries to move into an inaccessible area, it stays put for the current time step regardless of the behavior used.

Then, the model is iterated for T time steps to generate the agent’s trajectory.

Behavioral profiles

The lost person model can simulate a multitude of lost person types, including child, hunter, or hiker, to name a few. In order to demonstrate the power of the model, we have chosen to only simulate a hiker lost person because of the large amount of data available and its prevalence in SAR incidents. To determine the behavioral profile for a hiker, we generate all possible permutations of the six behavior strategies as LPT PMFs, with the probability of each behavior as a multiple of $\frac{1}{6}$. Incrementing the probability of each strategy from zero to one by steps of $\frac{1}{6}$ and retaining only the distributions that sum to one, we have a set of 462 LPTs with varying proportions of each behavior. Each of these LPT distributions are simulated for 500 Monte Carlo replicates for each incident. A flowchart detailing the model algorithm is in Fig. 3.

Simulations

In this section, we describe the SAR incident data used to fit the model, we detail simulation parameters, and we define a metric for comparing numerical results.

LP incident data

For the simulation study, we use an augmented data set of lost person incidents from ISRID that includes a wealth of information, like the demographic information about the LP, number of lost people in an incident, region, and find location. The data used in this study comes from a collection of pre-existing lost person incidents and are thus retrospective. In particular, our simulations rely on the initial planning points (IPPs), find locations, and LP types, where the IPPs and find locations are defined as the simulations’ initial conditions and end criteria, respectively. Then, the LP type that minimizes differences with the real find locations is identified. We select a subset of the hiker incidents using exclusion criteria for the find location. Specifically, we exclude locations where the find location is within 1 km of the IPP, outside the simulated map limits, in an inaccessible area as defined by our GIS layers, or within 100 cells of the map boundary. After applying these criteria, we have a set of $N=65$ incidents on which to simulate our model that range across the United States. Each lost person incident is defined with GPS positions for both an IPP and a find location. The dataset used for this study is included in the Supplementary files. As a note, this data appears among the many incidents summarized by the statistics in publicly available datasets^3,12.

For each of the incidents, a $20\,\mathrm {km} \times 20 \,\mathrm{km}$ map is generated with its corresponding layers for elevation, linear features, and inaccessible areas, with the IPP placed at the center as the initial position. The map is discretized into a $3000 \times 3000$ cell grid, where cells are $6.67\,\mathrm {m} \times 6.67\,\mathrm {m}$ squares, and the IPP is at $(x,y)=(1500,1500)$. Figure 1 is an example of one incident’s map, showing all of the linear features and the inaccessible areas.

Simulation parameters

The maps are generated using Python v3.6.8 and the elevation and layer data are derived from ArcGIS services³⁵ using AGS Tools³⁶. The simulations are performed using MATLAB 2020a on the Virginia Tech Advanced Research Computing Cascades cluster³⁷. The complete list of parameters is in Table 1. The simulation lengths are set for $K=100$ hours, where each time step is based on a maximum walking speed of about 1.575 meters per second. This corresponds to $T=850$ time steps per hour, where T is multiplied by the length K to define the lengths of each Monte Carlo replicate. The choice of time step is determined by the average walking speed of a human, which is about 3.5 miles per hour, or 1.56 meters per second³⁸. By simulating for $K=100$ hours, the agent has the opportunity to traverse the entire map. The selection of the smoothing factor, $\alpha$, is to generate trajectories that are qualitatively as smooth as known trajectories from real hikers. Moreover, the value of $\alpha = 0.55$ is selected in particular to avoid cancellation in variables that may happen when a fair weighting of $\alpha =0.5$ is used. We simulate each of the $N=65$ incidents for 462 LPTs with 500 replicates.

Table 1 Simulation parameters.

Full size table

Energy statistic

To explore the validity of the model, we use a statistic to compare the simulated trajectories to the actual find locations for each of the incidents. Energy distance is a metric which quantifies the statistical distance between distributions of random vectors, thereby characterizing equality of distributions^39,40. To test for equal distributions, we consider the null hypothesis that two random variables X and Y have the same probability distributions. For samples from X and Y, $x_{1}, \ldots , x_{n}$ and $y_{1}, \ldots , y_{m},$ respectively, the energy statistic for testing this null hypothesis is defined as

$$\begin{aligned} \mathscr{E}(X, Y):=2 A-B-C \end{aligned}$$

(2)

where A, B, and C are the averages of pairwise distances between the X and Y samples:

$$\begin{aligned} A&=\frac{1}{n m} \sum _{i=1}^{n} \sum _{j=1}^{m}\left\| x_{i}-y_{j}\right\| ,&B&=\frac{1}{n^{2}} \sum _{i=1}^{n} \sum _{j=1}^{n}\left\| x_{i}-x_{j}\right\| ,&C&=\frac{1}{m^{2}} \sum _{i=1}^{m} \sum _{j=1}^{m}\left\| y_{i}-y_{j}\right\| . \end{aligned}$$

(3)

In our case, for each incident, we find the closest point of each replicate that minimizes the distance from the simulated trajectory to the actual find location and define this as realizations of X for each of the 462 behavior distributions. Y is the actual find location given from the incident dataset, which is a single GPS point. In other words, $n=500$ and $m=1$. We calculate the pairwise distances between $X-Y$ and X to find A and B, and thus the energy statistic. Since Y only has a single realization, the value for C is zero.

We define the best behavior distribution for incident i as the LPT with the lowest energy statistic, giving us a top behavioral profile, $p_i$. For example, the behavioral profile for the nineteenth incident is $p_{19}=\left[\frac{1}{6},0,\frac{5}{6},0,0,0\right]$ which is a normalized distribution of the six behavior strategies [RW, RT, DT, SP, VE, BT]. As a way to synthesize these profiles across all the incidents, we assign a weight $w_i$ to each probability distribution $p_i$ that is a power of the inverse of energy per unit distance, defined as

$$\begin{aligned} w_i = \bigg (\frac{d_i}{\mathscr{E}_i}\bigg )^L \end{aligned}$$

(4)

for $i=1,\ldots , N$. This quantity is based on the incident’s energy statistic $\mathscr{E}_i$, the distance $d_i$ measured between the IPP and the actual find location, and a characteristic length parameter L. We select $L=\frac{1}{2}$ to more heavily represent the better fitting distributions. In Fig. 4, the best fit behaviors p for each incident are shown in the top subfigure with the corresponding energy statistic $\mathscr{E}$ and weight w in the second and third subfigures, respectively. The incidents have been sorted by their weight in Eq. (4), from highest to lowest.

To give a better picture of how well a behavior fits an incident based on the corresponding energy distance, Fig. 5 shows incidents corresponding to the highest and lowest weights in Fig. 4. Each subfigure shows the closest points of the replicates’ trajectories for the first and tenth best fitting LPTs, the IPP, and find location. We expect to see the points from the first best LPT p (black dots) concentrated closer to the find location than the tenth best LPT (grey dots). The left plot of Fig. 5 is the highest weighted incident, and the distribution of points for p is closer to the find location than all other LPTs, including the other shown in the plot. On the other hand, the right plot of Fig. 5 is the lowest weighted incident and, while it can be seen that p is still the closest to the find location in comparison to the tenth, it is noticeable that the find location is substantially further from the IPP than in the left plot. It may be tempting to believe that the incidents with the shortest d would always be weighted higher, but this is not always the case. In Fig. 6, the distances and weights for all incidents are shown, and while there is a correlation between higher weights and shorter distances, deviations from this trend demonstrate the fact that the weight is defined inversely to the energy per unit distance and not merely energy.

Results

Average behavioral profile for a hiker

Once the best behavioral profile and associated weight has been identified for each incident, we can combine them to find the average behavioral profile for a hiker. Using these best fits, we compute the weighted average

$$\begin{aligned} p_{\mathrm {hiker}} = \frac{w_1 {p}_1+w_2 {p}_2+\cdots +w_N {p}_N}{|w_1 {p}_1+w_2 {p}_2+\cdots +w_N {p}_N|} \end{aligned}$$

(5)

where the distribution is normalized to sum to one and the weights are defined as in Eq. (4) for the $N=65$ incidents. The resulting distribution $p_{\mathrm {hiker}}$ corresponds to a behavioral profile of [RW, RT, DT, SP, VE, BT] $=[0.055, 0.377, 0.559, 0.003, 0.006, 0]$, and is shown in Fig. 7.

Evaluation of fit by cross validation

This average behavioral hiker profile can be evaluated using statistical measures to determine the goodness of the fit with the experimental data. We evaluate the method of fitting the model using a statistical analysis called leave-one-out cross-validation (LOOCV)^41,42,43. To perform cross validation, we partition all of the samples into two subsets, apply a numerical fitting predictor to one subset (the training set), and then assess its prediction performance by comparison against the other subset (the test set)⁴⁴. In LOOCV, for a dataset consisting of k samples, a single observation is left out as the test set and the remaining $k-1$ observations make up the training set. For the evaluation of the lost person model, this means that the model is fit with a behavioral profile N times, each time leaving out one incident in the training set.

First, for each of the N incidents, we train a weighted average behavioral profile using the method in the previous section on the subset of $N-1$ incidents (training set), where the left out incident is the test set. Second, the model is simulated for 500 Monte Carlo replicates for each of the N incidents using the corresponding trained behavioral profile as its behavior (i.e. 1 LPT instead of the original 462). The output of the LOOCV is N energy statistic values that can be compared to the original energy statistics.

To show the validity of the model fitting, the energy statistics from the LOOCV would need to be statistically “better” than the original energy statistics from the “untrained” data. To see this, we calculate the percentile of the LOOCV energy statistics among the 462 original energy statistics. The results show that $58.5\%$ of the N IPPs have energy statistic values above the 95th percentile, and $98.5\%$ above the 50th percentile. This means that the average behavioral profile captures very well the data from more than half of the SAR incidents on which it was not trained and works well across all the incidents in general.

Discussion

The model in this paper is designed to capture lost person dynamics, based on behavior heuristics, over a variety of landscapes using a real-world dataset. Based on the goodness of the leave-one-out analysis, the average behavioral profile represents a hiker well on a variety of landscapes. Specifically, the weighted average behavioral profile offers a very good fit to more than half of the incidents and it performs better than most behavioral profiles for almost all incidents. We stress that this analysis tests the ability of the model to predict data on which it was not trained. Thus, the average behavioral profile provides a description of the LP behavior which no longer depends on any particular landscape features. This suggests that such a profile, which also could be similarly generated for LPs in other categories like children or people with dementia, could be tapped during active searches to predict areas where a lost person is more likely to be found.

Relating our findings to the real world, the values of the average behavioral profile $p_\mathrm {hiker}$ are consistent with what we know about hiker behavior. Hikers are likely to use behavior strategies that allow them to travel further distances and are often influenced by the surrounding landscape³⁰. It has been seen that a third of lost hikers will move to a higher elevation to improve their view¹¹, but many would prefer a clear path like a trail versus a steep one²⁴. Our model hiker profile reflects some of these common behaviors. The best fit profile contains over $50\%$ direction traveling, which is the behavior that causes the agent to travel the furthest among the rest. The second most common behavior is route traveling, meaning the agent is utilizing the linear features of the landscape to navigate the terrain. Because of the somewhat serpentine trajectories by the simulated agents, we are not surprised to see that the most used strategies are route traveling and direction traveling. Out of all six behaviors, these strategies allow the agent to traverse the furthest on the map in order to reach the find locations.

Beyond the average hiker profile, the model results generate an effective agent walking speed by comparison of the time of the closest point for each p and the distance d from IPP to find location. In Fig. 8, the mean time (averaged over replicates) versus distance is shown for each incident. We notice that the data is dense and seems to be linearly increasing for distances less than 4 km; for distances greater than 4 km, the data is sparser and seems relatively constant. With these two regimes in mind, we fit the data with incidents with d less than 4 km using the MATLAB Curve Fitting Toolbox, and find a linear fit, $t=4.31d+13.31$, with $R^2=0.4383$. The slope of the line is used to compute the resulting effective speed of the LP model, which is $0.064\,\mathrm {m/s}$. Note that, compared to the maximum walking speed of $1.575\,\mathrm {m/s}$ which is biomechanically realistic for pedestrians⁴⁵, the effective speed is very slow. This is not entirely surprising, due to the convoluted nature of the simulated agent’s paths, and we hypothesize that the speed would increase by including more memory into each updated time step by way of inertia. The relatively constant time to closest point for incidents with larger values of d suggests that the underlying dataset captures fatigue in the lost person. Since the time values saturate at approximately 35 hours, we expect that there is a maximum amount of time that a lost person remains mobile. This idea is supported by the ISRID, and many models of mobility attempt to estimate the distance traveled based on the time and energy it takes to navigate a terrain. Specifically, Tobler estimated travel speed to be a function of pedestrian movement and the slope of the landscape⁴⁶. These cost-distance methods used in conjunction with the impact of land cover impedance have been shown to provide guidance in determining probability of area, but ultimately, they neglect the behavior of a lost person⁴⁷. We hope to explore mobility in expansions of this model.

Although these results give a strong indication that the methods we use generate a good representation of lost person behavior, limitations arise from using recorded SAR incident data from ISRID. First of all, lost person data is difficult to attain as many incidents go unreported, since most of the people who get lost end up reorienting themselves. This means that certain types of lost people may be over-represented in the data. In the incidents that are recorded, actual trajectories of the LP are rare since the LP is unlikely to have a tracking device. By having real trajectories, we could fit the model to these tracks instead of just the IPP and find locations. The incidents in the dataset also lack a sense of time. In Lost Person Behavior³, the LP categories have a statistic called mobility to describe the amount of time an LP was mobile, but this variable is often artificially shorter than in reality since many LPs are still mobile when found. Because of this, we decided to run the model for an extended time and use the closest point in the trajectory for fitting in our analysis. The lack of time in the incident data also means that the map layers we use may not always reflect the same terrain as at the time of the incident. Furthermore, the model neglects land cover and vision restriction, which are both known to affect navigation as well as walking speeds⁴⁷. Lastly, profiles for other LPTs have sparser data than hikers, which limits the implementation of the simulation and fitting method described in this paper.

Conclusions

This study describes the first dynamic model of lost person behavior that takes into account known reorientation strategies and is informed by data from real SAR incidents. This model is unique from others, not only for the use of SAR incident data, but also for incorporating both landscape features and behavior strategies. Our results show, through a leave-one-out analysis, that the average behavior profile we generate for a hiker captures well the features of SAR incidents on which it was not trained. The model here provides a solid foundation for future work. We can imagine many expansions, including adding more types of LPs and implementing a more realistic effective speed of the agent. By incorporating memory of more than one previous time step, the trajectories would be smoother and perhaps more realistic to a walking individual. Furthermore, instead of a constant walking speed, we can introduce a variable speed depending on the terrain and LPT. Another aspect to consider in future work is the specific demographics of the LP, including age, sex, and whether the agent was alone or in a group. Weather also plays a large part in where an LP will be found. This work offers the first step in defining dynamic models for lost people which may be tailored to facilitate current state-of-the-art SAR efforts.

Data availability

Data generated for this study are included as Supplementary Information files.

References

Federal Bureau of Investigation (FBI). 2018 NCIC Missing Person and Unidentified Person Statistics. Tech. Rep., National Crime Information Center (2018).
Hill, K. The psychology of lost. Lost Pers. Behav. 116, 1–15 (1998).
Google Scholar
Koester, R. J. Lost Person Behavior: A Search and Rescue Guide on Where to Look—for Land, Air and Water (dbS Productions LLC, 2008).
Hill, K. A. Cognition in the woods: Biases in probability judgments by search and rescue planners. Judgm. Decis. Mak. 7, 488–498 (2012).
Google Scholar
Syrotuck, W. Analysis of lost person behavior. Barkleigh Productions. Inc., Mechanicsburg, PA. NASAR version (2000).
Heth, C. D. & Cornell, E. H. Characteristics of travel by persons lost in Albertan wilderness areas. J. Environ. Psychol. 18, 223–35 (1998).
Article Google Scholar
Cornell, E. H., Heth, C. D. & Broda, L. S. Children’s wayfinding: Response to instructions to use environmental landmarks. Dev. Psychol. 25, 755–764. https://doi.org/10.1037/0012-1649.25.5.755 (1989).
Article Google Scholar
Cornell, E. H., Heth, C. D. & Alberts, D. M. Place recognition and way finding by children and adults. Mem. Cognit. 22, 633–643. https://doi.org/10.3758/BF03209249 (1994).
Article CAS PubMed Google Scholar
Koester, R. J., Cooper, D., Frost, J. & Robe, R. Sweep width estimation for ground search and rescue. United States Coast Guard National Search and Rescue Committee 245 (2004).
Adams, A. L. et al. Search is a time-critical event: When search and rescue missions may become futile. Wilderness Environ. Med. 18, 95–101. https://doi.org/10.1580/06-WEME-OR-035R1.1 (2007).
Article PubMed Google Scholar
Koester, R. International Search and Rescue Incident Database (ISRID) (accessed 04 October 2019); http://www.dbs-sar.com/ (2008).
Koester, R. J. Enhancements to statistical probability of area models based upon updated ISRID data collection for Autistic Spectrum Disorders and Typically Developing children. J. Search Rescue 4, 65–83 (2020).
ADS Google Scholar
Sava, E., Twardy, C., Koester, R. & Sonwalkar, M. Evaluating lost person behavior models. Trans. GIS 20, 38–53 (2016).
Article Google Scholar
Seyfried, A., Schadschneider, A., Kemloh, U. & Chraibi, M. Force-based models of pedestrian dynamics. Netw. Heterog. Media 6, 425–442. https://doi.org/10.3934/nhm.2011.6.425 (2011).
Article MathSciNet MATH Google Scholar
Moussaïd, M., Helbing, D. & Theraulaz, G. How simple rules determine pedestrian behavior and crowd disasters. Proc. Natl. Acad. Sci. 108, 6884–6888 (2011).
Article ADS Google Scholar
Fuchs, A. & Jirsa, V. K. Coordination: Neural, Behavioral and Social Dynamics (Springer, 2007).
Karamouzas, I., Skinner, B. & Guy, S. J. Universal power law governing pedestrian interactions. Phys. Rev. Lett. 113, 238701 (2014).
Article ADS CAS Google Scholar
Seyfried, A., Steffen, B. & Lippert, T. Basics of modelling the pedestrian flow. Phys. A Stat. Mech. Appl. 368, 232–238 (2006).
Article Google Scholar
Garcimartín, A., Pastor, J. M., Martín-Gómez, C., Parisi, D. & Zuriguel, I. Pedestrian collective motion in competitive room evacuation. Sci. Rep. 7, 1–9 (2017).
Article Google Scholar
Bonabeau, E. Agent-based modeling: Methods and techniques for simulating human systems. Proc. Natl. Acad. Sci. 99, 7280–7287 (2002).
Article ADS CAS Google Scholar
Takahashi, T., Tadokoro, S., Ohta, M. & Ito, N. Agent based approach in disaster rescue simulation - From test-bed of multiagent system to practical application. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 2377 LNAI, 102–111, https://doi.org/10.1007/3-540-45603-1_11 (Springer, 2002).
Azimi, S., Delavar, M. R. & Rajabifard, A. Developing a multi-agent based modeling for smart search and rescue operation. In Lecture Notes in Geoinformation and Cartography, 133–157, https://doi.org/10.1007/978-3-030-05330-7_6 (Springer, 2019).
Kratzke, T. M., Stone, L. D. & Frost, J. R. Search and rescue optimal planning system. In 2010 13th International Conference on Information Fusion, 1–8, https://doi.org/10.1109/ICIF.2010.5712114 (IEEE, 2010).
Doke, J. Analysis of search incidents and lost person behavior in Yosemite National Park. Master’s thesis, University of Kansas (2012).
McDaniel, M. D. Agent-based modeling of lost person wayfinding. Master’s thesis, University of California Santa Barbara (2010).
Morelle, K., Buchecker, M., Kienast, F. & Tobias, S. Nearby outdoor recreation modelling: An agent-based approach. Urban For. Urban Green. 40, 286–298 (2019).
Article Google Scholar
Lin, L. & Goodrich, M. A. A Bayesian approach to modeling lost person behaviors based on terrain features in wilderness search and rescue. Comput. Math. Organ. Theory 16, 300–323 (2010).
Article Google Scholar
Mohibullah, W. Agent-based lost person movement modelling, prediction and search in wilderness. Ph.D. thesis, UCL (University College London) (2017).
Mohibullah, W. & Julier, S. J. Developing an agent model of a missing person in the wilderness. Proceedings—2013 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2013 4462–4469, https://doi.org/10.1109/SMC.2013.759 (2013).
Alanis, J. et al. Topography and behavior based movement modeling for missing hikers in land-wilderness settings. Tech. Rep., Quantitative Research for the Life and Social Sciences Program, Arizona State University (2019).
Šerić, L., Pinjušić, T., Topić, K. & Blažević, T. Lost person search area prediction based on regression and transfer learning models. ISPRS Int. J. Geo-Inf. 10, 80. https://doi.org/10.3390/ijgi10020080 (2021).
Article Google Scholar
Metcalfe, M. A probability-based lost person simulation for wilderness search and rescue operations. Bachelor’s thesis, Boston University (2012).
Hashimoto, A. & Abaid, N. An agent-based model of lost person dynamics for enabling wilderness search and rescue. In Dynamic Systems and Control Conference, vol. 59155, V002T13A005, https://doi.org/10.1115/DSCC2019-9222 (American Society of Mechanical Engineers, 2019).
Canny, J. A computational approach to edge detection. IEEE Transactions on Pattern Analysis and Machine IntelligencePAMI-8, 679–698, https://doi.org/10.1109/TPAMI.1986.4767851 (1986).
U.S. Geological Survey. USGS TNM National Hydrography Dataset (accessed 01 March 2021); https://hydro.nationalmap.gov/arcgis/rest/services (2021).
Heintzman, L. ARCGIS Tools (accessed 01 March 2021); https://git.caslab.ece.vt.edu/hlarkin3/ags_grabber (2021).
Virginia Tech Advanced Research Computing (ARC). Cascades Cluster (accessed 28 March 2021); https://198.82.212.30 (2021).
Browning, R. C., Baker, E. A., Herron, J. A. & Kram, R. Effects of obesity and sex on the energetic cost and preferred speed of walking. J. Appl. Physiol. 100, 390–398 (2006).
Article Google Scholar
Székely, G. J. & Rizzo, M. L. Energy statistics: A class of statistics based on distances. J. Stat. Plan. Inference 143, 1249–1272. https://doi.org/10.1016/j.jspi.2013.03.018 (2013).
Article MathSciNet MATH Google Scholar
Rizzo, M. L. & Székely, G. J. Energy distance. Wiley Interdiscip. Rev. Comput. Stat. 8, 27–38. https://doi.org/10.1002/wics.1375 (2016).
Article MathSciNet Google Scholar
Lachenbruch, P. A. & RayMickey, M. Estimation of error rates in discriminant analysis. Technometrics 10, 1–11. https://doi.org/10.1080/00401706.1968.10490530 (1968).
Article MathSciNet Google Scholar
Geisser, S. The predictive sample reuse method with applications. J. Am. Stat. Assoc. 70, 320–328. https://doi.org/10.1080/01621459.1975.10479865 (1975).
Article MATH Google Scholar
Drehmer, D. E. & Morris, G. W. Cross-validation with small samples: An algorithm for computing Gollob’s estimator. Educ. Psychol. Meas. 41, 195–200. https://doi.org/10.1177/001316448104100120 (1981).
Article Google Scholar
Stone, M. Cross-validatory choice and assessment of statistical predictions. J. R. Stat. Soc. Ser. B (Methodol.) 36, 111–133. https://doi.org/10.1111/j.2517-6161.1974.tb00994.x (1974).
Article MathSciNet MATH Google Scholar
Bohannon, R. W. Comfortable and maximum walking speed of adults aged 20–79 years: Reference values and determinants. Age Ageing 26, 15–19 (1997).
Article CAS Google Scholar
Tobler, W. Non-isotropic geographic modeling (University of California, Santa Barbara, Three presentations on geographic analysis and modeling. Tech. Rep., 1993).
Doherty, P. J., Guo, Q., Doke, J. & Ferguson, D. An analysis of probability of area techniques for missing persons in Yosemite National Park. Appl. Geogr. 47, 99–110. https://doi.org/10.1016/j.apgeog.2013.11.001 (2014).
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank Tianzi Wang for her assistance in generating the maps, Advanced Research Computing at Virginia Tech for providing computational resources that have contributed to the results reported within this paper, and the Graduate School at Virginia Tech for support through a Cunningham Doctoral Scholarship to A.H. This work is supported by the National Science Foundation under Grant CNS-1830414.

Author information

Authors and Affiliations

Engineering Mechanics Program, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA
Amanda Hashimoto
Department of Electrical and Computer Engineering, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA
Larkin Heintzman
dbS Productions LLC, Charlottesville, VA, 22911, USA
Robert Koester
School of the Environment, Geography and Geosciences, University of Portsmouth, Portsmouth, P01 2UP, UK
Robert Koester
Department of Mathematics, Virginia Polytechnic Institute and State University, Blacksburg, VA, 24061, USA
Nicole Abaid

Authors

Amanda Hashimoto
View author publications
Search author on:PubMed Google Scholar
Larkin Heintzman
View author publications
Search author on:PubMed Google Scholar
Robert Koester
View author publications
Search author on:PubMed Google Scholar
Nicole Abaid
View author publications
Search author on:PubMed Google Scholar

Contributions

Conceptualization: A.H. and N.A., Data curation: A.H., L.H., and R.K., Formal analysis: A.H. and N.A., Funding acquisition: N.A., Methodology: A.H. and N.A., Software: A.H. and L.H., Visualization: A.H., Writing - original draft: A.H., Writing - review and editing: A.H. and N.A. All authors reviewed the manuscript.

Corresponding author

Correspondence to Nicole Abaid.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Hashimoto, A., Heintzman, L., Koester, R. et al. An agent-based model reveals lost person behavior based on data from wilderness search and rescue. Sci Rep 12, 5873 (2022). https://doi.org/10.1038/s41598-022-09502-4

Download citation

Received: 09 June 2021
Accepted: 16 March 2022
Published: 07 April 2022
Version of record: 07 April 2022
DOI: https://doi.org/10.1038/s41598-022-09502-4

This article is cited by

An empirical study on the survivability of trapped miners in underground mines post-accident – a global analysis
- Philani Larrance Ngwenyama
- Ronald C. W. Webber-Youngman
Safety in Extreme Environments (2025)
A Probabilistic Formulation of Problem of Optimal Search for a Lost Person
- Petr Volf
Operations Research Forum (2025)
Robot movement based inference of teleoperator state for multi-robot search
- Arunim Bhattacharya
- Sachit Butail
Complex & Intelligent Systems (2025)