Abstract
Large language models (LLMs) demonstrate strong reasoning and planning capabilities in static textual contexts, yet they struggle with dynamic decision-making tasks involving spatial elements, such as point selection in military simulations. These limitations stem from their reduced capacity to integrate real-time geographic data and adapt to changing spatial conditions, which can lead to critical positioning errors. Such deficiencies may result in missed tactical opportunities, increased vulnerability, and diminished overall effectiveness in combat scenarios. To mitigate these issues, this paper presents Geo-Commander, an innovative multi-task agent framework for combat simulations that integrates the ReAct reasoning mechanism with spatial encoding. The framework's Geo-Choice module employs hexagonal grid encoding for preliminary location screening, enabling the agent to establish spatial constraints early in the decision-making process, and its ReAct chain incorporates detailed geographic insights into the reasoning loop, yielding interpretable point-selection decisions. We validate the framework through experiments that reveal substantial performance improvements in both static point selection and real-time dynamic command tasks within a tank detachment combat simulation environment. Results indicate that Geo-Commander consistently surpasses control groups across metrics including selection quality, win rate, and overall combat effectiveness. These findings highlight the framework's potential to meet the demands of dynamic combat environments, confirming the feasibility of integrating spatial reasoning into LLM frameworks and opening avenues for multi-agent geospatial intelligence systems and battlefield decision-making support.
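To make the hexagonal screening idea concrete, the following Python sketch illustrates how a Geo-Choice-style pre-filter might encode candidate positions on an axial hex grid and screen them before handing a short list to the ReAct reasoning loop. The axial (q, r) encoding, the cover and elevation attributes, and the screen_candidates function are illustrative assumptions for exposition, not the paper's actual implementation.

# Minimal sketch of hexagonal-grid candidate screening in the spirit of the
# Geo-Choice module. The axial (q, r) encoding and the Hex attributes below
# are assumptions made for illustration, not the paper's API.
from dataclasses import dataclass

@dataclass(frozen=True)
class Hex:
    q: int            # axial column
    r: int            # axial row
    cover: float      # concealment score in [0, 1] (assumed attribute)
    elevation: float  # metres above sea level (assumed attribute)

def hex_distance(a: Hex, b: Hex) -> int:
    """Grid distance between two hexes in axial coordinates."""
    return (abs(a.q - b.q) + abs(a.r - b.r) + abs(a.q + a.r - b.q - b.r)) // 2

def screen_candidates(cells, objective: Hex, max_range: int, min_cover: float):
    """Preliminary screening: keep hexes within range of the objective that
    offer enough cover, ranked by cover and then by proximity. The surviving
    short list would be passed to the ReAct loop for final point selection."""
    viable = [c for c in cells
              if hex_distance(c, objective) <= max_range and c.cover >= min_cover]
    return sorted(viable, key=lambda c: (-c.cover, hex_distance(c, objective)))

# Example: shortlist hexes within 5 cells of the objective offering cover >= 0.6
# shortlist = screen_candidates(map_cells, objective_hex, max_range=5, min_cover=0.6)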
Data availability
The data that support the findings of this study are openly available in ScienceDB at https://doi.org/10.57760/sciencedb.32513, reference number 24.
References
Yao, S. et al. ReAct: Synergizing reasoning and acting in language models. In 11th International Conference on Learning Representations (ICLR, 2023).
Shinn, N. et al. Reflexion: Language agents with verbal reinforcement learning. In 37th Conference on Neural Information Processing Systems (NeurIPS, 2023).
Wang, G. et al. Voyager: An open-ended embodied agent with large language models. arXiv (2023).
Jafarnejad, S. MapLLM: A blueprint for improving geospatial reasoning in LLMs. (2025).
Cheng, A. et al. SpatialRGPT: Grounded spatial reasoning in vision-language models. In 38th Conference on Neural Information Processing Systems (NeurIPS, 2024).
Headquarters, Department of the Army. FM 3-34.230: Topographic Operations. (2000).
Goecks, V. G. & Waytowich, N. COA-GPT: Generative pre-trained transformers for accelerated course of action development in military operations. In 2024 International Conference on Military Communication and Information Systems (ICMCIS, 2024).
Cao, X. et al. MAPLM: A real-world large-scale vision-language benchmark for map and traffic scene understanding. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR, 2024).
Zhang, Y. et al. GeoGPT: understanding and processing geospatial tasks through an autonomous GPT. arXiv, (2023).
Chen, B. et al. SpatialVLM: Endowing vision-language models with spatial reasoning capabilities. In 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR, 2024).
Huang, C. et al. Visual language maps for robot navigation. In 2023 IEEE International Conference on Robotics and Automation (ICRA, 2023).
Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529 (7587), 484–489 (2016).
Wu, D. J. Accelerating self-play learning in Go. arXiv (2019).
Birch, C. P. D., Oom, S. P. & Beecham, J. A. Rectangular and hexagonal grids used for observation, experiment and simulation in ecology. Ecol. Model. 206 (3–4), 347–359 (2007).
Tang, F., Zhang, X., You, X. et al. Design method of tactical level hexagonal wargame map. J. Syst. Simul. 31 (5), 869–878 (2019).
Srivastava, V. et al. MapIQ: Benchmarking multimodal large language models for map question answering. (2025).
Rai, P. S. et al. Light-MLLMAD: A lightweight multimodal large language model for one-shot industrial visual anomaly detection. (2025).
Ruan, J. et al. MME-SCI: A comprehensive and challenging science benchmark for multimodal large language models. (2025).
Headquarters, Department of the Army. FM 3-21.8: The Infantry Rifle Platoon and Squad. (2007).
Headquarters, Department of the Army. FM 3-06.11: Camouflage, Concealment, and Cover. (2002).
Headquarters, Department of the Army. FM 3-05.222/TC 23-14: Sniper Training and Employment. (2003).
Zhou, Z. & Zhao, H. Design and implementation of B/S architecture-based wargaming system. J. Equip. Acad. 27 (2), 68–72 (2016).
Yin, Q. et al. Intelligent decision-making technologies and challenges in wargaming. Acta Automatica Sinica 49 (5), 913–928 (2023).
Chen, Y. Experimental data of Geo-Commander [DS/OL]. Science Data Bank (2025). https://doi.org/10.57760/sciencedb.32513
Acknowledgements
We would like to express our sincere gratitude to the military experts who contributed their professional knowledge and time to this study. Specifically, we thank the Army Commander, the Army Staff Officer, and the two wargaming experts for their in-depth analysis of the combat simulation scenarios and for their crucial role in developing the grid point quality rating table through discussion. Their expertise ensured the tactical relevance and validity of our experimental evaluation metrics.
Author information
Authors and Affiliations
Contributions
Conceptualization, Yibo Chen; methodology, Caleb Jojo; software, Yibo Chen; validation, Shuhang Zhou; formal analysis, Shuhang Zhou; investigation, Caleb Jojo; resources, Yang Ping; data curation, Shuhang Zhou; writing—original draft preparation, Yibo Chen; writing—review and editing, Yang Ping; visualization, Caleb Jojo; supervision, Yang Ping; project administration, Yang Ping; funding acquisition, Yang Ping. All authors agree to be accountable for all aspects of the work.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Informed Consent
All participants (the four military experts) involved in this study were adults. Before participating, all individuals were fully informed about the purpose of the study, the procedures involved, and how their input would be used, and written informed consent was obtained from each of them.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.
About this article
Cite this article
Chen, Yb., Ping, Y., Zhou, S. et al. A framework of large language model commander agent for spatial reasoning in combat simulation. Sci Rep (2026). https://doi.org/10.1038/s41598-026-43365-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-43365-3