A Neuro-Symbolic GeoAI Framework for Extraction of Travel Routes From Unstructured Texts
Author/Creator
Date
Type of Work
Department
Program
Citation of Original Publication
Karabatis, Saydeh, Vandana P. Janeja, and Zhiyuan Chen. “A Neuro-Symbolic GeoAI Framework for Extraction of Travel Routes From Unstructured Texts.” Transactions in GIS 29, no. 7 (2025): e70130. https://doi.org/10.1111/tgis.70130.
Rights
This is the peer reviewed version of the following article: Karabatis, Saydeh, Vandana P. Janeja, and Zhiyuan Chen. “A Neuro-Symbolic GeoAI Framework for Extraction of Travel Routes From Unstructured Texts.” Transactions in GIS 29, no. 7 (2025): e70130. https://doi.org/10.1111/tgis.70130. , which has been published in final form at https://onlinelibrary.wiley.com/doi/10.1111/tgis.70130?msockid=025230b9c1936da439b7257ec0946c03 . This article may be used for non-commercial purposes in accordance with Wiley Terms and Conditions for Use of Self-Archived Versions. This article may not be enhanced, enriched or otherwise transformed into a derivative work, without express permission from Wiley or by statutory rights under applicable legislation. Copyright notices must not be removed, obscured or modified. The article must be linked to Wiley’s version of record on Wiley Online Library and any embedding, framing or otherwise making available the article or pages thereof by third parties from platforms, services and websites other than Wiley Online Library must be prohibited.
Abstract
Unstructured text, such as narratives describing the movement of people, can reveal valuable spatial information that is used to generate the routes people take. However, the lack of precision and the ambiguity of spatial information in these narratives create a significant problem in generating such routes. Existing work uses either traditional natural language processing (NLP) techniques or more recent large language models (LLMs) to extract relevant spatial information. However, traditional NLP techniques do not capture the contextual information in the text, and LLMs are often trained on data with insufficient coverage of developing countries, resulting in incomplete spatial information. This paper proposes a novel neuro-symbolic GeoAI framework called Narratives as Geographical Routes (NaR) to automatically extract and visualize geospatial routes from unstructured text and resolve spatial data quality issues in these texts. NaR extracts geographical information from narratives, identifies the toponyms, lists them in temporal order, resolves possible ambiguities, assigns their precise coordinates, and finally depicts the spatial routes on a map. This is achieved through the use of (1) retrieval augmented generation (RAG) techniques that leverage the geographical domain knowledge extracted from NLP techniques in conjunction with a gazetteer to improve the results of LLMs for toponym identification and temporal listing, and (2) a neuro-symbolic framework that uses symbolic reasoning to resolve toponym ambiguity. Experimental evaluation of our framework indicates that NaR outperforms other existing methods.
