Agentic RAG for Maritime AIoT: Natural Language Access to Structured Data
- PMID: 41755167
- PMCID: PMC12943969
- DOI: 10.3390/s26041227
Agentic RAG for Maritime AIoT: Natural Language Access to Structured Data
Abstract
Maritime operations are increasingly reliant on sensor data to drive efficiency and enhance decision-making. However, despite rapid advances in large language models, including expanded context windows and stronger generative capabilities, critical industrial settings still require secure, role-constrained access to enterprise data and explicit limitation of model context. Retrieval-Augmented Generation (RAG) remains essential to enforce data minimization, preserve privacy, support verifiability, and meet regulatory obligations by retrieving only permissioned, provenance-tracked slices of information at query time. However, current RAG solutions lack robust validation protocols for numerical accuracy for high-stakes industrial applications. This paper introduces Lighthouse Bot, a novel Agentic RAG system specifically designed to provide natural-language access to complex maritime sensor data, including time-series and relational sensor data. The system addresses a critical need for verifiable autonomous data analysis within the Artificial Intelligence of Things (AIoT) domain, which we explore through a case study on optimizing ferry operations. We present a detailed architecture that integrates a Large Language Model with a specialized database and coding agents to transform natural language into executable tasks, enabling core AIoT capabilities such as generating Python code for time-series analysis, executing complex SQL queries on relational sensor databases, and automating workflows, while keeping sensitive data outside the prompt and ensuring auditable, policy-aligned tool use. To evaluate performance, we designed a test suite of 24 questions with ground-truth answers, categorized by query complexity (simple, moderate, complex) and data interaction type (retrieval, aggregation, analysis). Our results show robust, controlled data access with high factual fidelity: the proprietary Claude 3.7 achieved close to 90% overall factual correctness, while the open-source Qwen 72B achieved 66% overall and 99% on simple retrieval and aggregation queries. These findings underscore the need for a secure limited-context RAG in maritime AIoT and the potential for cost-effective automation of routine exploratory analyses.
Keywords: GenAI; IoT; LLMs; RAG; maritime industry; sensor data.
Conflict of interest statement
The authors declare no conflicts of interest.
Figures
References
-
- Raza Z., Woxenius J., Vural C.A., Lind M. Digital transformation of maritime logistics: Exploring trends in the liner shipping segment. Comput. Ind. 2023;145:103811. doi: 10.1016/j.compind.2022.103811. - DOI
-
- DNV. Singapore Maritime Foundation The Future of Seafarers 2030: A Decade of Transformation. 2023. [(accessed on 1 November 2025)]. Available online: https://www.smf.com.sg/wp-content/uploads/2023/04/The-Future-of-Seafarer....
-
- Zeng F., Chen A., Xu S., Chan H.K., Li Y. Digitalization in the Maritime Logistics Industry: A Systematic Literature Review of Enablers and Barriers. J. Mar. Sci. Eng. 2025;13:797. doi: 10.3390/jmse13040797. - DOI
-
- Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A.N., Kaiser Ł., Polosukhin I. Attention is all you need. arXiv. 2017 doi: 10.48550/arXiv.1706.03762.1706.03762 - DOI
-
- Brown T., Mann B., Ryder N., Subbiah M., Kaplan J.D., Dhariwal P., Neelakantan A., Shyam P., Sastry G., Askell A., et al. Language Models are Few-Shot Learners. In: Larochelle H., Ranzato M., Hadsell R., Balcan M., Lin H., editors. Proceedings of the Advances in Neural Information Processing Systems. Volume 33. Curran Associates, Inc.; Red Hook, NY, USA: 2020. [(accessed on 1 November 2025)]. pp. 1877–1901. Available online: https://proceedings.neurips.cc/paper_files/paper/2020/file/1457c0d6bfcb4....
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
