Agentic RAG for Maritime AIoT: Natural Language Access to Structured Data

Oxana Sachenkova¹, Melker Andreasson¹, Dongzhu Tan¹, Alisa Lincke²

Affiliations

¹ Computer Science and Media Technology Department, Linnaeus University, SE-391 31 Kalmar, Sweden.
² Computer Science and Media Technology Department, Linnaeus University, SE-352 53 Växjö, Sweden.

PMID: 41755167
DOI: 10.3390/s26041227

Agentic RAG for Maritime AIoT: Natural Language Access to Structured Data

Oxana Sachenkova et al. Sensors (Basel). 2026.

. 2026 Feb 13;26(4):1227.

doi: 10.3390/s26041227.

Authors

Oxana Sachenkova¹, Melker Andreasson¹, Dongzhu Tan¹, Alisa Lincke²

Affiliations

¹ Computer Science and Media Technology Department, Linnaeus University, SE-391 31 Kalmar, Sweden.
² Computer Science and Media Technology Department, Linnaeus University, SE-352 53 Växjö, Sweden.

PMID: 41755167
DOI: 10.3390/s26041227

Abstract

Maritime operations are increasingly reliant on sensor data to drive efficiency and enhance decision-making. However, despite rapid advances in large language models, including expanded context windows and stronger generative capabilities, critical industrial settings still require secure, role-constrained access to enterprise data and explicit limitation of model context. Retrieval-Augmented Generation (RAG) remains essential to enforce data minimization, preserve privacy, support verifiability, and meet regulatory obligations by retrieving only permissioned, provenance-tracked slices of information at query time. However, current RAG solutions lack robust validation protocols for numerical accuracy for high-stakes industrial applications. This paper introduces Lighthouse Bot, a novel Agentic RAG system specifically designed to provide natural-language access to complex maritime sensor data, including time-series and relational sensor data. The system addresses a critical need for verifiable autonomous data analysis within the Artificial Intelligence of Things (AIoT) domain, which we explore through a case study on optimizing ferry operations. We present a detailed architecture that integrates a Large Language Model with a specialized database and coding agents to transform natural language into executable tasks, enabling core AIoT capabilities such as generating Python code for time-series analysis, executing complex SQL queries on relational sensor databases, and automating workflows, while keeping sensitive data outside the prompt and ensuring auditable, policy-aligned tool use. To evaluate performance, we designed a test suite of 24 questions with ground-truth answers, categorized by query complexity (simple, moderate, complex) and data interaction type (retrieval, aggregation, analysis). Our results show robust, controlled data access with high factual fidelity: the proprietary Claude 3.7 achieved close to 90% overall factual correctness, while the open-source Qwen 72B achieved 66% overall and 99% on simple retrieval and aggregation queries. These findings underscore the need for a secure limited-context RAG in maritime AIoT and the potential for cost-effective automation of routine exploratory analyses.

Keywords: GenAI; IoT; LLMs; RAG; maritime industry; sensor data.

PubMed Disclaimer

Grants and funding

Swedish Transport Administration

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Agentic RAG for Maritime AIoT: Natural Language Access to Structured Data

Affiliations

Agentic RAG for Maritime AIoT: Natural Language Access to Structured Data

Authors

Affiliations

Abstract

Grants and funding