Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 23;25(15):4554.
doi: 10.3390/s25154554.

Intelligent Priority-Aware Spectrum Access in 5G Vehicular IoT: A Reinforcement Learning Approach

Affiliations

Intelligent Priority-Aware Spectrum Access in 5G Vehicular IoT: A Reinforcement Learning Approach

Adeel Iqbal et al. Sensors (Basel). .

Abstract

Efficient and intelligent spectrum access is crucial for meeting the diverse Quality of Service (QoS) demands of Vehicular Internet of Things (V-IoT) systems in next-generation cellular networks. This work proposes a novel reinforcement learning (RL)-based priority-aware spectrum management (RL-PASM) framework, a centralized self-learning priority-aware spectrum management framework operating through Roadside Units (RSUs). RL-PASM dynamically allocates spectrum resources across three traffic classes: high-priority (HP), low-priority (LP), and best-effort (BE), utilizing reinforcement learning (RL). This work compares four RL algorithms: Q-Learning, Double Q-Learning, Deep Q-Network (DQN), and Actor-Critic (AC) methods. The environment is modeled as a discrete-time Markov Decision Process (MDP), and a context-sensitive reward function guides fairness-preserving decisions for access, preemption, coexistence, and hand-off. Extensive simulations conducted under realistic vehicular load conditions evaluate the performance across key metrics, including throughput, delay, energy efficiency, fairness, blocking, and interruption probability. Unlike prior approaches, RL-PASM introduces a unified multi-objective reward formulation and centralized RSU-based control to support adaptive priority-aware access for dynamic vehicular environments. Simulation results confirm that RL-PASM balances throughput, latency, fairness, and energy efficiency, demonstrating its suitability for scalable and resource-constrained deployments. The results also demonstrate that DQN achieves the highest average throughput, followed by vanilla QL. DQL and AC maintain fairness at high levels and low average interruption probability. QL demonstrates the lowest average delay and the highest energy efficiency, making it a suitable candidate for edge-constrained vehicular deployments. Selecting the appropriate RL method, RL-PASM offers a robust and adaptable solution for scalable, intelligent, and priority-aware spectrum access in vehicular communication infrastructures.

Keywords: 5G; Internet of Things; priority-aware spectrum management; reinforcement learning; resource allocation; spectrum access.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
System model of RL-PASM showing RSU-based spectrum allocation for HP (red), LP (blue), and BE (white) traffic classes.
Figure 2
Figure 2
Centralized RL-PASM architecture: The RSU observes the system state (C,D,A) and selects an access policy based on learned priorities for HP, LP, and BE vehicular traffic.
Figure 3
Figure 3
Maximum Q-value trend across episodes for each of the four evaluated RL algorithms.
Figure 4
Figure 4
Mean throughput comparison across RL agents.
Figure 5
Figure 5
Mean delay per episode across all RL agents in milliseconds (ms).
Figure 6
Figure 6
CDF of delay across all agents.
Figure 7
Figure 7
Mean energy efficiency, computed as the successful transmissions per unit energy across all agents.
Figure 8
Figure 8
Mean Jain’s fairness index for all RL agents.
Figure 9
Figure 9
CDF of interruption probability for each RL agent.
Figure 10
Figure 10
Mean blocking probability with standard deviation across RL agents.

Similar articles

References

    1. Wang C.X., You X., Gao X., Zhu X., Li Z., Zhang C., Wang H., Huang Y., Chen Y., Haas H., et al. On the road to 6G: Visions, requirements, key technologies, and testbeds. IEEE Commun. Surv. Tutor. 2023;25:905–974. doi: 10.1109/COMST.2023.3249835. - DOI
    1. Shakeel A., Iqbal A., Nauman A., Hussain R., Li X., Rabie K. 6G driven Vehicular Tracking in Smart Cities using Intelligent Reflecting Surfaces; Proceedings of the 2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring); Florence, Italy. 20–23 June 2023; pp. 1–7. - DOI
    1. Tang M., Cai S., Lau V.K. Radix-partition-based over-the-air aggregation and low-complexity state estimation for IoT systems over wireless fading channels. IEEE Trans. Signal Process. 2022;70:1464–1477. doi: 10.1109/TSP.2022.3158759. - DOI
    1. Twahirwa E., Rwigema J., Datta R. Design and deployment of vehicular Internet of things for smart city applications. Sustainability. 2021;14:176. doi: 10.3390/su14010176. - DOI
    1. Misra S., Mukherjee A., Roy A. Introduction to IoT. Cambridge University Press; Cambridge, UK: 2021. Vehicular IoT; pp. 283–292.

LinkOut - more resources