Meta-path guided policy distillation for resilient coordination in autonomous unmanned swarm
- PMID: 41474790
- PMCID: PMC12755786
- DOI: 10.1371/journal.pone.0339675
Meta-path guided policy distillation for resilient coordination in autonomous unmanned swarm
Abstract
Enhancing the resilience of Autonomous Unmanned Swarms (AUS) requires policies that remain effective under severe, structured disruptions while respecting the heterogeneous semantics of inter-subsystem interactions. Existing reinforcement learning (RL) approaches typically aggregate first-order neighborhoods in a path-agnostic manner, thereby blurring typed, ordered, and directed multi-hop dependencies encoded by domain meta-paths. We propose MPGPD-RC, a Meta- Path Guided Policy Distillation framework for Resilient Coordination that couples: (i) meta-path-guided embeddings learned by path-specific graph attention with contrastive reconstruction and attention fusion, and (ii) a teacher-student scheme in which a PPO teacher trained with a relaxed meta-path mask provides trajectories, and a student aligns both action distributions (KL) and trajectory-level structural codes via path-aware contrastive learning. Empirical evaluations validate that MPGPD-RC consistently surpasses state-of-the-art baselines across diverse perturbation scenarios by modeling complex, high-order dependencies that underpin resilient coordination.
Copyright: © 2025 Han et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures
References
-
- Du Z, Luo C, Min G, Wu J, Luo C, Pu J, et al. A survey on autonomous and intelligent swarms of Uncrewed Aerial Vehicles (UAVs). IEEE Trans Intell Transport Syst. 2025;26(10):14477–500. doi: 10.1109/tits.2025.3569500 - DOI
-
- Li H, Zhong Y, Zhuang X. A soft resource optimization method based on autonomous coordination of unmanned swarms system driven by resilience. Reliability Engineering & System Safety. 2024;249:110227. doi: 10.1016/j.ress.2024.110227 - DOI
-
- Al-lQubaydhi N, Alenezi A, Alanazi T, Senyor A, Alanezi N, Alotaibi B, et al. Deep learning for unmanned aerial vehicles detection: a review. Computer Science Review. 2024;51:100614. doi: 10.1016/j.cosrev.2023.100614 - DOI
-
- Wang Y, Shen C, Huang J, Chen H. Model-free adaptive control for unmanned surface vessels: a literature review. Systems Science & Control Engineering. 2024;12(1). doi: 10.1080/21642583.2024.2316170 - DOI
MeSH terms
LinkOut - more resources
Full Text Sources
