Model-Aided Federated Reinforcement Learning for Multi-UAV Trajectory Planning in IoT Networks

Jichao Chen, Omid Esrafilian, Harald Bayerlein, David Gesbert, Marco Caccamo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

Deploying teams of unmanned aerial vehicles (UAVs) to harvest data from distributed Internet of Things (IoT) devices requires efficient trajectory planning and coordination algorithms. Multi-agent reinforcement learning (MARL) has emerged as a solution, but requires extensive and costly real-world training data. To tackle this challenge, we propose a novel model-aided federated MARL algorithm to coordinate multiple UAVs on a data harvesting mission with only limited knowledge about the environment. The proposed algorithm alternates between building an environment simulation model from real-world measurements, specifically learning the radio channel characteristics and estimating unknown IoT device positions, and federated QMIX training in the simulated environment. Each UAV agent trains a local QMIX model in its simulated environment and continuously consolidates it through federated learning with other agents, accelerating the learning process. A performance comparison with standard MARL algorithms demonstrates that our proposed model-aided FedQMIX algorithm reduces the need for real-world training experiences by around three magnitudes while attaining similar data collection performance.

Original languageEnglish
Title of host publication2023 IEEE Globecom Workshops, GC Wkshps 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages818-823
Number of pages6
ISBN (Electronic)9798350370218
DOIs
StatePublished - 2023
Event2023 IEEE Globecom Workshops, GC Wkshps 2023 - Kuala Lumpur, Malaysia
Duration: 4 Dec 20238 Dec 2023

Publication series

Name2023 IEEE Globecom Workshops, GC Wkshps 2023

Conference

Conference2023 IEEE Globecom Workshops, GC Wkshps 2023
Country/TerritoryMalaysia
CityKuala Lumpur
Period4/12/238/12/23

Fingerprint

Dive into the research topics of 'Model-Aided Federated Reinforcement Learning for Multi-UAV Trajectory Planning in IoT Networks'. Together they form a unique fingerprint.

Cite this