Hu, Yingmeng;
Liu, Yan;
Kaushik, Aryan;
Masouros, Christos;
Thompson, John;
(2023)
Timely Data Collection for UAV-based IoT networks: A Deep Reinforcement Learning Approach.
IEEE Sensors Journal
10.1109/jsen.2023.3265935.
(In press).
Preview |
PDF
Timely_Data_Collection_for_UAV-based_IoT_networks_A_Deep_Reinforcement_Learning_Approach.pdf - Accepted Version Download (1MB) | Preview |
Abstract
In some real-time Internet of Things (IoT) applications, the timeliness of sensor data is very important for the performance of a system. How to collect the data of sensor nodes is a problem to be solved for an unmanned aerial vehicle (UAV) in a specified area, where different nodes have different timeliness priorities. To efficiently collect the data, a guided search deep reinforcement learning (GSDRL) algorithm is presented to help the UAV with different initial positions to independently complete the task of data collection and forwarding. First, the data collection process is modeled as a sequential decision problem for minimizing the average age of information or maximizing the number of collected nodes according to specific environment. Then, the data collection strategy is optimized by the GSDRL algorithm. After training the network using the GSDRL algorithm, the UAV has the ability to perform autonomous navigation and decision-making to complete the complexity task more efficiently and rapidly. Simulation experiments show that the GSDRL algorithm has strong adaptability to adverse environments, and obtains a good strategy for the UAV data collection and forwarding.
Type: | Article |
---|---|
Title: | Timely Data Collection for UAV-based IoT networks: A Deep Reinforcement Learning Approach |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1109/jsen.2023.3265935 |
Publisher version: | http://doi.org/10.1109/jsen.2023.3265935 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Data collection, UAV trajectory optimization, age of information, deep reinforcement learning |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng |
URI: | https://discovery-pp.ucl.ac.uk/id/eprint/10169266 |
Archive Staff Only
![]() |
View Item |