Energy-efficient collision avoidance algorithms for UAV swarms

Shuangyao Huang

Back

Energy-efficient collision avoidance algorithms for UAV swarms

Doctoral Thesis

Open access

Energy-efficient collision avoidance algorithms for UAV swarms

Shuangyao Huang

Doctor of Philosophy - PhD, University of Otago

University of Otago

2023

Handle:

https://hdl.handle.net/10523/15656

Abstract

UAV

Collision Avoidance

Energy

Multi-Agent Reinforcement Learning

APF

PSO

Unmanned Aerial Vehicle (UAV) swarms can provide promising solutions for unmanned delivery, search and rescue, tracking, monitoring, and post-disaster communication recovery in terms of safety and cost. The key challenges in collision avoidance for UAV swarms are safety, energy efficiency, cooperation, and reaction time. Conventional solutions suffer from low energy efficiency, long reaction times, and ineffective cooperation. Recent methods based on machine learning, such as Multi-Agent Reinforcement Learning (MARL), can achieve high energy efficiency, effective cooperation, and fast reaction. However, they have high failure rates in collision avoidance. How to address all these challenges in collision avoidance remains an open and critical problem. This thesis proposes three solutions to address the challenges progressively. The first solution is the E2Coop algorithm that combines Artificial Potential Field (APF) and Particle Swarm Optimization (PSO) to achieve high energy efficiency and effective cooperation while ensuring safety. In E2Coop, APF provides environmental awareness and coordination to UAVs by constructing a potential field to represent the environment. At the same time, PSO searches optimal trajectories for UAVs, considering safety and energy efficiency under the coordination of APF. The second solution is CoDe, which uses Multi-Agent Reinforcement Learning (MARL) to train cooperative policies for UAVs operating in a swarm. The key contribution in CoDe is a novel credit assignment scheme based on difference rewards and counterfactual policy gradients. The credit assignment scheme requires no assumption on value functions, has low computational complexity, and applies to continuous action space. CoDe is over 90% faster than E2Coop in execution while reducing energy consumption by over 20%. The third solution is CoDe+, which combines E2Coop and CoDe to reduce learning variances and improve sample efficiency. It achieves at least 40% higher average score and saves over 50% of energy than E2Coop and nearly 30% than CoDe on average. The three solutions progressively address the key challenges of collision avoidance in UAV swarms, enabling more applications of UAV swarms in large-scale infrastructure-less and contact-less connections.

Files and links (1)

pdf

thesis_R.pdfDownload View

Metrics

441 File views/ downloads

273 Record Views

Details

Record Identifier: 9926478411201891
Title: Energy-efficient collision avoidance algorithms for UAV swarms
Creators: Shuangyao Huang
Contributors: Haibo Zhang (Advisor / Supervisor)
Zhiyi Huang (Advisor / Supervisor)
Academic Unit: Computer Science
Publisher: University of Otago
Degree Awarded: Doctor of Philosophy - PhD
Project Type: Thesis - Doctoral
Awarding Institution: University of Otago
Date published ; e-published: 2023
Language: English
Resource Type ; Subtype: Doctoral Thesis
Format: application/pdf