Quadtree-based Spatial-Visual Memory for Object Navigation

Juncheng Liu; Brendan McCane; Steven Mills

Conference paper

Open access

Australasian Conference on Robotics and Automation (ACRA 2024), Auckland, New Zealand (27/11/2024–29/11/2024)

2024

Handle:

https://hdl.handle.net/10523/47471

Abstract

Object navigation is a fundamental task for autonomous robot control, self-driving cars and other applications. It requires the agent to take effective actions to navigate to a specific target semantic object in a previously unseen environment. We tackle this problem using reinforcement learning and a proposed spatial-visual joint memory in a recursive quadtree representation. Compared to existing work, our recursive quadtree architecture leverages both visual and occupancy/spatial information. Maintaining an occupancy map also makes it possible to take advantage of deterministic path-planning techniques, which lead to better training-sample efficiency and shorter navigation trajectories. Additionally, our quadtree representation further improves efficiency by avoiding the processing of empty quadrants. We evaluate our proposed method on object navigation tasks in two publicly available simulators: Habitat and AI2-THOR. Experimental results show that our method achieves state-of-the-art performance in both success rate and SPL (success weighted by path length) metrics.
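
As a rough illustration of the quadrant-skipping idea mentioned in the abstract, the following Python sketch builds a quadtree over a small binary occupancy grid and then visits only the occupied regions. It is not the authors' implementation. The names `QuadNode`, `build` and `occupied_leaves`, the grid encoding (1 = occupied, 0 = empty), and the power-of-two grid size are all assumptions made for this example. It covers occupancy only, not the visual features or the reinforcement-learning policy.

```python
from dataclasses import dataclass
from typing import List, Optional

Grid = List[List[int]]  # assumed encoding: 1 = occupied cell, 0 = empty cell


@dataclass
class QuadNode:
    """Hypothetical quadtree node covering a square region of the grid."""
    x: int          # left column of the region
    y: int          # top row of the region
    size: int       # side length of the region, in cells
    occupied: bool  # True if any cell in the region is occupied
    children: Optional[List["QuadNode"]] = None  # None for leaf nodes


def build(grid: Grid, x: int = 0, y: int = 0, size: Optional[int] = None) -> QuadNode:
    """Recursively build a quadtree; the grid side is assumed to be a power of two."""
    if size is None:
        size = len(grid)
    cells = [grid[r][c] for r in range(y, y + size) for c in range(x, x + size)]
    if not any(cells):
        # Entirely empty region: store one leaf and do not subdivide.
        return QuadNode(x, y, size, occupied=False)
    if all(cells) or size == 1:
        # Uniformly occupied region (or a single cell): one occupied leaf.
        return QuadNode(x, y, size, occupied=True)
    half = size // 2
    # Mixed region: split into four quadrants (top-left, top-right,
    # bottom-left, bottom-right) and recurse.
    children = [build(grid, x + dx, y + dy, half)
                for dy in (0, half) for dx in (0, half)]
    return QuadNode(x, y, size, occupied=True, children=children)


def occupied_leaves(node: QuadNode) -> List[QuadNode]:
    """Collect occupied leaves without descending into empty quadrants."""
    if not node.occupied:
        return []  # empty quadrant: skipped entirely
    if node.children is None:
        return [node]
    leaves: List[QuadNode] = []
    for child in node.children:
        leaves.extend(occupied_leaves(child))
    return leaves


# Example: a 4x4 map with obstacles only in the bottom-right quadrant.
grid = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 0],
]
tree = build(grid)
for leaf in occupied_leaves(tree):
    print(leaf.x, leaf.y, leaf.size)
```

In this example three of the four top-level quadrants are stored as single empty leaves and are never subdivided or visited, which is the kind of saving the abstract attributes to the quadtree representation.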

Files and links (1)

url

https://ssl.linklings.net/conferences/acra/acra2024_proceedings/views/includes/files/pap103s2.pdf

Details

Record Identifier: 9926760642601891
Title: Quadtree-based Spatial-Visual Memory for Object Navigation
Creators: Juncheng Liu; Brendan McCane; Steven Mills
Conference: Australasian Conference on Robotics and Automation (ACRA 2024), Auckland, New Zealand (27/11/2024–29/11/2024)
Academic Unit: School of Computing
Date published ; e-published: 2024
Comment: The published version is not available in full-text in OUR Archive. Where available, a link to the published version is provided (check the DOI and/or the Files and links section). The full-text item may be open access on the publisher's website. An earlier version of the work (such as authors' accepted manuscript following peer-review or unreviewed preprint/author's original version) may be available in the Files and links section of this record. Alternatively, readers may have subscription access to the full-text from the publisher.
Language: English
Resource Type ; Subtype: Conference paper