Reinforcement Learning-Based Optimal Firewall Placement and Configuration (RL_OFPC)

Zahra S. Torabi; David Eyers; Veronica Liesaputra

doi:10.1007/978-3-032-27993-4_31

Conference proceeding

Open access

Peer reviewed

ICT Systems Security and Privacy Protection: 41st IFIP TC11 International Conference, SEC 2026, Proceedings, pp.449-462

IFIP International Conference on ICT Systems Security and Privacy Protection (SEC 2026), 41st (Perth, Australia, 09/06/2026–11/06/2026)

IFIP Advances in Information and Communication Technology, 787

03/06/2026

DOI: https://doi.org/10.1007/978-3-032-27993-4_31

Handle: https://hdl.handle.net/10523/51273

Keywords: adaptability; firewall placement; network security requirement; reinforcement learning; scalability; virtual firewalls

Abstract

Firewalls remain foundational to cybersecurity, yet their traditional perimeter-based role is challenged by the dynamic nature of modern zero-trust and virtualised networks. In these environments, virtual firewalls (software-defined security functions deployed within service graphs) provide flexible, fine-grained control over traffic flows. However, their scalability and performance are often constrained by sub-optimal placement and rule configuration, especially in large or rapidly evolving topologies. This research introduces the Reinforcement Learning-based Optimised Firewall Placement and Configuration (RL_OFPC) model, which addresses these challenges through two cooperating reinforcement learning agents. The FRC-Agent manages path computation and rule enforcement to satisfy hard security constraints, while the FPO-Agent determines optimal firewall locations that minimise the number of deployed firewalls and rule instances while maintaining proximity to critical network components. The model is evaluated against the state-of-the-art VEREFOO framework using both the Maximum Flow (MF) and Atomic Predicate (AP) algorithms across 120 synthetic topologies. Results demonstrate that RL_OFPC achieves up to 97.6% accuracy in Network Security Requirement (NSR) satisfaction and improves runtime efficiency by up to 27% in high-NSR environments compared with VEREFOO. However, as the number of Allocation Points (APs) increases, the model's exploration overhead grows, occasionally surpassing VEREFOO's scalability performance. Despite this, RL_OFPC consistently adapts better to topology modifications through localised Q-learning updates rather than full recomputation, confirming its suitability for dynamic, high-assurance network environments.
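
For readers unfamiliar with the "localised Q-learning updates" mentioned in the abstract, the following is a minimal sketch of the standard tabular Q-learning update rule. It is not the authors' implementation: the state labels ("n1", "n2"), the action names ("place", "skip"), the reward value, and the learning-rate and discount parameters are all hypothetical, since this record does not describe the paper's state encoding or reward design. The sketch only illustrates that a single update touches one table entry and leaves the rest untouched.

```python
from collections import defaultdict


def q_update(q, state, action, reward, next_state, next_actions,
             alpha=0.1, gamma=0.9):
    """Apply one standard tabular Q-learning update and return the new value.

    alpha (learning rate) and gamma (discount factor) are illustrative
    defaults, not values taken from the paper.
    """
    # Best estimated value reachable from the next state (0.0 if terminal).
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    td_target = reward + gamma * best_next
    # Only the entry for (state, action) is modified.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Hypothetical example: nodes "n1" and "n2" stand in for allocation points.
q = defaultdict(float)
q[("n2", "place")] = 1.0  # a value assumed to have been learned earlier

new_value = q_update(q, "n1", "skip", reward=0.5,
                     next_state="n2", next_actions=["place", "skip"])
print(round(new_value, 4))
```

In this toy setting, a change affecting "n1" requires updating only the entries for "n1"; the previously learned value for ("n2", "place") is reused as-is, which is the general property that makes incremental adaptation cheaper than recomputing a solution from scratch.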

Files and links (1)

URL: https://rdcu.be/fnSpE

Details

Record Identifier: 9926872546401891
Title: Reinforcement Learning-Based Optimal Firewall Placement and Configuration (RL_OFPC)
Creators: Zahra S. Torabi; David Eyers; Veronica Liesaputra
Contributors: Helge Janicke (Editor); Lynn Futcher (Editor); Iqbal H. Sarker (Editor); Kerry-Lynn Thomson (Editor); Paul Haskell-Dowland (Editor)
Academic Unit: School of Computing
Publication Details: ICT Systems Security and Privacy Protection: 41st IFIP TC11 International Conference, SEC 2026, Proceedings, pp.449-462
Publisher: Springer Nature
Date published ; e-published: 03/06/2026
Conference: IFIP International Conference on ICT Systems Security and Privacy Protection (SEC 2026), 41st (Perth, Australia, 09/06/2026–11/06/2026)
Copyright: Copyright © International Federation for Information Processing 2026. All rights reserved. This work was first published in ICT Systems Security and Privacy Protection: 41st IFIP TC11 International Conference, SEC 2026, Proceedings (Springer Nature). The open access link to the subscription work is provided under the Springer Nature SharedIt Content-Sharing Initiative (https://www.springernature.com/gp/researchers/sharedit) making the view-only full-text work freely and legally accessible to anyone for research purposes and private study via the link: https://rdcu.be/fnSpE.
Comment: The published version is not available in full-text in OUR Archive. Where available, a link to the published version is provided (check the DOI and/or the Files and links section). The full-text item may be open access on the publisher's website. An earlier version of the work (such as authors' accepted manuscript following peer-review or unreviewed preprint/author's original version) may be available in the Files and links section of this record. Alternatively, readers may have subscription access to the full-text from the publisher.
Language: English
Resource Type ; Subtype: Conference proceeding; Conference Paper
