Improved Robustness and Hyperparameter Selection in Modern Hopfield Networks

Hayden McAlister; Anthony Robins; Lech Szymanski

doi:10.48550/arxiv.2407.08742

Back

Improved Robustness and Hyperparameter Selection in Modern Hopfield Networks

Preprint

Open access

Improved Robustness and Hyperparameter Selection in Modern Hopfield Networks

Hayden McAlister, Anthony Robins and Lech Szymanski

arXiv.org

Cornell University

30/07/2024

DOI: https://doi.org/10.48550/arxiv.2407.08742

Handle:

https://hdl.handle.net/10523/41201

Abstract

Computer Science - Learning

Computer Science - Neural and Evolutionary Computing

The modern Hopfield network generalizes the classical Hopfield network by allowing for sharper interaction functions. This increases the capacity of the network as an autoassociative memory as nearby learned attractors will not interfere with one another. However, the implementation of the network relies on applying large exponents to the dot product of memory vectors and probe vectors. If the dimension of the data is large the calculation can be very large and result in problems when using floating point numbers in a practical implementation. We describe this problem in detail, modify the original network description to mitigate the problem, and show the modification will not alter the networks' dynamics during update or training. We also show our modification greatly improves hyperparameter selection for the modern Hopfield network, removing hyperparameter dependence on the interaction vertex and resulting in an optimal region of hyperparameters that does not significantly change with the interaction vertex as it does in the original network.

Files and links (1)

url

https://arxiv.org/pdf/2407.08742View

Open

Metrics

20 Record Views

Details

Record Identifier: 9926557108401891
Title: Improved Robustness and Hyperparameter Selection in Modern Hopfield Networks
Creators: Hayden McAlister
Anthony Robins
Lech Szymanski
Academic Unit: School of Computing
Publication Details: arXiv.org
Publisher: Cornell University
Language: English
Resource Type ; Subtype: Preprint