Hopfield Networks
One of the earliest conceptualizations of biological neural networks with a feedback architecture.
- Feedforward networks have connections that form acyclic graphs
- Feedback networks are all networks that are not feedforward, i.e. their connection graph contains cycles
Hopfield networks are:
- Fully connected feedback networks
- Symmetric weights, no self-connections
- Associative (Hebbian) learning
- No separation of hidden vs. visible neurons
- Each neuron (node) updates its own state
- Based on the states of all other neurons
Hebbian Learning
Inspired by biological neurons. Key idea: the weight between two neurons is strengthened when their activities are positively correlated ("neurons that fire together wire together")
Associative memories <-> No supervision <-> Pattern completion
Hopfield network
Binary Hopfield defines neuron states by thresholding the activation $a$: $x_i = \operatorname{sign}(a_i) \in \{-1, +1\}$
Continuous Hopfield defines neuron states by squashing the activation $a$, e.g. $x_i = \tanh(a_i) \in (-1, +1)$
Note the feedback connection!
- Neuron $x_{1}$ influences $x_{3}$, but $x_{3}$ in turn influences $x_{1}$ back
Who influences whom first?
- Synchronous updates: all neurons update at once using $a_{i}=\sum_{j} w_{i j} x_{j}$, or
- Asynchronous updates: one neuron at a time (fixed or random order)
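The two update schemes can be sketched as follows (a minimal NumPy sketch; the network size and weights are illustrative, not from the lecture):

```python
import numpy as np

# Hypothetical small Hopfield network: symmetric weights, no self-connections.
rng = np.random.default_rng(0)
W = rng.normal(size=(5, 5))
W = (W + W.T) / 2          # symmetric weights
np.fill_diagonal(W, 0.0)   # no self-connections
x = rng.choice([-1.0, 1.0], size=5)

def sync_update(W, x):
    """Synchronous: all neurons compute a_i = sum_j w_ij x_j at once, then threshold."""
    a = W @ x
    return np.where(a >= 0, 1.0, -1.0)

def async_update(W, x, order=None):
    """Asynchronous: one neuron at a time, in a fixed or random order."""
    x = x.copy()
    order = range(len(x)) if order is None else order
    for i in order:
        a_i = W[i] @ x              # uses the *current* states of all other neurons
        x[i] = 1.0 if a_i >= 0 else -1.0
    return x
```

Note the key difference: the asynchronous step re-reads the current state vector before each neuron's update, so earlier flips within a sweep influence later ones.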
Hopfield memory
Network updates $x_{i} \in\{-1,1\}$ until convergence to a stable state
- Recurrent inference cycles
- Not a 'single propagation' like in feedforward networks
Stable means no $x_{i}$ flips its state anymore.
Energy function
Hopfield networks minimize the quadratic energy function $E(\mathbf{x}) = -\tfrac{1}{2} \sum_{i,j} w_{ij} x_{i} x_{j}$ (optionally with a bias term $-\sum_i b_i x_i$)
The Hopfield energy is a Lyapunov function, meaning it:
- Decreases under the dynamical evolution of the system
- Is bounded below
A system with a Lyapunov function converges to fixed points. The Hopfield energy is a Lyapunov function:
- Provided updates are asynchronous
- Provided weights are symmetric (with no self-connections)
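The Lyapunov property can be checked numerically. A minimal sketch, assuming the bias-free energy $E(\mathbf{x}) = -\tfrac{1}{2}\mathbf{x}^\top W \mathbf{x}$ and asynchronous threshold updates:

```python
import numpy as np

def energy(W, x):
    """Quadratic Hopfield energy E(x) = -1/2 * x^T W x (no bias terms here)."""
    return -0.5 * x @ W @ x

# Energy never increases under asynchronous updates with symmetric,
# zero-diagonal weights: the Lyapunov property.
rng = np.random.default_rng(1)
W = rng.normal(size=(8, 8))
W = (W + W.T) / 2
np.fill_diagonal(W, 0.0)
x = rng.choice([-1.0, 1.0], size=8)

energies = [energy(W, x)]
for _ in range(3):                      # a few full asynchronous sweeps
    for i in range(len(x)):
        x[i] = 1.0 if W[i] @ x >= 0 else -1.0
        energies.append(energy(W, x))

# Monotonically non-increasing energy sequence
assert all(e2 <= e1 + 1e-12 for e1, e2 in zip(energies, energies[1:]))
```

With synchronous updates this guarantee does not hold: the network can oscillate between two states instead of settling.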
Learning algorithm
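The classical learning algorithm is one-shot Hebbian storage: $w_{ij} = \tfrac{1}{P}\sum_{p} x_i^{(p)} x_j^{(p)}$ over the $P$ patterns to memorize. A minimal sketch (function names and the toy pattern are illustrative), including retrieval by pattern completion:

```python
import numpy as np

def hebbian_weights(patterns):
    """One-shot Hebbian learning: w_ij = (1/P) * sum_p x_i^p x_j^p.
    Symmetric by construction; self-connections removed."""
    P, N = patterns.shape
    W = patterns.T @ patterns / P
    np.fill_diagonal(W, 0.0)
    return W

def recall(W, x, max_sweeps=10):
    """Asynchronous updates until no neuron flips anymore (pattern completion)."""
    x = x.copy()
    for _ in range(max_sweeps):
        flipped = False
        for i in range(len(x)):
            new = 1.0 if W[i] @ x >= 0 else -1.0
            if new != x[i]:
                x[i], flipped = new, True
        if not flipped:          # stable state reached
            break
    return x

# Store one pattern and complete it from a corrupted cue.
p = np.array([1., -1., 1., 1., -1., -1., 1., -1.])
W = hebbian_weights(p[None, :])
cue = p.copy()
cue[:2] *= -1                    # corrupt two bits
restored = recall(W, cue)
```

No supervision is involved: the weights are computed directly from the stored patterns, and retrieval is just running the network dynamics from a partial pattern.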
Continuous-time continuous Hopfield network
We can replace the discrete state variables with continuous-time variables $x_i(t)$
At time $t$ we compute instantaneous activations $a_i(t) = \sum_j w_{ij} x_j(t)$
The neuron response is governed by a differential equation
For steady $a_{i}$, the neuron response converges to a stable state.
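These dynamics can be sketched with a commonly used form of the continuous Hopfield equation, $\tau \, \dot{a}_i = -a_i + \sum_j w_{ij} \tanh(a_j)$, integrated with forward Euler (the time constant and step sizes below are illustrative assumptions):

```python
import numpy as np

def simulate(W, a0, tau=1.0, dt=0.01, steps=50000):
    """Forward-Euler integration of tau * da/dt = -a + W tanh(a)."""
    a = a0.copy()
    for _ in range(steps):
        x = np.tanh(a)                    # continuous neuron response
        a = a + (dt / tau) * (-a + W @ x) # Euler step on the activation
    return a

rng = np.random.default_rng(2)
W = rng.normal(size=(4, 4))
W = (W + W.T) / 2
np.fill_diagonal(W, 0.0)

a = simulate(W, rng.normal(size=4))
# At a fixed point the drift vanishes: a_i = sum_j w_ij tanh(a_j).
residual = a - W @ np.tanh(a)
```

With symmetric weights these continuous dynamics also admit a Lyapunov function, so the state settles into a fixed point rather than oscillating.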
Hopfield networks for optimization problems
- Optimize a function under constraints
- The stable states correspond to (locally) optimal solutions
- In that case the weights must be chosen to encode both validity (constraints) and optimality of solutions
Hopfield networks is all you need
Ramsauer et al., 2020 https://arxiv.org/abs/2008.02217
- Retrieves stored memory "patterns" from a partial or noisy pattern
- The retrieval update rule is equivalent to the scaled dot-product self-attention mechanism in Transformers
References
- Lecture 8.3, UvA DL course 2020