A small constraint result on learned transitions in attractor-style models

Hi all,

I wanted to share a small, constraint-oriented paper that came out of me trying to clarify something that’s often left implicit in discussions of grid-like representations, internal simulation, and Thousand Brains-style models.

In spatial settings, displacement is explicit: velocity signals, oscillatory-interference style mechanisms, or sensorimotor loops drive movement along an attractor manifold, while the attractor itself stabilizes and integrates state. In more abstract settings (e.g. internal simulation, replay, or non-spatial “grid-like” representations), that displacement-like structure is often assumed to come from elsewhere: simulated sensory activity, higher-level control, or another interacting system.

What I explored was a deliberately stripped-down question: if that displacement structure is left implicit or withheld, what does local recurrent attractor learning converge to on its own? Not as a biological claim, but as a way of understanding what work the attractor is, and isn’t, doing inside a larger architecture.

The short answer: it reliably finds a shortcut.

Across simple successor tasks, unless evaluation explicitly enforces late-horizon stability, learning converges to what I ended up calling impulse cheating: the network briefly flashes the correct successor state to satisfy the loss, but the activity collapses during free-run. You get the right answer for the wrong dynamical reason.

Only when persistence is explicitly tested do genuinely self-sustaining dynamics emerge, and even then they are strongly shaped by geometry (a continuous ring works; a folded “snake” manifold fails at discontinuities). The geometric result itself isn’t the main point. It mostly serves to illustrate how much can be hidden until transient solutions are ruled out.

I don’t think this contradicts how anyone expects biological systems to work. If anything, it sharpens the intuition that stable internal simulation requires explicit displacement-like structure somewhere in the system, and that local attractor recurrence alone is strongly biased toward predictive shortcuts unless constrained otherwise.

Sharing this mainly as a boundary-setting result, and as a reminder that prediction accuracy can quietly hide non-attractor dynamics unless persistence is measured directly.

Paper + code here:

https://github.com/javadan/can-paper/blob/main/paper/learning_discrete_successor_transitions_in_continuous_attractor_networks_emergence_limits_and_topological_constraints.pdf

https://github.com/javadan/can-paper

- Daniel

4 Likes

Very cool! One of the things I enjoy about this forum is the extreme diversity of approaches, backgrounds, interests, etc.

2 Likes

I love how you basically express that you have no strong feelings about attractors, but then you go on and school attractor researchers about a basic pitfall they’ve likely been hand-waving for years :laughing:

2 Likes

Very loosely speaking, the way I have been thinking about the results in relation to both grid cells and Thousand Brains is as follows.

For grid cells, I see the results as bolstering the canonical view. In spatial settings, attractor-like dynamics are typically discussed together with continuous external drive.

In relation to Thousand Brains, the results suggest that local recurrent learning in isolation is unlikely to give rise to sustained activity, at least under simple Hebbian-style learning where ‘impulse cheating’ is the default regime. This makes it feel more plausible that persistence is maintained through continuous drive from larger closed-loop circuits.

From that perspective, if CAN-like mechanisms do play a role in cortical columns, then both the effective ‘displacement’ and the persistence of activity are likely supported by sensorimotor loops, internally simulated input, or cortico-cortical feedback, with local attractor dynamics primarily constraining and stabilizing state rather than generating transitions or sustaining fire, on their own.

I am less familiar with the neuroscience side, so I suspect much of this already aligns with current assumptions. If anything, the results may just help sharpen intuitions about what work local attractor dynamics are, and are not, doing inside a larger system.

1 Like