A Possible Approach ... Darwin | Cover | A Thousand Brains

A Possible Approach to AGI.pdf (2.2 MB)

As promised.

Thoughts appreciated.

2 Likes

Interesting read, I particularly like the concept of hyper-tetrahedrons as the optimal solution
in an n-dimensional problem space, if I am understanding it correctly. Does rather beg the question
‘why aren’t all fund managers running this algorithm, sitting back and raking in the profits’,
or perhaps they are.

I struggle with how to get from sensory nerve signals, to hyper-tetrahedrons and back
to motor nerve signals.

A short thought experiment:

  1. Driving a car requires intelligence we presume, it’s not something even the smartest ape could do.

  2. If I train an LLM on all the information on cars, roads and driving there is in the world it will
    be able to correctly answer any question about driving a car you ask. Does it know how to drive a car ?

  3. I expect the answers will be yes to part 1 and no to part 2. Driving a car requires muscle memory and extensive situational awareness only acquirable through practical experience. The intelligence required and acquired eventually gets utilised at an almost subconscious level without the driver being aware of it at the fully conscious level a learner driver must use.

So, I have an artificial creature with a whole bunch of sensory nerves from all of the artificial muscles and eyes and ears and balance organs, and another bunch of motor nerves stimulating the artificial muscles. What goes in between is the AGI system, Monty, Cover or some such. But lower down it’s not an n-dimensional problem to optimise. It’s a hole where I went to place the foot, a step to climb over, a room to navigate. This feels more like an n-dimensional feedback control loop, something I like to call behavioural intelligence. Unfortunately the term is commonly used to mean intelligence about human behaviour, whereas I refer to creating behaviour in an artificial creature, which could be said to be intelligent behaviour. This control loop could be implemented in the form of a recurrent neural network with pliable hidden states.

The higher functions would then be built on top of the n-dimensional feedback control loop, modifying the control as it extracts more complex patterns from the sensory data and formulates more sophisticated responses. I think what I am saying is that something like an old brain implementation is required as an interface between a neocortex model and the real world, and it’s real world behaviour that we perceive, not unreasonably, as intelligence.

Fund managers operate primarily as risk takers and speculators imagining (against mountains of evidence) that they can beat markets. The most sophisticated ones attempt to lay risk off on other participants (this often looks like market manipulation, e.g., high frequency trading accessing information not yet available to the public).

Cover’s universal portfolio doesn’t have any excitement to it. It just works, positioning one to do as well as one can reasonably do and to do so with low volatility over the long run.

I think A Thousand Brains (or some Cover variant of it) does operate via n-dimensional feedback control loops. In my understanding, process wise they differ in the order of operations.

A Thousand Brains:

  1. encounters something,
  2. models the input,
  3. identifies (applying the just created model).

Cover variant:

  1. encounters something,
  2. identifies (applying the last created model), then
  3. updates the model for the next encounter.

A Cover approach operates faster in real time with the trade off of accuracy. That said, over the longer run, it gets to the same success in respect to the unknowable future. Cover’s algorithms operate like evolution, over time.

I think lots of high-level intelligence operates at a subconscious level.

Psychiatrist, Anton Ehrenzweig published The Hidden Order of Art: A Study in the Psychology of Artistic Imagination, in 1967. In sections of the work, he posits that complexity of certain tasks go beyond the capacity of a conscious mind to do them efficiently, effectively, or practically at all.

Yet we (humans) do them. Ehrenzweig observes that those that can get out of their own way access the deeper and broader capacity of our sub-conscious minds - the organizational powers of the sub-conscious do extraordinary things. We all have experiences of this. Creativity. Lateral thinking. Flow states. Inspiration. Epiphanies.

Federico García Lorca famously described Flamingo’s idea of duende as “
a power, not a work; a struggle, not a thought,” something outside of the individuals consciousness.

“No mind states,” or the concept of “no-mind”, from meditative traditions like Zen and Daoism (mushin or wuxin) described a state as not an absence of awareness or a complete cessation of all brain activity, but rather a state of pure consciousness, freedom from conceptual thought, emotional attachment, and the ego-based “thinking mind” which accesses the whole mind.

I don’t see that one needs much beyond the operation of reference frames (maybe + Cover) in silicon to replicate this.

Unless I’m mistaken, Monty doesn’t actually build a fresh model prior to object identification. It uses existing hypotheses to interpret sensation immediately, then updates the model if and when appropriate. So Monty’s operation order, to me, seems closer to the Cover variant you described.

That said, I like your framing of Cortical Columns as a kind of distrobuted MoE network. Thats a fun way of viewing it. Using simplex-geometry to model module concensus is also clever.

As for thoughts
 I’d love to see an expansion of the memo’s ‘Innovation and Creativity’ section. To me, evolution-style reweighting optimizes that which already exists, but it doesn’t really explain how the system invents new experts. So from within this framework, where would you say abstration comes from? How is it achieved?

2 Likes

The same way brains already do it – extending a reference frame to incorporate some combination of stuff that we hadn’t previously combined, which, in being combined, enables something or solves a problem we or even experts in a field have previously failed to solve.

We don’t create ex nihilo. We can only combine things or ideas about things that already exist in the world (world certainly includes the body of human knowledge about the world).

A story came down to me that illustrates this. P&G sent a group of summer interns to Washington, D.C. to look for things at the US Patent Office the company could use.

An intern, a young woman in her teens, came across a patent that described a paper “gortex” like material that water could pass through in only one direction. Being just paper she didn’t think it had sufficient substance to use for anything, e.g., you couldn’t make space suits out of it or waterproof shoes or jackets. She set the patent aside.

Later that day, the same intern came across a 2nd patent. This patent’s illustration looked like a cloud and it described a material that could absorb thousands of times its weight in water.

The intern, an experienced expert as a baby sitter, reportedly picked up the two patents, put them together, and invented Pampers and - as told to me - received a patent even though it incorporated two patents already filed.

Not certain if it really happened this way, but the story makes the point.

Certainly, innovation/invention/originality in any area of human endeavor can seem strange, when we don’t know all the bits that someone accessed to arrive at the innovation.

Patent applications require a review of “prior art”.

Magic tricks can’t do anything beyond physics, they just hide or divert us from seeing all the pieces.

I still find Shakespeare strange and wonderful.

I feel the same way about A Thousand Brains.

2 Likes

Great. I had only come to A Thousand Brains via the book and have only begun to dig into the more recent work.

I think I phrased this poorly. I agree with your stance here, I was more probing for how you’d propose doing it (or how you think the brain does it).

It’s been a minute since I really thought about it myself, but personally, I suspect the brain employs a kind of resting state manifold learning function, potentially observed in things such as DMN activity.

Your questions have had me thinking the past several months. The following attempts to make sense of it.

Innovation, abstraction, creativity, even the capacity to generate explanations evolved.

Cover fits into this as the mathematician of Darwinian competence. Tom formalized why blind variation plus selection works so well, while remaining largely silent on knowledge, meaning, or explanation.

The mathematics of blind success

Cover’s theorem (1965) stated informally:

A complex pattern classification problem that is not linearly separable in low dimensions is likely to become linearly separable when projected into a sufficiently high-dimensional space.

Thereby:

  • You do not need insight, understanding, or foresight to solve hard problems.

  • You need rich variation and a simple selection rule.

Cover proved:

  • High-dimensional representations make discrimination easy

  • The power of random feature expansion

  • Selection (error minimization) does the organizing

  • Performance can arise without comprehension

Cover expresses Darwin’s algorithm in geometry, explaining why:

  • Evolution works

  • Machine learning works

  • Scaling beats cleverness

He never published anything that claimed these systems understand anything, although I had conversations with him that pointed the way.

Essentially Cover formalized Dawkins’s “Blind Watchmaker”

Richard Dawkins explains conceptually what Cover shows mathematically.

Creativity: Innovation, Abstraction, and Explanation in Cover:

Some questions:

  • Can brains evolve the capacity to expand a solution space?

  • What constitutes abstraction?

  • Where does the capacity to form explanations evolve?

  • Could abstraction = explanation?

Innovation combines things and/or processes (nouns and verbs) not previously combined to solve problems not previously solvable.

Abstraction works in an almost opposite direction, identifying something common to a class of things or processes.

Innovation combining abstractions not previously combined might get us to explanation.

Prior to his death, Tom Cover and I spoke at length about expanding solution spaces e.g. , adding new experts (models). Tom saw no issue with his universal processes incorporating new experts into their optimization framework. Neither did he think doing so would de-optimize the process.

The question becomes, how does a brain add new experts?

Evolution already solved this question.

Conjecture:

  • Abstraction always (and naturally) emerges as one or more experts/models within a cluster of cortical columns.

  • As example, in a reference frame considering a cup, one expert/model casts its “vote” as an abstraction of a cup, a vessel.

  • The abstraction (vessel) then becomes available in the modeling of bowls, vases, bottles, even variation like stainers operating across multiple reference frames.

  • Participation of abstractions across multiple reference frames introduces innovation.

  • Innovation yields a new expert/model that could participate across new reference frames or even across the reference frames from which its components reside.

  • The innovation, as any expert/model, gets assessed time step after time step for fitness/usefulness.

  • Abstraction and innovation emerge/evolve.

I don’t think one needs anything other than this sort of evolutionary innovation to get to abstraction.

How does one then get to explanation?

  • Abstraction extracts invariants across instances.

  • Explanation organizes abstractions into a generative constraint structure that answers counterfactuals.

(I rely on Popper and Deutsch in this.)

Abstraction compresses. Explanation constrains.

An abstraction like “vessel” compresses cups, bowls, bottles.
An explanation adds mechanism and counterfactual reach:

  • vessels hold fluids because geometry + containment constraints

  • if you puncture containment, holding fails

  • if gravity changes, vessel behavior changes

So abstraction alone does not equal explanation


Explanation requires abstraction plus a structure that governs behavior across interventions.

What mechanism generates new experts?

Restating the above conjecture:

  • cortical columns act as local modelers (Hawkins),

  • clusters of columns produce candidate models,

  • some models compress variation across contexts → abstractions,

  • abstractions become reusable modeling primitives,

  • cross-frame participation of abstractions enables innovation,

  • innovations spawn new experts,

  • universal selection evaluates them over time.

An evolutionary learning loop looks like this:

variation → abstraction → recombination → new expert → selection

Essentially, a cognitive analog of Darwin + Cover.

The “vessel” example illustrates it (again repetitive, but maybe worth the repetition):

  • object-centric columns encode cups, bowls, bottles

  • clustering reveals invariant containment geometry

  • one column (or small ensemble) stabilizes that invariant → abstraction “vessel”

  • abstraction participates across reference frames → compositional reuse

  • recombination yields new experts (e. g. , strainer as constrained vessel)

Nothing mystical enters. Abstraction emerges as a byproduct of compressive modeling pressure, evolutionary natural selection.

Does abstraction + solution-space expansion yield explanation?

The conjecture captures two necessary ingredients:

  • expansion of the expert space (innovation)

  • compression across instances (abstraction)

Together they create the raw material for explanation.

But explanation requires one additional step:

Explanation = abstraction embedded in a constraint network

It seems that abstractions become explanations when they:

  • govern behavior under intervention,

  • unify multiple phenomena under shared constraints,

  • survive criticism across contexts.

So the evolutionary loop naturally generates abstraction.
But explanation emerges only when selection pressures favor:

  • counterfactual robustness,

  • cross-context invariance,

  • minimal generative structure.

It seems as if:

  • abstraction provides the vocabulary of explanation

  • selection over counterfactual performance provides the grammar.

Innovation as cross-reference-frame abstraction recombination

The key idea lies in:

Innovation combining abstractions across reference frames yields new experts.

This gives a mechanistic account of creativity:

  • each reference frame develops local abstractions,

  • cross-frame interaction reveals compatibility or analogy,

  • recombination creates candidate generative structures,

  • selection filters for usefulness.

That mechanism naturally yields explanation because cross-frame consistency imposes constraints. If an abstraction works across many frames, it likely captures a real invariant.

I think Daniel Dennett would view this as a real pattern.

I further suggest that David Deutsch (see David Deutsch, Aeon, Creative blocks, 2012) would view this as an explanation with reach (I emailed him when the Aeon piece came out. He graciously responded. He might have an interest in chiming in on this).

Summary

The evolutionary abstraction-innovation loop can generate explanation, but only under one added pressure:

**selection must reward counterfactual stability, not only predictive fit.**
  • Innovation expands the hypothesis space.

  • Abstraction compresses variation.

  • Selection across contexts filters abstractions.

  • Cross-frame reuse imposes invariance.

  • Counterfactual robustness elevates abstraction into explanation.

Thus explanation emerges as an evolutionary equilibrium in the space of abstractions.

Concisely

A brain expands its solution space by generating new experts through recombination and structural mutation. Some experts compress variation across instances and stabilize as abstractions. Abstractions participate across multiple reference frames, enabling innovation through recombination. Universal selection evaluates innovations over time. When an abstraction survives cross-context criticism and supports reliable counterfactual reasoning, the abstraction functions as an explanation. Computational irreducibility limits how far explanations compress dynamics, but it does not prevent abstractions from emerging. Explanation therefore appears as a selected subset of abstractions that achieve invariant generative reach.

1 Like

I like this.

I think you’re probably correct here. I actually think one of the core functions of the default mode network is to compress high-dimensional information into a low-dimensional state (e.g., generalizing it). At one point, I think I used the term ‘resting-state manifold learner’ to describe the DMN in more ML-specific terms.

The CEN and saliance networks, I would suspect, also have a role to play in generating explantion.

Regarding the selection pressure: You assert that explanation emerges when selection rewards counterfactual stability rather than just predictive fit. But what generates that pressure? Predictive fit is metabolically cheap to evaluate. But counterfactual robustness isn’t. You’re essentially simulating interventions which haven’t happened yet. So why might a brain—one which evolved under tight energy constraints—seem to prefer this?

1 Like

I’ve been rereading Darwin, Dawkins, and Dennet, so I may have a kind of contact bias. Apologies if I’ve missed something in your questions.

At the highest level, evolutionary survival of the fittest within the environment the human species found itself.

Don’t all internally referencing reference frames do this? We visualize things not in front of us. We can visualize things in our heads that we’ve never seen in real life. Our brains evolved these capacities. They must have provided fitness advantages.

Counterfactual robustness and the accompanying capacity to form explanations more distinguishes humans—from all other beings—than anything else (I first encountered this idea in David Deutsch’s The Beginning of Infinity). More so even than language or tool making which apparently lots of creatures do.

Why human brains evolved this under the environmental (including energy) constraints they faced, seems less important than that they evolved to do so.

While we may not have identified the exact biological mechanism enabling this (although Thousand Brains seems plausible and very close), we certainly know that one exists. While we may not understand the step-by-step process that evolved the mechanism, we certainly know that it occurred.

Evolution deems counterfactual robustness as worth the cost. It could only do so, because it conferred fitness.

1 Like

I think your idea aligns directly with Thousand Brains consensus formation (whether a Cover implementation or Monty).

During rest (especially DMN activity), the brain can internally explore and reorganize the geometric structures (e.g., reference frames) of its learned representations, refining the latent manifold that supports perception, imagination, and abstraction. Consistent with the brain reflecting on itself.

Rest = geometric self-organization of experience.

Different sleep states may support different access to these capacities.
Matthew Walker’s, Why We Sleep, provides a useful lay-person description of all of this,

I think the experience of coming to insights during phases of light sleep coping up into consciousness, enables some memory of the process.

The Sufi poet, Jalāl al-Dīn Muងammad Rƫmī, advised (paraphrased):

Should you wake in the night, welcome the early hours, the morning winds have messages for you.

Given a Cover | Universal Portfolio | epistemic framing:

  • DMN manifold exploration resembles expert weighting over hypotheses
  • Replay and simulation approximate Cover-style universal search
  • Consensus across cortical columns parallels Hawkins’s ensemble voting

Pretty direct link:

DMN dynamics ↔ evolutionary manifold search ↔ Thousand Brains consensus formation

Evolution selected for us to spend nearly 1/3 of our lives asleep - a huge cost, but maybe that sleep pushes everything else.

1 Like