Calyx — Foundations

PRINCIPLE 1

Living things persist by keeping a model of the world true

The Free Energy Principle (Karl Friston's framework) says that any system which persists — a cell, an animal, an institution — does so by maintaining a model of its environment and acting to minimise surprise: the gap between what it predicts and what it encounters. This is active inference — you don't just update your model to fit the world, you also act on the world to fit your model. Sense, model, act, repeat. It's the same loop cybernetics described decades ago.

A company does exactly this, badly. It senses its market, holds a picture of itself, and acts to close the gap — except the picture lives smeared across a dozen tools and somebody's head, and nobody can keep it current. Calyx is the attempt to give that loop a real substrate.

PRINCIPLE 2

Meaning is what a system needs in order to persist

The physicist David Wolpert, with Artemy Kolchinsky (2018), made this precise: semantic information is the information a system holds that is causally necessary for it to keep existing. A bacterium tracking a chemical gradient holds semantic information about it; a rock holds none. Meaning isn't projected onto the world by minds — it's the subset of information that bears on survival.

That reframes a company's context. The decisions, the history, the model of itself — that is its semantic information, the part it must keep true to persist. Lose it and the organism goes blind.

Hold onto this name — Kolchinsky & Wolpert. It reappears in the reasoning lens below, doing the same job for language that it does here for companies. That's not a coincidence; it's the hinge where the two lenses are secretly the same lens.

PRINCIPLE 3

Intelligent systems need legible, low-entropy environments

A system can only act on a world it can model accurately. Drop an agent into an opaque, high-entropy substrate and its predictions degrade; give it a legible, ordered one and it reasons reliably. This isn't a preference — it falls straight out of the principle above.

So Calyx is a folder of plain markdown files: deterministic, inspectable, low-entropy. The file-first decision isn't aesthetic. It's the condition under which agentic reasoning actually works — which, as it happens, is also what makes the workspace pleasant for a human. The same choice serves both.

PRINCIPLE 4

A company is a living system — so the context is the moat

Put the three together. A company is a cybernetic system that persists by keeping a true model of itself; that model is its semantic information; and it can only run well on a legible substrate. Calyx is built to be that substrate — sensing your connected tools, updating the vault, acting to close the gap between what the organisation believes and what's actually true.

And the more of the company that lives inside it, the more capable its agents become and the more expensive it is to leave. That's a switching cost and a network effect that compound per customer, every day. Competitors can copy an interface in a weekend. They cannot copy a continuously-reconciled, living model of your company.

Agents are a commodity. The living context is the asset — and it's the only part that compounds.

Language models get better at reasoning as they scale. This is empirically undeniable and almost completely unexplained.

Scaling laws describe that it happens. Interpretability work describes which circuits appear. But there's no satisfying answer to the question underneath: why does training a system to predict the next word on a pile of internet text produce something that can solve frontier mathematics? Here's the account I find most convincing.

CLAIM 1

Language is a relational graph, shaped by selection

Take Kolchinsky & Wolpert's semantic information again — meaning as the information a system needs to persist — and point it at language. The words and structures that survive aren't random; they're the residue of a long selection process. Every word that exists is a distinction someone needed often enough to name. So the corpus of human language carries, implicitly, a vast graph of concept-to-concept relationships, and that graph isn't arbitrary — it was shaped by reality, because the people who produced it were.

Some links are strong and obvious (cat/animal, hot/cold). Others are weak and rare — the connection between thermodynamic entropy and information entropy, say. Borrowing the sociologist Mark Granovetter's term, these are the weak bridges: sparse links between distant regions of the graph. Rare, but structurally critical — they're what make non-obvious inference possible.

CLAIM 2

Scaling resolves the graph at higher fidelity

At low scale a model only learns the strong, common links — a blurry map, major landmarks only. At higher scale the weak bridges resolve: minor roads, then footpaths. The model can traverse paths that were always there but invisible at lower resolution. And the links become context-sensitive — A relates to B differently depending on what else is present. That's the thing that separates reasoning from lookup.

CLAIM 3

Capabilities emerge when critical edges resolve at once

This explains why capabilities seem to appear suddenly. Per-token accuracy climbs smoothly, but a real task requires traversing a chain of weak bridges in sequence — and if any single link is still below resolution, the whole inference fails. When scale crosses the threshold where every link in the path resolves, the capability snaps into existence. Smooth underneath, discontinuous on the surface, because the task needs all the edges at once.

CLAIM 4

Chain-of-thought builds the surface reasoning happens on

Each token a model emits materialises a concept into its working context. Before it's said, the concept is latent in the weights, reachable only via the right path; once it's said, it's a concrete vector the next computation can build off. Each token a model lays down changes which tokens become possible next — the way each stone in a vault changes what the next stone can bear. Words the bricks, grammar the mortar, reasoning the cathedral that only stands once the whole thing is in place.

So "think step by step" doesn't teach a model to reason — it gives it permission to lay down the surface it needs to reason on. A model can't reason about what it hasn't yet said.

CLAIM 5

Truth-tracking comes from the corpus's selection history

Here's the part that should stop you. The model has never touched the world — no eyes, no hands, no experiment — and it converges on truth anyway. How? Because the structure it learned wasn't designed. It was selected — calibrated against reality by billions of people across thousands of years, the same way natural selection calibrates a body to its environment. Language is the fossil record of every distinction that ever helped someone survive. Train on it deeply enough and you inherit the map.

It even predicts the failures: a model confabulates most confidently exactly where the corpus is internally coherent but reality-detached — pseudoscience, folk etymology, superseded theory. The map is dense there, so it moves fluently. The paths just don't track truth, because the selection pressure was social, not empirical.

The ideas Calyx is built on

Living systems

Reasoning

Why agents and companies are built this way