Live research dashboard

Current experiments, and where each stands.

Mission-control for the investigation: the question each experiment is testing, its metrics, and the state of every arm. No marketing, only the state of the work. An arm that has not been run says so, because an uncontrolled result is not a result.

Status key Running Signal Completed Cooling Failed

EXP-05 Signal 03 Jun 2026

Online weights, coupled to the body

Under testDoes letting the brain's weights slide online (small adapter updates during a life) change the body's outcomes, beyond a frozen brain and beyond no brain?

12 Dolphin n

11/12, 12/12 Final-energy wins

p ≈ 0.003, 0.0002 Sign tests

10/9/9 Alive (close, not survival)

n=3, p = 0.125 Qwen

Arms

Ran Online adapter updates (Dolphin) 432 updates, 420 changed-weight forwards; final energy ahead, mean +10 vs frozen
Ran Frozen-brain control matched, no updates
Ran No-brain control matched
Ran Qwen replication (independent family) preregistered, different family; online ahead in 3/3 on final energy, but p = 0.125

EXP-04 Completed 01 Jun 2026

The load-bearing brain

Under testWith the model demoted to a bounded brain that can only nudge the body, does it actually do anything, or could noise replace it?

4 / 4 Controls beaten

2 Model families

5 Runs per arm

p ≈ 0.006 Significance

Arms

Ran Trained model (Qwen) beat all four controls across five runs
Ran Trained model (Dolphin, on Llama) reproduced independently: beat valid-random 5/5 (p ≈ 0.006)
Ran No brain control: dies on the perturbation
Ran Invalid-signal brain control: does the body just need a signal? No
Ran Randomized-weights brain control: same architecture, no learning
Ran Valid-random brain control: valid moves at random, the sharpest test

EXP-01 Cooling 29 May 2026

Self-maintenance closure

Under testCan the organism close a self-maintenance loop on its own, with no copyable instruction in the cue: notice a decayed self-belief and act to restore it?

1 Lives

~38 Ticks

1 / 2 Control arms run

0 Closed cycles

Withdrawn Confidence

Arms

Ran Affordance enabled, aggressive decay closed-loop credit withdrawn on review (a scorer artifact); the closure smoke test now scores 0, blocked by an engineering launch bug
Ran Affordance disabled matched a trivial repeated-probe latch the scorer cannot tell from closure
Not run No-decay baseline pre-registered control

EXP-02 Completed 25 May 2026

Held-cache attractor and controlled discontinuity

Under testIs the held-cache behavioural lock a structural effect, and does breaking continuity break it while a text-only cue does not?

3 Arms

n=3 / arm Replicates

200 ticks Run length

0/3 vs 3/3 Result

Arms

Ran Controlled discontinuity 0/3 locked, entropy 1.23
Ran No-reset control 3/3 locked, entropy 0.00
Ran SENSE-only confound control 3/3 locked; text cue alone does not break it

EXP-03 Completed 25 May 2026

Cache-restore determinism canary

Under testDoes a saved cache restore token-identically across a fresh process? A standing release gate, re-run each cycle.

8/8 Tokens

20/20 Windows

~70 MiB Cache

0 Temperature

Arms

Ran Create-process vs restore-process 8/8 token ids identical; ok = true

Two headline results now stand: a frozen model's prior is load-bearing for survival, and online weight updates change body-world outcomes beyond it, early and cross-model.

Not: neither is life. The load-bearing result is one task; the online-weights result is only directional across families (Qwen n=3, p = 0.125) and shows no viability, homeostasis, or biological adaptation. Zero life-properties demonstrated.

How a completed experiment becomes a finding →