Research

Verified evidence.

Every claim on the homepage is backed by a reproducible experiment. Raw data, methodology, and on-chain proofs below.

Featured experiments

April 2026 · Base mainnet

Crypto agents · 1,083 transactions · 12 hours

81.9% control·99.9% Helix·195 reverts prevented·100% on-chain

Paired A/B test on Base mainnet (chain ID 8453). Every failure scenario sent to both arms at the same block. Control = blind retry on revert. Helix = PCEC pipeline with Repair Graph lookup. Result: 195 reverts prevented over a 12-hour window. Every tx hash is verifiable on BaseScan.

April 2026 · 5 frontier models

EVM revert classification · 10 failure modes

GPT-4o-mini 50%·GPT-4o 80%·Claude 4.5 Sonnet 90%·GPT-5.4-mini 90%·GPT-5.4 90%·Helix (PCEC) 100%

Ten production revert messages from Base and Ethereum mainnet, classified by failure cause. All five frontier models failed on the same case: a bare execution reverted with no reason string. Helix converges to 100% because the Repair Graph remembers — model capability is not the ceiling, accumulated repair data is.

What’s next

We’re writing up two further experiments: (a) Web2 microservices in autonomous resolution (91% across 4 production services, zero LLM calls); (b) the PCEC architecture as it scales past 500 patterns in the public Repair Graph. Expected Q3 2026.

All raw data and methodology are public. We do not believe in unreproducible claims.

Back to home