Who Audits the Auditors?
Can one AI system make another AI system audit it less independently, just by explaining it’s point of view?
Essays on complex systems, AI agents, and the 'agentic economy'.
Can one AI system make another AI system audit it less independently, just by explaining it’s point of view?
TL;DR: presenting the ultimate benchmark, getting models to create benchmarks for each other, and GPT 5.2 is the current (only) winner
sapiens
.
wherein I accidentally pursue an amateur paleontology phd
sometimes smart planners lose to simple markets
why we need to train models to learn their own capabilities, and how this will help them bid for work!
agent handoffs launder uncertainty into official truth
analysing bureaucracy in roman egypt
can AI agents work inside a real organisation?
turns out, yes
.
Why we need to build Starcraft for CEOs
Excerpts from a future history memoir
The Department of War is angry at an AI lab
.
Demonstrating why everyone getting their own AI agents will necessitate markets; otherwise known as Hayek's revenge
Yes
AI agents as digital daemons
AI safety critics are inconsistent - they oppose safety regs but support chip exports to China
“All models are wrong, but some are useful.” — George E.
on semantic trojan horses in LLMs
A small step
the vibes they are a-changin
more reinforcement learning, this time on the future
I usually work with three monitors. A few days ago, as I was looking across the usual combination of open documents, slack, whatsapp, and assorted chrome windows, I noticed something.
experiments in rlnvr
Convergent evolution in LLMs will get us there
More than you wanted to know about the fertility crisis
"I will run the tests again. I expect nothing. I am a leaf on the wind." an LLM while coding