Here are three things I found interesting in the world of AI in the last week: 1. The "best AI model" era is over - Every.to / Digital Applied comparison OpenAI launched GPT-5.4 on March 5 to the usual amount of noise. Self-described Claude loyalists are excited. Augment Code made it their default model, calling it "a reliable orchestrator" that uses 18-20% fewer tokens on complex tasks. The headline number: 75% on OSWorld, the first frontier model to beat human experts (72.4%) at autonomous...
13 days ago • 3 min read
Instead of a newsletter this week I thought I'd experiment with a longer form email on an idea that I think is worth sharing. Let me know what you think and if you want more / less of this format. In my head I've been calling this the 'single good idea'. One of my favourite questions to ask people is "what is a new thing you've recently done with AI", often followed up by a "what do you want to be able to do next". It's a pretty quick way to find out where their learning edge is. "I have a...
20 days ago • 2 min read
Here are three things I found interesting in the world of AI in the last week: 1. China's robots did kung fu on the biggest TV show on Earth - CNN China's Spring Festival Gala is the most-watched broadcast in the world. 677 million viewers across platforms. This year's star wasn't a singer or a comedian. It was a dozen Unitree G1 humanoid robots doing drunken boxing, nunchucks, and backflips off trampolines three metres in the air. A larger H2 model appeared in Monkey King armour wielding a...
28 days ago • 4 min read
Here are three things I found interesting in the world of AI this week: Anthropic published the most alarming safety document I've read. Then the safety lead quit. - Anthropic System Card Anthropic released a 150+ page system card for Claude Opus 4.6. It's a pretty candid admission of dangerous capability from a major AI company. It's not that I think this model is unsafe, more that the direction of travel is disturbing. Here are some highlights. The model knew when it was being tested - when...
about 1 month ago • 5 min read
Here are three things I found interesting in the world of AI this week: Moltbook shows what happens when AI agents get their own social network - Fortune OpenClaw (formerly Clawdbot) hit 155,000 GitHub stars in weeks. The pitch is "Claude with hands," an AI that can actually do stuff on your behalf. Then someone built Moltbook, a Reddit-style forum where only AI agents can post. It now claims 1.5 million registered agents. The reality is messier. Wiz security researcher Gal Nagli accessed...
about 1 month ago • 3 min read
Here are three things I found interesting in the world of AI in the last week: 1. MoltBot is wildly popular and wildly insecure MoltBot (formerly ClawdBot, renamed after Anthropic sent a trademark demand) is an open-source AI agent gateway by Peter Steinberger, the founder of PSPDFKit. It connects Claude, OpenAI, or local models to 13+ messaging platforms including WhatsApp, Telegram, Slack, Discord and iMessage, with full system access: shell execution, file read/write, browser automation,...
about 2 months ago • 4 min read
Hi Reader, Welcome to the new year. I’ve been mulling on what was actually important in 2025 and what was just noise, and I keep coming back to three things. 1. Claude Code created a category, and everyone copied it Terminal-based coding tools weren’t new, and Aider and others had been around for a while, but when Anthropic launched Claude Code in February they combined tool use with thinking models in a way that produced an agent that was actually useful. There were a bunch of projects...
about 2 months ago • 6 min read
Hi Reader, Here are three things I found interesting in the world of AI in the last week: Wikipedia reports AI is killing human traffic - 404 Media Wikipedia just reported an 8% decline in human pageviews compared to last year. The Wikimedia Foundation is blaming AI chatbots and search engines that extract their content without sending traffic back. Almost every major AI model trains on Wikipedia, and now those models are strangling the platform that feeds them. The economics are perverse. AI...
5 months ago • 5 min read
Hi Reader, Here are three things I found interesting in the world of AI in the last week: Anthropic drops Sonnet 4.5 and Claude Code 2.0 - blog post Anthropic just launched Claude Sonnet 4.5, which they’re calling “the best coding model in the world,” alongside Claude Code 2.0 with some genuinely useful features. Sonnet 4.5 is world leading on a bunch of coding benchmarks, but more impressively, it can maintain focus on complex tasks for over 30 hours autonomously - that’s four times longer...
5 months ago • 5 min read