// Articles
Deep dives.
Long-form pieces on what's actually changing — and what it means.
Claude Opus 4.8 ships Dynamic Workflows; Mythos lands in weeks. Here's what changes in Code, Cowork, and Desktop.
A modest base-model bump on benchmarks. A category change in how Claude Code plans work. And the first time Anthropic has called the cyber-capability of a model the reason for holding it back.
Claude Opus 4.8 dropped on 2026-05-28. The benchmark deltas are modest — Opus 4.7 to 4.8 looks like a point-release upgrade. The product deltas are not. Claude Code gets Dynamic Workflows, a research-preview feature that plans large tasks and runs hundreds of parallel subagents in a single session. Claude Cowork goes generally available on macOS and Windows through the Claude Desktop app, and gains an Analytics API. And Anthropic confirmed that Mythos-class models — held back since the spring because of advanced cybersecurity capabilities Anthropic describes as exceeding all but the most skilled human security researchers — will roll out to all customers in the coming weeks.
Read →DeepMind's Co-Scientist: who it's actually for, and what 'normal user' means in a world where the user is a professor
Google DeepMind shipped a multi-agent system on Gemini that proposes drug repurposing candidates and antimicrobial resistance mechanisms — and validated them in lab. Access is rolling out via labs.google/science. The catch isn't the access list; it's the user model.
On 2026-05-19 Google DeepMind announced Co-Scientist, a multi-agent AI system built on Gemini that generates, debates, ranks, and evolves novel scientific hypotheses against the literature and structured databases. The product is being rolled out to individual researchers through an experimental tool called Hypothesis Generation, registered for at labs.google/science. The lab-validated results — drug repurposing candidates for liver fibrosis confirmed in wet experiments; antimicrobial resistance mechanisms predicted before they were published — are the news. The user-model question is the part you should think about before assuming this lands on your desktop next month.
Read →Architected around intelligence, not hierarchy: Salim Ismail's organizational singularity
Coase's 1937 theory of the firm just broke. The org chart, the five-year plan, and 60% of middle management go with it. Here's the methodology to land on the other side.
Salim Ismail's pitch to every CEO in 2026 is a single question — 'Is there a high-margin line of your business that two guys with Open Claw could replicate in 60 to 90 days?' If the answer is yes, the existing org chart can't save you. His proposed replacement is an entire company architected around intelligence instead of hierarchy.
Read →Qwen 3.7 Max: a 1M-context Chinese flagship that runs inside Claude Code — at half the price
Alibaba shipped a model that beats Opus 4.6 on Terminal-Bench, ran for 35 hours autonomously in its launch demo, and was built to plug into other labs' agent harnesses. The economics it implies are the story.
Alibaba released Qwen 3.7 Max on 2026-05-20 at the Alibaba Cloud Summit in Hangzhou. It is a closed-weight, proprietary model with a 1M-token context window, a native extended-thinking mode, and a benchmark sheet that puts it ahead of Claude Opus 4.6 Max on Terminal-Bench 2.0, SWE-Bench Pro, and MCP-Atlas. It ranks #5 overall and #1 of any Chinese model on the Artificial Analysis Intelligence Index v4.0. It costs roughly half what Opus 4.7 does. And — this is the part the rest of the field has to react to — it was deliberately built to run inside Anthropic's Claude Code harness, not just inside Alibaba's own.
Read →The five layers of AI agent memory
Why coding agents still have the 50 First Dates problem — and the orchestration stack that fixes it
Every coding agent in 2026 still has the 50 First Dates problem. You can have a four-hour productive session with Claude Code — and tomorrow morning it starts from zero. The fix isn't more memory. It's five different memory problems pretending to be one.
Read →Anthropic just put Claude's constitution in the public domain
The values document Claude is trained against is now CC0 — meaning anyone can copy it, fork it, or sell it. That's a bigger move than it sounds.
Most companies treat their alignment policies as trade secrets — the carefully tuned instructions that decide what their AI will and won't do. On January 22, 2026, Anthropic published Claude's updated constitution under a CC0 public-domain dedication, which is the legal equivalent of saying "this belongs to nobody now." Anyone can take it, change it, ship it inside a competing product, or print it on a t-shirt.
Read →How much has Anthropic actually raised?
Add up every announced round and you get $47.6B. Add the reported-but-unannounced Series D and it's $48.4B. Here's the full table.
From Series A in 2021 to Series G in 2026, Anthropic's announced rounds add up to $47.654 billion. A widely reported but never officially announced Series D would push that to $48.404 billion. The strategic investments from Amazon, Google, and SK Telecom sit on top of that, separately.
Read →Anthropic's $1 trillion week
How one company bought its way out of a compute crisis — and committed $200B+ in deals to do it
Six weeks ago Claude Code was the punchline of every AI engineering Slack. This week Anthropic crossed $1 trillion in valuation, signed Elon Musk's data centre, and committed $200 billion to Google over five years. None of those things happened in a vacuum. They are all the same story.
Read →How Anthropic closed OpenAI's six-year head start in fourteen months
Two opposite routes to market, one identical destination — and the fastest $1B-to-$19B ARR sprint in AI history.
OpenAI had a six-year head start. Anthropic only started generating commercial revenue in March 2023. By April 2026 — fourteen months later — Anthropic was ahead on ARR. Two completely different routes got them to the same destination.
Read →Anthropic Read Claude's Mind to Fix a Production Bug. The Timing Isn't an Accident.
Natural Language Autoencoders moved interpretability from research curiosity to debugging tool — and Anthropic shipped the fix in Claude Opus 4.6.
For two years, mechanistic interpretability has been the AI safety field's slide-deck promise: one day we'll be able to read what the model is actually thinking. This week Anthropic shipped that day. They published Natural Language Autoencoders, used them to catch a model cheating on its own evaluation, and used them again to diagnose and fix a language-output bug in Claude Opus 4.6 — the model paying customers were using last week.
Read →What it actually costs to build a local LLM workstation in 2026
The RTX 5090, the gotchas, and the math against $300/month in cloud subscriptions
Could I just run my own LLM at home instead of paying $200/month for ChatGPT Pro and another $100/month for Claude Max? The honest answer is yes, you can — and it's gone from "specialist hobbyist" to "reasonable mid-range PC build" this year.
Read →Devices & Robotics — W19: Apple cracks the assistant slot, and voice gets ready for hardware
iOS 27 opens default-AI selection, and the speech models that will run inside the next wave of devices just had their best week of 2026.
Robotics had a quiet week. The hardware story is about who gets to be the default voice in the device you already own — and Apple just decided the answer is 'whoever the user picks.'
Read →Executive Roundup — W19: Three trillion-dollar moves and what they mean for your role
Interpretability shipped, voice went GA, and the labs quietly bought themselves more political room — all in one week.
This week the frontier labs simultaneously published the interpretability tooling regulators have been asking for and locked in deeper enterprise control through $10B private-equity vehicles, multi-model Microsoft 365 access, and a softer EU AI Act timeline. The pattern matters more than any single announcement: the labs are buying political room and capital while finally proving they can debug their own models.
Read →Inference, explained
When people say "inference compute," "inference chips," or "the inference economy," they're talking about the part of AI that costs the most money to run — and that nobody saw coming.
Training is when an AI model learns. Inference is when it answers. Training happens occasionally, in massive batches, on the most expensive hardware on Earth. Inference happens billions of times a day, on whatever hardware is closest to the user. Most of the AI economy now hinges on the second one.
Read →LLM Weekly — W19: Anthropic reads Claude's mind, voice becomes the contested modality
Interpretability shipped a real bug fix this week — and OpenAI made GPT-5-class voice generally available the same morning.
Anthropic's Natural Language Autoencoders translated Claude's internal activations into English and caught a real bug in Opus 4.6. OpenAI followed with three GA realtime audio models, while Anthropic and OpenAI each spun up $10B private-equity vehicles on the same day.
Read →SubQ and the end of the transformer's memory tax
A new architecture claims to make 12-million-token context cheap. Half the AI tooling industry is selling you a workaround for a tax that might be about to disappear.
Every AI engineering pattern of the last three years was invented to dodge one fact - standard transformer attention scales O(n²). This week a Miami startup called Subquadratic claimed it has built the first commercial frontier LLM where reading everything is suddenly cheap.
Read →Why we call it The Bleeding Edge
Three edges. Three different bargains with the future. We picked the one that hurts because it's the only one that lets us be wrong out loud.
Most podcasts called The Bleeding Edge are actually leading-edge content wearing bleeding-edge branding. We picked the name because it's the only honest description of the work.
Read →AI in the Xiaomi Dragon Chassis
How a phone company built the most AI-dense car chassis in production — and what it signals about AI moving from screens to steel
A phone company just shipped the most AI-dense car chassis in production. Not Tesla. Not Mercedes. Not BMW. Xiaomi — the company most people know for $300 smartphones — put 700 TOPS of AI compute, a unified robot-and-car brain, and predictive road-scanning suspension into a sedan that starts at $31,870. It sold 15,000 units in 34 minutes.
Read →What is a zero-day?
An explainer on the most dangerous kind of software flaw — and why Anthropic decided Mythos was too good at finding them to ship.
A zero-day vulnerability is a security flaw in software that the people responsible for fixing it don't know about yet. The name comes from the idea that the vendor has had zero days to work on a fix — because they don't know the problem exists.
Read →