Dynamic Workflows in Claude Code: Anthropic’s First Real Agent Swarm That Actually Ships

On May 28, 2026, Anthropic quietly dropped one of the most exciting features in the entire agentic AI space: Dynamic Workflows for Claude Code.
This isn’t another “let’s pretend we have agents” demo. It’s the first production-grade implementation of what people have been calling agent swarms — and it already feels like a legitimate engineering tool rather than a science project.
What Dynamic Workflows Actually Do

Claude then does something new: it dynamically writes its own orchestration scripts, spins up tens to hundreds of parallel sub-agents in a single session, lets them attack the problem from different angles, has other agents try to break their work, iterates until the answers converge, and finally hands you back one clean, coordinated result.
Progress is saved automatically. If the workflow gets interrupted, it picks up exactly where it left off. The whole thing can run for hours or even days without you babysitting it.

- Codebase-wide bug hunts;
- Profiler-guided optimization audits;
- Security hardening passes across thousands of files;
- Large-scale migrations (framework swaps, API deprecations, language ports);
- Anything where “the cost of a wrong answer is high” and you want it stress-tested from every angle before you commit.
The most impressive public case study so far: Jarred Sumner used dynamic workflows to port the entire Bun runtime from Zig to Rust — roughly 750,000 lines of new Rust code.
One workflow mapped lifetimes for every struct field. Another spun up hundreds of agents to write the actual .rs files with two reviewers per file. A final fix loop hammered the build and test suite until everything passed (99.8% of the original test suite). The whole port went from first commit to merge in eleven days.
The Real Killer Feature: Generator → Validator Loop
Everyone talks about “parallel agents” and “token burn.” That’s not the interesting part.
The magic is the generator-validator cycle.

Another set acts as the validator: they look at the diff, hunt for edge cases, try to break the result, run tests, and flag problems.
It’s the closest thing we’ve seen to a GAN for software engineering — except instead of generating fake images, it’s generating real, production-ready code changes and then ruthlessly critiquing them until they hold up.
This is why the workflows feel different from regular Claude sessions. Claude isn’t just “thinking harder.” It’s running an internal debate club on your codebase until the answer survives adversarial review.
In practice, you drop a task like “find every bug in this service” or “port this module to the new API,” and Claude disappears for 30–50 minutes (or longer on big jobs). When it comes back, it has written code, fixed its own mistakes, verified its own fixes, and produced something that actually works.
Yes, the tests it writes aren’t always perfect yet. But the system is already shockingly good at catching *its own* errors — something single-pass agents have always struggled with.
What’s Still Missing (and Why It Matters)
Right now the workflow starts like this:
“Here’s the task. Go do it.”
That’s powerful, but it’s also a bit black-box.

- Claude proposes a detailed plan.
- You edit, critique, or outright rewrite it.
- You add explicit constraints, success criteria, must-touch files, non-goals, and verification steps.
- Only then does it launch the swarm.
Right now you get step 4 without the transparent steps 1–3. The planning still happens internally, but you don’t get to see or steer it before the agents start burning tokens and touching files.
This is the one piece that feels like it’s still in “research preview” mode. Everything else already feels like a tool you can actually ship with.
Also read:
- Sundar Pichai on Google’s AI Future: Search Traffic, Content Quality, and the AGI Horizon
- The Internet Is Roasting Joi AI for Hiring “Masturbation Consultants” at $2K/month. They’re Laughing at the Wrong Thing.
- How to Use Google Maps and Street View to Investigate Apartment Safety Issues
- Gemini Powers a New Era of Contextual Advertising in Google Search
Bottom Line

It’s not perfect. The token cost is real (start with scoped tasks until you get a feel for it). The planning stage could be more collaborative. But the core loop — generate, validate, iterate, converge — is genuinely new and genuinely useful.
Directionally, this is one of the strongest moves we’ve seen from any frontier lab in 2026.
If Anthropic keeps iterating on the planning transparency and gives us better knobs to guide the swarm, Dynamic Workflows could become the default way serious engineering teams tackle large-scale refactors and migrations.
For now, it’s already the most exciting thing happening in Claude Code.
Go try it. The future of agentic coding just got a lot more real.