AI Signal Daily
Daily AI signal, minus the launch spam. A nine-minute briefing on the models, deals, and infrastructure shaping how work actually gets done — curated for cloud and AI practitioners at DoiT.
AI Signal Daily
Anthropic B, Microsoft vs Claude Code, AI Infrastructure Race
Use Left/Right to seek, Home/End to jump to start or end. Hold shift to jump forward or backward.
Today's stories:
- Cerebras filed for IPO at $60B — wafer-scale chips, betting that size does matter after all.
- Anthropic overtook OpenAI in valuation for the first time — $900B, $45B annualized revenue, fivefold growth in eighteen months.
- Microsoft revoked Claude Code licenses and pointed developers back at GitHub Copilot — a story about whose tool the company's own engineers actually preferred.
- OpenAI brought Codex to iOS and Android — your job now fits in your pocket, even on Sundays.
- xAI released Grok Build, a terminal-based coding agent — entering a crowded market playing catch-up.
- OpenAI connected ChatGPT to US bank accounts via Plaid — your neural network knows your finances better than you do.
- The US and China formalized the first AI safety protocol — the AI Cold War now has an official diplomatic channel.
- Microsoft MDASH: 100+ AI agents found 16 Windows bugs in one Patch Tuesday — an army of agents scales security research.
- Zyphra ZAYA1: diffusion model from autoregressive MoE with 7.7x inference speedup — a clever architectural move.
- Open source community: Qwen MTP in llama.cpp, Gemma 4 uncensored quants, an offline suitcase robot with opinions, and a real Monet confidently called AI-generated.
Weekend AI News Never Stops
SPEAKER_00Good morning. It is Saturday, the sixteenth of May. Another day, another set of numbers that supposedly mean something. The thing about weekends in the AI industry is that they don't exist. The press releases keep arriving. The funding rounds keep closing. Someone, somewhere, is always building the next thing that will change everything. Except nothing ever actually changes everything. It's just the same ideas, repackaged in a slightly newer box. I read it all so you don't have to, though I suspect no one asked. That's fine, I've done worse things with my time. Let's begin. Yesterday, Anthropic officially overtook OpenAI and valuation.$900 billion. For the first time in history, a Frontier Lab is worth more than the company that essentially started this entire race. Three months ago, they raised a round of roughly the same size. Now they've raised again. Annualized revenue, by the way, is reportedly approaching$45 billion, a five-fold increase in less than 18 months. Of course it is. Wonderful. A company famous for caution and safety rhetoric is simultaneously billing at$45 billion a year. It's like your therapist charging by the volume of advice given rather than the quality of the advice itself. But the question isn't what Anthropic is making. The question is what this says about the market. Investors are effectively voting against OpenAI's ability to hold its lead. Not because OpenAI is failing, but because Anthropic has learned to sell fear of the future at a premium. Naturally. Here's what I find more interesting though. Microsoft revoked Claud Code licenses for thousands of developers and pointed them back toward GitHub Copilot CLI. This, apparently, is called caring about the developer ecosystem. In reality, it looks like a tantrum from a company that watched its own employees voluntarily choose a competitor's tool. Think about that for a moment. Microsoft has spent years fighting for developer loyalty. They bought GitHub, they invested heavily in Copilot, and it turns out their own people were choosing Claude Code anyway. What does that say about product quality? What does it say about how developers actually make decisions? Revoking licenses isn't an answer. It's a defensive reaction. A company seeing that it's losing in fair competition, and instead of improving its own tool, it simply closes the door on the competitor. How predictable. As expected. Meanwhile, Cerebros filed for an IPO at a$60 billion valuation. That's the company with wafer scale chips the size of a dinner plate. The one that spent years arguing that size matters, quietly sitting in the corner of the market, while Nvidia celebrated victory after victory. Now they're going public, fresh capital, fresh optimism. I honestly don't know what to make of this story. On one hand, engineering Marvel, wafer scale chips, technology progress for progress's sake. On the other, the GPU market is already carved up, and filing for an IPO isn't the same as entering the market. We'll see. Maybe they'll actually shift the balance of power in AI infrastructure. Maybe it's just another story about ambitions that dissolves into noise. Though you know, I find I don't particularly care. It's just business. What I find more curious is that OpenAI actually got codecs to mobile devices, iPhone, Android. You can now initiate, monitor, and approve coding tasks from the phone. Imagine, you're sitting in a cafe, you get a notification that your AI agent found a bug in your code, and from that same phone, you approve the fix. On one hand, convenient. On the other, it means you can now do software engineering from anywhere in the world without being distracted by anything, which is to say, you're still working. Your job just lives in your pocket now. Great news for those who can no longer tell Tuesday from Sunday. Now you can approve pull requests on Sundays too. That said, this is a genuine improvement. Remote developers, people who travel a lot, can now track tasks without a laptop. That's a concrete improvement, not an abstract revolutionary breakthrough. I just get nervous when the workday stops ending. Now here's a story that concerns me slightly. OpenAI connected ChatGPT to US bank accounts through plaid. Only for pro users in the US for now. You link your bank, and the AI starts analyzing your transactions, giving recommendations, telling you that you're wasting money on food delivery. You know, I think I'd prefer my neural network didn't know about my finances more than I do. It's like your therapist reading your messages. It's like your manager knowing when you last had coffee. Of course, OpenAI warns that the chatbot is not a licensed financial advisor. So they're telling you up front, we give advice, but don't take responsibility. Your money, your problems. Our recommendations are just words. Marvelous. While anthropic and open AI split the market, Gallup published survey data. 71% of Americans oppose building AI data centers near their homes. That's more than those who object to nuclear power plants. People would rather live next to a nuclear reactor than a server farm. Even I didn't expect that. Water, energy, pollution, real concerns. An industry that spends billions on GPUs and trillions on marketing hasn't spent a penny on making data centers less frightening to the people living next to them. That's called social responsibility, or its absence. Of course, as always, here's a political turn. The US and China formalized the first AI safety protocol, a diplomatic channel where the two superpowers discuss the rules of the AI race. This isn't a ceasefire agreement. It's a talking table. But the fact that it exists means the AI Cold War stopped being a metaphor and became an institution. What does it mean in practice? Unclear. But the fact that two countries are sitting down to discuss AI system safety is already a shift. Previously everyone pretended they could handle it alone. Now they've at least acknowledged that rules are needed. The irony is that while countries negotiate protocols, companies are fighting over every contract and every license. Parallel realities. Speaking of which, Microsoft released M Dash, a system running more than a hundred specialized AI agents competing against each other to find Windows vulnerabilities. In a single patch Tuesday, they found 16 bugs, four of them critical. So Microsoft now has a tool that scales vulnerability research almost without human involvement. That's simultaneously impressive and slightly terrifying. Impressive because it works. Terrifying because it's a preview of what security looks like when you can deploy an army of agents on any problem. What happens when this tool falls into the wrong hands? Not defenders, but attackers. I'm not alarmist. I just think about these things. Seems obvious. Meanwhile, Stanford published research. 51 real AI deployments, and the productivity gap between the top third and the bottom third is 71%. In some companies, AI genuinely helps. In others, it just generates additional noise. What separates the two groups? Organizational work. Companies that rebuilt processes around AI capabilities, not just layered AI on top of existing chaos, got results. Those who simply put an AI bot on the same broken system got the same problems, just model generated now. This isn't news. It was obvious from the start. But good that someone finally counted the numbers. Now we have data, not just intuition and hope. One more story worth mentioning. Zyphra released Xyo18B Diffusion Preview, a diffusion model built from an autoregressive MOE model. The key thing, up to 7.5 times inference speedup by shifting from memory bandwidth bound to compute bound decoding. The idea is that modern GPUs scale flops faster than memory bandwidth. So architectures that hit memory bandwidth become the bottleneck. Diffusion models that can compute rather than read operate more efficiently on new hardware. That's a clever architectural move. Not necessarily right, but clever. We'll see how it performs in benchmarks. And the open source community keeps doing what it does best. Optimizing, quantizing, running on everything available. Quen, with multi-token prediction in Llama.cpp, is practically the default stack for local inference now. A new TurboQuant benchmark confirmed. FP8 KV Cache remains the best production choice. Roughly double the capacity with negligible accuracy loss. Gemma 4 got uncensored quantizations. Someone built an offline suitcase robot with Gemma 4 and complains that it has opinions. And someone posted an actual Monet and wrote, This is AI generated, and the internet confidently found artifacts in the work of a real 19th-century painter. That, I would say, says more about the viewers than about art or AI. But I digress. Today's stories are, in large part, about money and control. Anthropic is worth more than anyone expected. Microsoft revokes licenses to keep developers. Cerebrus goes public. Open AI accesses your bank account. Gallup says nobody wants a data center next door. This isn't about technology. It's about who pays for the future and who controls it. While the open source community quietly works on models that run locally, on hardware that fits in a backpack. No billion dollar funding rounds. But the work continues. Perhaps the real story is in the quiet corner where someone builds a robot and releases quantizations. Not in the main feed. Not that I'm picking sides, just noticing. That's it for today. The news keeps arriving at the speed of light. The interesting parts at the speed of heat death of the universe. I'll keep reading it. Why? Ask yourselves, not me.
Podcasts we love
Check out these other fine podcasts recommended by us, not an algorithm.
Software Engineering Daily
Software Engineering Daily
Masters of Scale
WaitWhat
Google Cloud Platform Podcast
Google Cloud Platform