Anthropic unveils Project Glasswing — Claude Mythos already found "thousands" of zero-days in major software
Anthropic launched Project Glasswing on April 7 alongside AWS, Apple, Cisco, Google and Microsoft: a closed program distributing a restricted preview of Claude Mythos, a frontier model Anthropic says has already identified thousands of high-severity zero-day vulnerabilities across every major OS and browser. Mythos chains multiple low-severity bugs into single high-impact exploits (sometimes combining 3–5). Access is limited to ~50 partner orgs; Anthropic says a public release would be too risky. The program is backed by $100M in Claude credits and $4M in open-source security donations. It sets the template for "AI that is too dangerous to ship".
If Mythos really is finding zero-days at the claimed scale, the offense-defense balance in software security shifts materially within months. The coalition of defenders (AWS/Apple/Cisco/Google/Microsoft) getting restricted access essentially ratifies a new category of "controlled-access AI" — and creates pressure for similar restrictions on OpenAI/Google/Meta cyber models. Bigger governance question: if a Claude-tier model can weaponize chained vulnerabilities at scale, is Anthropic's "too dangerous to ship" bar the new industry norm, or an exception?
First-party Anthropic announcement with partner confirmations from named Fortune-10 companies, plus independent coverage from NPR, TechCrunch, VentureBeat and Fortune. The "thousands of zero-days" claim is self-reported and unverifiable without access to the model; treat it as Anthropic's characterization, not a third-party finding. FUD risk moderate: Anthropic has a strong vendor incentive to hype both the capability and the consequence framing.
@hardmaru (David Ha) flagged a paper adapting Sora-style video-diffusion architectures to build a learned world model of an actual Linux desktop. The model ingests 9,000 hours of screen recordings plus keyboard/mouse traces and learns to predict next-frame UI state conditioned on user input, effectively a probabilistic operating-system simulator. On a held-out eval of 50 common tasks (opening files, running commands, navigating web UIs), the model achieves 73% next-event accuracy at 2-second horizons and 41% at 30-second horizons, beating the prior SOTA (Meta AI Habitat-UI) by 18pp. The direct application: train agents in fully simulated computer environments without real-system rollouts, cutting RL data costs by roughly 40x and eliminating the safety risk of letting agents touch production systems during training (a minimal sketch of the conditioning interface follows).
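A minimal sketch of action-conditioned next-frame prediction, assuming a PyTorch-style setup. The paper reportedly uses a Sora-style video-diffusion backbone; this deterministic predictor only illustrates the conditioning interface (current frame plus an encoded keyboard/mouse event in, predicted next frame out), and every name, shape and hyperparameter below is an illustrative assumption rather than the paper's architecture:

```python
import torch
import torch.nn as nn

class NextFramePredictor(nn.Module):
    """Toy stand-in for the paper's world model: frame_t + input event -> frame_t+1."""
    def __init__(self, action_dim: int = 8, hidden: int = 64):
        super().__init__()
        # Encode the current screen frame (3xHxW) into a spatial feature map.
        self.encoder = nn.Sequential(
            nn.Conv2d(3, hidden, 4, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(hidden, hidden, 4, stride=2, padding=1), nn.ReLU(),
        )
        # Project the keyboard/mouse event vector so it broadcasts over the map.
        self.action_proj = nn.Linear(action_dim, hidden)
        # Decode back to a predicted next frame.
        self.decoder = nn.Sequential(
            nn.ConvTranspose2d(hidden, hidden, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(hidden, 3, 4, stride=2, padding=1),
        )

    def forward(self, frame: torch.Tensor, action: torch.Tensor) -> torch.Tensor:
        h = self.encoder(frame)                             # (B, hidden, H/4, W/4)
        h = h + self.action_proj(action)[:, :, None, None]  # additive conditioning on the event
        return self.decoder(h)                              # predicted next frame

# One training step on a (frame_t, action_t, frame_t+1) triple, as would be
# mined from the screen-recording + input-trace corpus. Random tensors stand
# in for real data here.
model = NextFramePredictor()
opt = torch.optim.AdamW(model.parameters(), lr=3e-4)
frames = torch.randn(4, 3, 64, 64)       # screen captures at time t
actions = torch.randn(4, 8)              # encoded key/mouse events at time t
next_frames = torch.randn(4, 3, 64, 64)  # screen captures at time t+1

loss = nn.functional.mse_loss(model(frames, actions), next_frames)
loss.backward()
opt.step()
```

The point of the rollout-free training claim is that once a predictor like this is good enough, an RL agent's "environment step" becomes another forward pass of the world model, with no real OS in the loop.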
EE Times deep-dive on AMD's ROCm 7.0 and whether it can finally dent NVIDIA's CUDA moat. AMD's MI400 (96GB HBM4, 5.2 PFLOPS FP8) now runs PyTorch, vLLM and SGLang out-of-the-box (a quick compatibility check is sketched below), but reviewers testing MLPerf Inference v5.1 still measure 1.6–2.2x gaps vs the H200 on representative LLM workloads, driven by kernel-library maturity rather than raw silicon. The standout development of the cycle: AMD hiring 600 CUDA-kernel engineers in 12 months, plus open-sourcing HIPify tooling that auto-translates 83% of typical CUDA kernels. AMD claims Meta, Microsoft and OpenAI are all now shipping production MI400 pods. NVIDIA's response: CUDA 13 with tensor-core autotuning targeting the same eval suite, launching in Q2.
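The "runs PyTorch out-of-the-box" claim rests on ROCm exposing AMD GPUs through PyTorch's existing torch.cuda namespace, so most model code needs no source changes. A quick check of which backend a given PyTorch build is using, relying only on documented attributes (torch.version.hip is set on ROCm builds and None otherwise):

```python
import torch

# ROCm builds of PyTorch report a HIP version; CUDA builds report a CUDA version.
if torch.version.hip is not None:
    print(f"ROCm/HIP build: {torch.version.hip}")
elif torch.version.cuda is not None:
    print(f"CUDA build: {torch.version.cuda}")

# Under ROCm, AMD accelerators still show up through the torch.cuda namespace.
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))       # e.g. an MI-series part
    x = torch.randn(1024, 1024, device="cuda")
    y = x @ x                                  # dispatched to rocBLAS on ROCm builds
```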
Anthropic announced an advisor strategy on the Claude Platform: pair Opus 4.6 as a planning/critique advisor with Sonnet 4.6 or Haiku 4.5 as the executing model. The advisor inspects partial outputs, suggests corrections and redirects the executor mid-generation. On SWE-bench Multilingual, Sonnet with an Opus advisor scores 2.7 percentage points higher than Sonnet alone, at roughly 1.3x the cost of Sonnet alone, versus the ~7x cost of running Opus end-to-end. Generally available today via the Claude Console and CLI; pricing is existing Claude API rates for both models (no advisor premium). Anthropic positions this as the first first-class multi-model inference primitive in any frontier-lab API: not just routing or cascading, but explicit advisor/executor roles with shared context. A hand-rolled approximation of the pattern is sketched below.
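The item doesn't give the primitive's actual parameters, so this sketch emulates the advisor/executor flow with the standard Anthropic Messages API. The SDK calls are real; the model ID strings are assumptions, and the announced primitive reportedly redirects mid-generation rather than using a coarse draft/critique/revise loop like this one:

```python
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

EXECUTOR = "claude-sonnet-4-6"  # assumed ID for Sonnet 4.6
ADVISOR = "claude-opus-4-6"     # assumed ID for Opus 4.6

def advised_completion(task: str) -> str:
    # 1. Executor produces a first draft.
    draft = client.messages.create(
        model=EXECUTOR,
        max_tokens=1024,
        messages=[{"role": "user", "content": task}],
    ).content[0].text

    # 2. Advisor critiques the draft against the original task.
    critique = client.messages.create(
        model=ADVISOR,
        max_tokens=512,
        messages=[{
            "role": "user",
            "content": f"Task:\n{task}\n\nDraft answer:\n{draft}\n\n"
                       "List concrete corrections and redirections, briefly.",
        }],
    ).content[0].text

    # 3. Executor revises with the advisor's feedback in context.
    return client.messages.create(
        model=EXECUTOR,
        max_tokens=1024,
        messages=[{
            "role": "user",
            "content": f"Task:\n{task}\n\nYour draft:\n{draft}\n\n"
                       f"Advisor feedback:\n{critique}\n\nRevise the draft accordingly.",
        }],
    ).content[0].text
```

At existing per-token rates, the premium of a loop like this scales with how terse the advisor's critiques are relative to the executor's drafts, which would be consistent with the ~1.3x figure.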