Alibaba open-sources Qwen3.6-35B-A3B: 35B MoE, 3B active params, runs on a laptop, outdraws Claude Opus 4.7
·Alibaba / Qwen
Alibaba's Qwen team releases Qwen3.6-35B-A3B as fully open-source on HuggingFace (Apache license). The model uses a Mixture-of-Experts architecture with 35B total parameters but only 3B active per token — making it runnable on consumer hardware. Simon Willison's post 'Qwen3.6-35B-A3B on my laptop drew me a better pelican than Claude Opus 4.7' lands 404 HN pts and 84 comments, while the original release thread hits 100+ on r/LocalLLaMA. Pitched as 'agentic coding power, now open to all.'
A laptop-runnable 35B model that can compete with a frontier closed model on creative tasks is a significant efficiency milestone. Open-sourcing it under Apache license means the practitioner community can fine-tune, quantize, and deploy without API dependency. The simonw comparison directly benchmarks it against Anthropic's latest Opus, increasing signal that the capability gap between open and closed frontier models is shrinking.
Impact scorecard
7.28/10
Stakes
7.0
Novelty
8.0
Authority
7.0
Coverage
6.0
Concreteness
8.0
Social
7.0
FUD risk
2.0
Coverage9 outlets · 2 tier-1
HuggingFace, HN, Reddit/LocalLLaMA, simonwillison.net, Alibaba_Qwen on X
X / Twitter1,400 mentions @Alibaba_Qwen · 900 likes
Reddit1,100 upvotes r/LocalLLaMA
r/LocalLLaMA, r/MachineLearning
Trust check
high
Official HuggingFace release by Alibaba_Qwen. Apache license confirmed. Cross-verified by simonw benchmark and r/LocalLLaMA community testing.
OpenAI publishes 'Codex for almost everything', a major capability expansion for its Codex coding agent. The post details how Codex can now handle a far broader range of software engineering tasks end-to-end, including autonomous debugging and deployment steps. A companion demo 'Codex Hacked a Samsung TV' shows the agent autonomously reverse-engineering and exploiting a consumer device — drawing 100+ HN points. HN main thread: 874 pts, 449 comments on launch day.
A developer reports a €54,000 unexpected billing spike in just 13 hours after a Firebase browser key without API restrictions was used to make Gemini API requests — presumably by a malicious third party. The Google AI developer forum post goes viral with 386 HN pts and 281 comments. The incident exposes a critical gap in Google's abuse detection and billing caps for Gemini APIs: client-side Firebase keys often have no restrictions by default, and Gemini does not enforce spending caps out of the box.
Anthropic ships Claude Opus 4.7, its most capable Opus model yet. The release centres on long-running agentic tasks: more thinking tokens, an extended thinking mode, and increased API rate limits across all subscriber tiers to match. HN erupts with 1,752 points and 1,257 comments — the biggest AI model thread in weeks. @bcherny: 'Dogfooding Opus 4.7 the last few weeks, I've been feeling incredibly productive.' System card and model card published simultaneously.