A new foundation model — the brain.
Shifts what's possible. New ceiling on reasoning, coding, context.
latestClaude Fable 5Anthropic
What just shipped in AI that's actually worth knowing about.
A new foundation model — the brain.
Shifts what's possible. New ceiling on reasoning, coding, context.
latestClaude Fable 5Anthropic
A consumer app or feature.
How non-builders actually feel AI in their day.
watchingnone in feed yet — when one ships, it lands here
A developer API, SDK, or capability.
What you can build with this week — the new primitives.
latestChatGPT memory upgradeOpenAI
Hardware, platforms, the backbone.
Who runs what at what price — the rails everyone shares.
watchingnone in feed yet — when one ships, it lands here
Most launches in this feed come from the same handful of labs. Here's who they are, where they're based, and what they're actually known for — so the org chips below stop being acronym soup.
Set the pace for the modern wave — ChatGPT made conversational AI a household tool overnight.
GPT-5 · ChatGPT · Sora · Realtime API
Founded by ex-OpenAI safety researchers; Claude is the model coding tools quietly standardize on.
Claude Sonnet · Claude Opus · MCP · Skills
DeepMind + Brain merged into the Gemini team; the only lab with the chips, the data, and the products.
Gemini 3 · AI Studio · Vertex · Veo
Open-weights leader in the West — Llama set the floor every other open lab is measured against.
Llama · Movie Gen · Ray-Ban Display
Musk's lab. Massive Memphis training cluster, edgier brand voice, integrated tightly with X.
Grok · Colossus supercluster
Europe's flagship lab — punches above its weight on small, efficient, mostly open-weights models.
Mistral Large · Mixtral · Codestral
Sparked the 2025 cost-collapse — frontier-class reasoning at a fraction of the price, weights and all.
DeepSeek-R1 · V3 · open weights
Qwen team ships open-weights models at every size with ruthless cadence — the workhorse of Chinese AI.
Qwen 3 · Qwen-VL · Qwen-Coder
Sells the picks and shovels. Every other lab on this list trains on its chips — H100, B200, GB200, and counting.
Blackwell · CUDA · NeMo · Nemotron
Plenty of others ship too — Cursor, GitHub, Cloudflare, Midjourney, Hugging Face, Runway, ByteDance, MiniMax, Z.ai. They show up in the chips below as their launches land.
Most launches are real. Some are framing. These are the six tells that separate "actually ships" from "press release with a roadmap" — handy when a headline beats the feed below to your inbox.
watch forCharts that only show the benchmarks they win. Comparisons against last year's model, not this week's.
what it meansReal on a narrow slice — often weaker on the harder evals (GPQA, ARC-AGI, Humanity's Last Exam) that didn't make the slide.
watch for"Rolling out over the coming weeks." Waitlists. A blog post but no API page, no pricing, no model ID.
what it meansMarketing landed today; the product might land later. Treat the date as when it became real to journalists, not to you.
watch forHeadlines built on "10× cheaper" or "3× faster" with no matching jump on capability evals.
what it meansBig win if you're spending on inference. Not a new ceiling — same model territory at a better price.
watch forHighlight reels, narrator voiceover, "hand-picked examples," only logged-out demos.
what it meansStage version ≠ API version. The cherry-picked scene is the ceiling, not the median run you'll get.
watch for"Llama community license," commercial caps ("700M monthly users"), custom acceptable-use clauses.
what it meansFree to download. Not free to ship. Read the license before betting a product on a model.
watch forPerformance claims with no methodology. "Internal evals" footnotes. No system card on launch day.
what it meansIf they didn't publish the eval setup, the numbers are vibes. Wait a week for independent runs (Artificial Analysis, LMSYS).
None of these mean a launch is fake — just that the spin is doing work. The feed below tries to flag the spin in the summary when it spots it.
Anthropic — Anthropic's biggest jump in a while — Fable 5 is the first 'Mythos-class' model made safe for general use, a new tier above the Opus line with state-of-the-art coding, knowledge work, and vision (it plays Pokémon from pixels alone) plus long-context handling across millions of tokens; live today on claude.ai, Claude Code, the API ($10/M in, $50/M out — half the price of Mythos Preview), and Bedrock, and it's included free on Pro and Max plans through June 22, so you can kick the tires before the credits meter starts
read more →NVIDIA — NVIDIA dropped its Nemotron 3 family of fully-open models, and the headline for solo builders is the Nano: a 4B model with GGUF weights on Hugging Face that runs right on your laptop, no cloud bill; the lineup scales up to a 550B Ultra reasoning model with a 1M-token context that went live day-zero on OpenRouter, Together, and Perplexity, and NVIDIA shipped the training data and recipes too under a permissive open license, so you can actually fine-tune instead of just downloading
OpenAI — ChatGPT's memory grew up — a new 'dreaming' architecture quietly revises what it knows about you in the background as your life changes ('you're going to Singapore in July' becomes 'you went'), so recommendations stay current instead of stale; paid users get double the memory capacity with summaries you can review and edit, and a 5x compute cut means real memory is finally rolling out to Free users over the coming weeks — your assistant stops forgetting who you are
Microsoft — Microsoft's first homegrown coding model lands inside GitHub Copilot for individual users — turn on auto-pick or grab it from the model picker in VS Code, no setup needed; it beats Claude Haiku 4.5 on coding benchmarks while burning up to 60% fewer tokens, and dials reasoning up or down per request so simple edits stay snappy; one of seven new MAI models (plus a reasoning model and a voice cloner) Microsoft shipped at Build to cut its OpenAI bill and pass the savings to devs
Microsoft — Microsoft open-sourced a way to test whether your AI agent actually behaves — write your rules in plain English ('never give medical advice', 'always cite a source') and ASSERT turns them into scored test cases, generating tricky scenarios and grading the output; MIT-licensed and framework-agnostic, it plugs into LangChain, CrewAI, OpenAI's Agents SDK and 100+ models via LiteLLM, so a solo dev shipping an agent gets real evals instead of vibes
MiniMax — open-weight frontier coding model with a 1M-token context and native image/video input — it can even operate a desktop; on SWE-Bench Pro it edges past GPT-5.5 and Gemini 3.1 Pro and lands just behind Opus 4.7, while the API runs $0.60/M in and $2.40/M out (cents on the dollar next to the closed labs); weights and the technical report hit Hugging Face within days, so you can self-host the whole thing
TwelveLabs — video-understanding startup TwelveLabs takes its tech straight to creators — describe the shot you want in plain English and Rodeo searches your entire footage library, finds the clips, and assembles a rough cut for you; powered by their Marengo 3.0 + Pegasus 1.5 models, it turns 'I know the clip is in here somewhere' into an instant edit instead of an afternoon of scrubbing timelines
A hand-curated log of AI launches that actually move the field — new flagship models, agentic products, developer APIs, and infrastructure shifts. No press releases. No minor patches. No prerelease beta noise.