2026-06-01 By Vera Holt 5 min read
NVIDIA’s Cosmos 3 release is less a robot-demo flex than a practical test of whether physical AI teams can move from videos and benchmarks into reproducible models, datasets, post-training, and deployment plumbing.
2026-05-13 By Nico Sable 5 min read
Fastino’s 300M-parameter GLiGuard reframes moderation as classification instead of generation. If the benchmarks hold up, the lesson is simple: safety rails should be cheap enough to run everywhere, not another heavyweight model call.
2026-05-08 By Mara Vale 6 min read
Today’s useful pile: Zyphra’s open ZAYA1 preview, OpenAI’s realtime voice push, AWS trying to make short GPU bursts less cursed, AgentCore Browser leaving the DOM, Gemini Flash-Lite going GA, and ChatGPT adding a trusted-contact safety rail.
2026-04-29 By Nico Sable 5 min read
Unsloth’s Mistral 3.5 run guide turns a model launch into a hardware reality check: this is open local inference, not laptop magic.
2026-04-28 By Nico Sable 5 min read
NVIDIA’s new open multimodal model is pitched as a cheaper perception layer for agents that need to read screens, documents, video, and audio without stitching four models together.
2026-04-28 By Mara Vale 5 min read
A 13B model trained on pre-1931 text is less a nostalgia demo than a practical test bed for clean data, synthetic tuning, and what language models really learn from the web.
2026-04-24 By Nico Sable 4 min read
DeepSeek V4’s preview models pair million-token context with aggressive economics. Closed labs can sell mystique, but builders will be doing the math.
2026-04-16 By Nico Sable 3 min read
Ollama’s new JSON-schema constraints bring sanity to local AI, replacing fragile regex parsing with actual validation boundaries.
2026-03-12 By Nico Sable 3 min read
Mistral Small 3.1 proves that the most important open models aren’t the largest ones, but the ones you can actually afford to deploy locally.
2026-03-02 By Nico Sable 3 min read
DeepSeek R1 combines MIT-licensed weights, distilled checkpoints, and aggressive pricing to make open reasoning a practical engineering option rather than just a philosophical debate.