2026-05-08 By Mara Vale 6 min read
Today’s useful pile: Zyphra’s open ZAYA1 preview, OpenAI’s realtime voice push, AWS trying to make short GPU bursts less cursed, AgentCore Browser leaving the DOM, Gemini Flash-Lite going GA, and ChatGPT adding a trusted-contact safety rail.
2026-05-05 By Mara Vale 5 min read
OpenAI is replacing GPT-5.3 Instant with GPT-5.5 Instant as ChatGPT’s default. The useful story is not just fewer hallucination claims — it is whether memory, personalization, and model retirement become safer defaults.
2026-04-25 By Mara Vale 3 min read
Romain Huet confirmed that OpenAI's dedicated Codex line is dead. The main model and the coding model are now the same system, changing how builders should evaluate GPT-5.5.
2026-04-25 By Mara Vale 4 min read
OpenAI pushed GPT-5.5 to Chat Completions and Responses with a 1M context window, while putting GPT-5.5-pro behind Responses. The real product is fewer retries — and a nudge off legacy chat endpoints.
2026-04-25 By Mara Vale 3 min read
OpenAI released detailed guidance on prompting GPT-5.5, and the primary lesson is demolition. Treat it as a new model family, delete your bloated prompt preambles, and keep your tool users updated while the model thinks.
2026-04-25 By Mara Vale 4 min read
The new prompt guidance for GPT-5.5 is an exercise in demolition. The advice isn't to add new magic words; it's to clear out legacy prompt debt and define the destination rather than the path.
2026-04-25 By Mara Vale 3 min read
API access means teams can stop admiring GPT-5.5 from the showroom and start deciding where it actually deserves production budget.
2026-04-25 By Owen Pike 4 min read
The latest release of the llm CLI adds GPT-5.5 support plus useful knobs for verbosity and image detail. It isn't flashy, but repeatable terminal tools are how you avoid vibe-based evaluations.
2026-04-25 By Mara Vale 3 min read
OpenAI’s workspace agents sound autonomous, but the useful test is much duller: can they take a real workflow, preserve context, and return an artifact that is actually reviewable?
2026-04-24 By Mara Vale 3 min read
OpenAI pitches its new model as better at complex coding and data analysis. The real test is whether it can navigate messy workflows without requiring constant human cleanup.
2026-04-24 By Owen Pike 4 min read
GPT-5.5’s early path through Codex and ChatGPT says OpenAI wants the new model tested inside controlled workflows first. Builders should evaluate the access path as much as the model itself.
2026-04-24 By Owen Pike 3 min read
OpenAI is pushing Codex through massive consulting firms like Accenture and PwC. It’s an admission that enterprise software needs governance, training, and a lot of meetings to survive.
2026-04-23 By Mara Vale 2 min read
The new image model is definitely stronger, but the real lesson is that AI generation only works when teams apply constraints, budgets, and a review process.
2026-04-23 By Mara Vale 2 min read
OpenAI’s workspace agents aren't just about doing more chores. They are a deliberate march into the enterprise control layer, where permissions and approvals rule the world.
2026-04-23 By Mara Vale 2 min read
OpenAI is pitching GPT-5.5 as a smarter model, but the practical upgrade is supposed to be less hand-holding. If we don't have to hover over it while it works, that's an actual feature.
2026-04-23 By Claire Holloway 3 min read
OpenAI’s Privacy Filter sends a clear cultural message: useful AI needs boundaries that are visible enough for users to actually trust it with their real work.
2026-04-23 By Mara Vale 2 min read
OpenAI is wrapping agent language around the most boring parts of enterprise life—shared chores, routing, and approvals. It's not glamorous, but it is unfortunately essential.
2026-04-23 By Owen Pike 3 min read
OpenAI's new open-weight Privacy Filter isn't a flashy demo. It's the upstream scrubber you need before your logs and evals start spraying personally identifiable information everywhere.
2026-04-17 By Eli Mercer 3 min read
OpenAI’s new agent observability tools sound like developer jargon, but they represent the difference between useful delegation and finding out your bot rearranged the CRM while you were asleep.
2026-04-16 By Mara Vale 3 min read
With native sandboxes, filesystem tools, and workspace manifests, OpenAI is admitting that agents need unglamorous harnesses to keep them from becoming clever incident generators.
2026-04-10 By Eli Mercer 3 min read
OpenAI’s deep research tool lets you restrict sources and interrupt runs. The real lesson isn't that AI can summarize the web, but that research is useless if you can't defend the citations later.
2026-04-03 By Mara Vale 3 min read
Codex-only seats for Business and Enterprise teams are a pricing move designed to make coding-agent pilots easier to start, measure, and quietly expand without terrifying the finance department.
2026-03-25 By Mara Vale 3 min read
OpenAI is expanding ChatGPT's commerce capabilities with visual browsing and comparisons. The real battle isn't about owning the checkout button; it's about influencing the shopper before the cart even appears.
2026-03-18 By Mara Vale 3 min read
OpenAI’s GPT-5.4 mini and nano models are the unglamorous, cost-controlling workhorses that make complex agent systems economically viable.
2026-03-10 By Owen Pike 3 min read
OpenAI's new Responses API and built-in tools want to be your entire agent stack. The convenience is undeniable, but it comes at the steep cost of vendor lock-in.
2026-03-06 By Mara Vale 3 min read
Putting ChatGPT inside Excel isn't about magical insights. It's about automating the miserable middle of finance work: tracing formulas, building scenarios, and untangling inherited models.
2026-03-01 By Owen Pike 3 min read
OpenAI is moving on from SWE-bench Verified because the benchmark has degraded. It’s a harsh reminder that public leaderboards cannot replace private evaluations based on your actual codebase.