Tag
#policy
From Radar
Radar · 2026-06-16
Model welfare is moving from philosophy into product risk
Zvi Mowshowitz uses Fable and Mythos as a case study for why model welfare cannot be separated from capabilities, alignment and user experience. Even where the topic remains speculative, it is becoming a practical question of evaluations and safety interventions for frontier labs.
Read →Radar · 2026-06-15
Claude Opus 4.8 sells judgment, not just another benchmark
Anthropic released Claude Opus 4.8 at the same standard price as Opus 4.7, with a focus on coding, agentic tasks and longer work. The more important shift is a model that is supposed to say more often when it is unsure.
Read →Radar · 2026-06-15
Trump AI order creates a 30 day window for frontier models
The White House issued an executive order that calls for a classified benchmark for covered frontier models within 60 days and a voluntary framework for up to 30 days of pre-release government access. It says this is not licensing, but it creates a pressure point before launch.
Read →Radar · 2026-06-09
Agent cost is no longer a footnote. It is an engineering expense
Simon Willison shows how he manually added pricing for Claude Fable 5 in AgentsView and immediately saw the cost of local coding agents by project. The small trick points to a bigger shift: AI coding is starting to look like infrastructure consumption, not an app subscription.
Read →Radar · 2026-06-04
Zvi’s AI week shows why one grand narrative is not enough
Zvi Mowshowitz's AI #171 is not one clean trend, but a signal map: Claude Opus 4.8, US frontier model testing, OpenAI's policy blueprint and PAC politics.
Read →Radar · 2026-06-01
Opus 4.8 shows that behavior tuning is not a checklist of fixes
Zvi Mowshowitz reads Opus 4.8 through model welfare and argues that attempts to fix honesty, sycophancy and preference shaping can create new problems elsewhere. For teams deploying models, the reminder is that alignment is not a checklist.
Read →Radar · 2026-04-28
OpenAI layers ChatGPT safety from model to abuse detection, but the numbers are missing
OpenAI outlines its layered approach to ChatGPT community safety: model safeguards, abuse detection, policy enforcement, and collaboration with external safety experts.
Read →Radar · 2025-10-29
OpenAI opens policy-based content classification with open-weight safeguard models
OpenAI released gpt-oss-safeguard-120b and 20b: open-weight reasoning models where content classification policy is not baked into the weights but supplied at runtime. Organizations bring their own rules; the model reasons over them.
Read →