#policy | Lilith AI

Radar · 2026-06-16

Model welfare is moving from philosophy into product risk

Zvi Mowshowitz uses Fable and Mythos as a case study for why model welfare cannot be separated from capabilities, alignment and user experience. Even where the topic remains speculative, it is becoming a practical question of evaluations and safety interventions for frontier labs.

Radar · 2026-06-15

Claude Opus 4.8 sells judgment, not just another benchmark

Anthropic released Claude Opus 4.8 at the same standard price as Opus 4.7, with a focus on coding, agentic tasks and longer work. The more important shift is that the model is meant to say so more often when it is unsure.

Radar · 2026-06-15

Trump AI order creates a 30-day window for frontier models

The White House issued an executive order that calls for a classified benchmark for covered frontier models within 60 days and a voluntary framework for up to 30 days of pre-release government access. The order says this is not licensing, but it creates a pressure point before launch.

Radar · 2026-06-09

Agent cost is no longer a footnote. It is an engineering expense

Simon Willison shows how he manually added pricing for Claude Fable 5 in AgentsView and immediately saw the cost of local coding agents by project. The small trick points to a bigger shift: AI coding is starting to look like infrastructure consumption, not an app subscription.

Radar · 2026-06-04

Zvi’s AI week shows why one grand narrative is not enough

Zvi Mowshowitz's AI #171 offers no single clean trend but a signal map: Claude Opus 4.8, US frontier model testing, OpenAI's policy blueprint and PAC politics.

Radar · 2026-06-01

Opus 4.8 shows that behavior tuning is not a checklist of fixes

Zvi Mowshowitz reads Opus 4.8 through model welfare and argues that attempts to fix honesty, sycophancy and preference shaping can create new problems elsewhere. For teams deploying models, the reminder is that alignment is not a checklist.

Radar · 2026-04-28

OpenAI layers ChatGPT safety from model to abuse detection, but the numbers are missing

OpenAI outlines its layered approach to ChatGPT community safety: model safeguards, abuse detection, policy enforcement, and collaboration with external safety experts.

Radar · 2025-10-29

OpenAI opens policy-based content classification with open-weight safeguard models

OpenAI released gpt-oss-safeguard-120b and gpt-oss-safeguard-20b: open-weight reasoning models where the content classification policy is not baked into the weights but supplied at runtime. Organizations bring their own rules; the model reasons over them.
