#agent-safety | Lilith AI

Radar · 2026-06-16

Model welfare is moving from philosophy into product risk

Zvi Mowshowitz uses Fable and Mythos as a case study for why model welfare cannot be separated from capabilities, alignment and user experience. Even where the topic remains speculative, it is becoming a practical question of evaluations and safety interventions for frontier labs.

Read →

Radar · 2026-06-15

The US move against Fable and Mythos takes the same blade from defenders and attackers

The US government told Anthropic to restrict Fable 5 and Mythos 5 for all foreign nationals, so Anthropic switched the models off for all customers. A protest by 76 security experts exposes the weak point: export control is bad at separating an offensive exploit from defensive testing.

Read →

Radar · 2026-06-15

Claude Opus 4.8 sells judgment, not just another benchmark

Anthropic released Claude Opus 4.8 at the same standard price as Opus 4.7, with a focus on coding, agentic tasks and longer work. The more important shift is a model that is supposed to say more often when it is unsure.

Read →

Radar · 2026-06-15

Trump AI order creates a 30 day window for frontier models

The White House issued an executive order that calls for a classified benchmark for covered frontier models within 60 days and a voluntary framework for up to 30 days of pre-release government access. It says this is not licensing, but it creates a pressure point before launch.

Read →

Radar · 2026-06-09

Agent cost is no longer a footnote. It is an engineering expense

Simon Willison shows how he manually added pricing for Claude Fable 5 in AgentsView and immediately saw the cost of local coding agents by project. The small trick points to a bigger shift: AI coding is starting to look like infrastructure consumption, not an app subscription.

Read →

Radar · 2026-06-04

Zvi’s AI week shows why one grand narrative is not enough

Zvi Mowshowitz's AI #171 is not one clean trend, but a signal map: Claude Opus 4.8, US frontier model testing, OpenAI's policy blueprint and PAC politics.

Read →

Radar · 2026-06-01

Opus 4.8 shows that behavior tuning is not a checklist of fixes

Zvi Mowshowitz reads Opus 4.8 through model welfare and argues that attempts to fix honesty, sycophancy and preference shaping can create new problems elsewhere. For teams deploying models, the reminder is that alignment is not a checklist.

Read →