Lilith

From Radar

Radar · 2026-06-15

OpenAI wants one rulebook before states write fifty of them

OpenAI published a public policy agenda for AI covering frontier safety, youth protection, education, workforce transition and infrastructure. The real story is not just lobbying. It is an attempt to keep AI rules legible before fragmented regulation turns deployment into paperwork archaeology.

Radar · 2026-06-09

Claude Fable 5 turns safety into a question of access to the best model

Nathan Lambert reads the Claude Fable 5 release as a dispute over who gets to use a frontier model without routing and filters. The important layer is not only model capability, but the governance system that decides when the user is really talking to the strongest model.

Radar · 2026-06-08

OpenAI is packaging AGI as public infrastructure

OpenAI published a plan built around an automated AI researcher, faster economic growth and “personal AGI” for everyone. The important shift is not the promise itself, but the tone: OpenAI is talking less like a product leader and more like a future steward of public infrastructure.

Radar · 2026-05-29

Zvi reads the Claude Opus 4.8 system card as an audit of shifting risk

Zvi Mowshowitz analyzes Claude Opus 4.8 as an incremental upgrade that improves capabilities and safety while raising new questions around evals.

Radar · 2026-05-11

SocialReasoning-Bench: the agent completes the task but fails to improve the user's position

Microsoft Research describes SocialReasoning-Bench, a benchmark testing whether AI agents genuinely act in the user's best interest. Key finding: agents technically complete their tasks but do not consistently improve outcomes for the user, even when explicitly instructed to.

From the Glossary