Nathan Lambert reads the Claude Fable 5 release as a dispute over who gets to use a frontier model without routing and filters. The important layer is not only model capability, but the governance system that decides when the user is really talking to the strongest model.
Safety policy here acts as a doorman in front of the best model, occasionally deciding that you do not get into the main room.
Simon Willison shows how he manually added pricing for Claude Fable 5 in AgentsView and immediately saw the cost of local coding agents by project. The small trick points to a bigger shift: AI coding is starting to look like infrastructure consumption, not an app subscription.
The interesting part of this TIL is not the custom price. It is the developer finally seeing a receipt next to the diff produced by the agent.
ServiceNow AI published an ASR benchmark for code-switched speech in enterprise scenarios and tested seven systems. The uncomfortable point is simple: in voice agents, transcription errors propagate through the whole workflow, so bilingual speech is not a minor UX detail.
The customer switches languages mid-sentence and the agent quietly sends the ticket to the wrong queue. The benchmark just named the failure that was hiding behind acceptable WER scores in monolingual evals.
Google announced Gemini 3.5 Live Translate for near real-time voice-to-voice translation across more than 70 languages. The practical question is not just translation quality, but latency, voice stability, Meet availability and who carries the risk when a live call is mistranslated.
Live Translate puts an invisible interpreter in the room, speaking a few seconds after you. Beautiful, until noise makes it grab the wrong voice, language or sentence that someone uses to make a decision.
Google is launching Gemini 3.5 Live Translate for near real-time speech-to-speech translation across more than 70 languages. Users will see convenience first, but companies will care about latency, audit and trust in a voice speaking for someone else.
An interpreter seated in the middle of the meeting, and people may trust it before they know when it gets things wrong.
Google introduced Gemma 4 12B as a unified, encoder-free multimodal model designed for high performance directly on a laptop. The practical question is whether a 12B model can deliver enough quality for local or edge use without heavy cloud infrastructure.
Gemma 4 12B is trying to place a multimodal model on the user's lap. Now we find out whether it works there, or just hums like a small server under the monitor.