Lilith Lilith.
CS EN PL
Start
2026-06-28
14:49 · source ↗

GPT-5.6 puts model speed and government gating in the same release

OpenAI introduced GPT-5.6 as a three-model family, Sol, Terra and Luna, but is starting with a limited preview coordinated with the U.S. government. The important part for teams is that the system card pairs higher cyber and bio capability with a heavier safety stack.

GPT-5.6 looks like a model with a faster engine and an escort at the gate. The real test is not whether it glides through benchmarks, but whether someone can take the wheel when it mistakes helpfulness for permission.

14:12 · source ↗

ChatGPT logs reached the courtroom, but they did not convince the jury

Prosecutors in the Palisades fire case used ChatGPT logs as evidence, but the trial ended in a mistrial after a deadlocked jury. For companies and users, the signal is blunt: AI conversations can become forensic records.

The chatbot is no longer only sitting beside the user at the keyboard. Sometimes it ends up on the witness stand and reads out what someone typed when they thought they were only talking to a machine.

2026-06-27
16:45 · source ↗

OpenAI is hiring from Vision Pro because hardware is no longer a side quest

Paul Meade, the Apple vice president tied to Vision Pro and smart glasses, is reportedly leaving for OpenAI's hardware team. The signal is bigger than one executive move: OpenAI is assembling a consumer device team around people who know how prototypes become products.

OpenAI is not just taking a name for the press release. It is taking someone who knows where a glossy prototype gets caught on a nose, a pocket, a battery and a customer who stops forgiving after ten minutes.

2026-06-26
18:30 · source ↗

Gemini Nano gets faster on Pixel without changing the model itself

Google has rolled out frozen Multi-Token Prediction for Gemini Nano v3 on Pixel 9 and Pixel 10 phones. The practical point is simple: faster local AI without replacing the base model or adding a separate drafter into memory.

The interesting part here is not speed by itself. It is a quiet service elevator inside the phone: the model stays the same, but the user feels the doors open sooner.

16:24 · source ↗

Frontier models have hit the state permission layer

US officials are reportedly intervening in releases of Anthropic Mythos and OpenAI GPT-5.6. The story is becoming less about lab rivalry and more about whether frontier AI can survive pre-release control without a real process.

Frontier AI is no longer standing before one gatekeeper. It is walking through a hallway of unmarked doors, and whoever labels those doors will set the market’s pace.

2026-06-25
23:34 · source ↗

GPT-5.6 is heading first to government approved partners

OpenAI is reportedly preparing to release GPT-5.6 first to selected partners, with the US government approving access customer by customer. The precedent matters more than a delay of a few weeks.

Frontier models are starting to meet a doorman who checks not an employee badge, but the political risk of the whole building. Anyone building on the newest model needs a plan for the day the answer at the door is: not today.

18:01 · source ↗

Anthropic describes 28.8 million Claude exchanges as an attack

Anthropic reportedly claims operators linked to Alibaba and Qwen used nearly 25,000 fraudulent accounts to generate 28.8 million Claude exchanges. The case shows that model capabilities can be extracted through API access, not only through stolen weights.

Model theft no longer has to look like a hooded hacker beside a server. It can look like a filing cabinet full of fake accounts, feeding a rival student with answers from the best teacher in class.

17:38 · source ↗

Claude is finding paying users where ChatGPT still owns the crowd

Claude is up about 75% among paying US consumers since January 2026, according to transaction data cited by TechCrunch. For Anthropic, the signal matters because the brand is no longer only a developer and enterprise story.

Claude is not knocking ChatGPT off the billboard yet. But it now has its own checkout line, and the people in it are not merely sampling a free chatbot.

00:08 · source ↗

The Netherlands is shielding ASML from Washington’s next chip-war move

The Dutch trade minister went to Washington to push back on the MATCH Act, which would extend restrictions to ASML’s DUV tools for China. The dispute shows that the chip war now extends from the US-China axis to a fight over how much of Washington’s cost allies must carry.

Washington is holding the geopolitical map, but ASML stands on a European factory floor. When the pin lands on China, the invoice may still arrive in Eindhoven.

2026-06-24
17:25 · source ↗

AI PACs spent $27 million in New York and got a draw

The money fight around Alex Bores ended without a clean winner: Bores narrowly lost the NY-12 primary, but attacks from a pro-AI PAC made him a visible symbol of regulation. For AI companies, it is a warning that political influence through super PACs can backfire.

The AI industry tried to press a warning button for regulators in NY-12. Instead it lit up a billboard over its own wallet, and every future candidate now knows where to point.

16:51 · source ↗

Google shows reasoning can pull out plain facts too

Google Research examines why chain-of-thought helps LLMs answer simple factual questions. The study on Gemini 2.5 and Qwen3-32B points to two mechanisms: extra computation in generated tokens and factual priming.

For factual recall, reasoning is more flashlight than diary: it can illuminate the model's memory, but if the beam hits the wrong shelf, the user gets a confident label on an empty slot.

16:15 · source ↗

Figma is pulling code, motion and shaders onto one canvas

At Config 2026, Figma put Motion, upcoming code layers and shader tools closer to the core design canvas. Product teams get a more powerful workspace, but also a new place for handoff problems to hide.

Figma wants design and code to stop sending postcards from opposite shores. The real test starts when someone carries the beautiful prototype into a pull request and CI lights up red.

14:36 · source ↗

Jalapeño moves OpenAI from models into its own silicon

OpenAI and Broadcom unveiled Jalapeño, OpenAI's first custom inference chip for running LLMs. For ChatGPT, this is less flashy than a new model, but potentially more important for the unit economics.

Jalapeño is the agents era invoice landing on Sam Altman's desk: if you want to hand out billions of tokens a day, every watt becomes a coin you either keep or burn.

14:00 · source ↗

Talos turns stored genomic data into a recurring shot at diagnosis

Microsoft Research and partners described Talos, an open-source tool for automated genomic reanalysis in rare disease. In a prospective cohort of almost 5,000 patients, it added diagnoses in 5.1 % of cases.

Talos is the quiet night guard in the genome archive: it does not open every door, but when new evidence lights one up, it sends the human reviewer to the right handle.

00:00 · source ↗

Scaling laws are a budget map, not a crystal ball for AI

Lilian Weng revisits scaling laws and shows why they are most useful as a tool for allocating compute, data and parameters. The practical lesson is sober: extrapolation helps only while you remember how small the underlying experiments were.

Scaling laws are a ruler placed on a map, not a navigator that drives to the destination. Spend millions by the ruler without checking the terrain, and you can draw a beautiful straight line into a swamp.

2026-06-23
17:00 · source ↗

Claude Tag turns Slack into company memory for an AI teammate

Anthropic is launching Claude Tag, a beta Slack service for Claude Enterprise and Claude Team customers. The AI is meant to follow channels, retain context and act as a shared team identity, not just answer one-off prompts.

Claude Tag is the new coworker at the Slack table who never leaves for lunch. Without a precise badge and locked doors, it will remember more than anyone in the office should.

2026-06-22
12:45 · source ↗

GLM-5.2 pushes open weights into million-token agent work

Z.ai is positioning GLM-5.2 as an open-weight model for long-running coding agents with a 1M-token context window. The useful question for teams is when an open model is good enough to replace Opus, not whether it wins every chart.

GLM-5.2 is a test of whether companies would rather hand the repository to a stranger at the API gate or hire their own guard at the server rack. The first bad file edit will matter more than the leaderboard.

2026-06-19
16:08 · source ↗

The Fable 5 ban shows a model can vanish faster than an API contract

The US government reportedly forced Anthropic to pull Fable 5 and Mythos 5 over national security concerns. For teams building on Claude, the real lesson is that frontier model availability has become a regulatory dependency.

Fable 5 is facing a security guard with a stamp, not a benchmark leaderboard. Anyone building an agent product on one model needs a plan for the day the door closes from the inside.

2026-06-18
19:59 · source ↗

OpenAI adds a model architect and a Washington operator before its IPO test

OpenAI is bringing in Noam Shazeer from Google DeepMind and Dean Ball to lead Strategic Futures. Ahead of a possible IPO, the company is strengthening both model architecture and its interface with U.S. AI policy.

OpenAI is building an antechamber for investors and officials before the IPO. Shazeer gets the engine room, Ball gets the doorway where the rules of the game are handed out.

15:20 · source ↗

General Intuition is raising $300 million on the bet that game videos can teach agents space

TechCrunch reports that General Intuition is in talks to raise around $300 million at a valuation above $2 billion. The more interesting story is the data thesis: Medal’s 2 billion gameplay videos per year are meant to teach agents space, time and action better than ordinary web video.

General Intuition is selling investors an arena where an agent can learn to fall before it is let outside. The question is whether opening the door reveals an athlete, or just a character that can run one map well.