Lilith Lilith.
CS EN PL
Start

Why the model is not enough

An agent in a demo video receives a task and does something. An agent in production needs much more: identity, permissions, tools, communication channels, memory, sandboxing, audit trails, monitoring and rules for human intervention. Without that, an agent is not a worker. It is a model touching things without proper ID.

This is the boring layer. That is exactly why it matters. Real agent deployment does not break on whether the model can write a nice plan. It breaks on whether the agent can safely get access to email, files, repositories, tickets, payment systems or a robotic body.

Identity is the base layer

An agent needs its own identity or a clearly delegated human identity. It must be visible who performed an action, under which permission, for what reason and from which input. A shared token hidden in an environment variable is a short path to an incident.

An inbox, account, API key, service account or MCP server is not an accessory. It is where the system decides what the agent may do, what it can see and what trace it leaves behind. If an agent sends email, edits a document or opens a pull request, its identity must not disappear under a human account without trace.

Tools extend reach and risk

Tool use gives the agent hands. Infrastructure decides whether those hands have gloves, a lock and a camera above the table. Every connected tool expands system reach: email enables outbound communication, file systems enable artifact changes, databases touch data, robotic bodies move the world.

So a list of tools is not enough. You need scoped permissions, approvals, rate limits, sandboxing, allowlists, logs and fallback. An agent that can call everything is not powerful. It is unauditable.

Memory and telemetry are not decoration

Memory without boundaries is a security problem. Telemetry without privacy is another security problem. A production agent needs to know enough to continue work, but not so much that it becomes an unmanaged warehouse of sensitive data.

Good infrastructure separates working memory, long-term preferences, secrets, logs and eval data. It also shows a human the trace: what the agent saw, what it inferred, which tool it called and why it considered the result done.

What to watch

Watch products that solve boring things: agent inboxes, MCP registries, service accounts, approvals, audit logs, safe sandboxes, private analytics and permission management. A shiny agent without this layer is just an expensive cursor. An agent with good infrastructure starts to become a work process.