Insight / signal

AI memory is where agents become useful. It is also where they become dangerous.

Published01/06/2026

FormatInsight

A forgetful chatbot is annoying. An AI agent that remembers the wrong thing can become a recurring business problem.

The AI story this week is not another model.

It is memory.

That sounds less exciting. It is not. Memory is the difference between a chatbot you have to brief every morning and an agent that can keep working from yesterday’s notes, last week’s client decision, the current campaign plan, the messy CRM context and the things you told it not to do again.

That is where agents start becoming useful in a business.

It is also where they start becoming a proper risk.

A forgetful chatbot is annoying. You correct it, swear at it, paste the context again, and move on with your life. An agent with persistent memory is different. If it remembers the wrong thing, that wrong thing can keep coming back. It can shape future drafts. It can influence a recommendation. It can pollute a client brief. It can sit inside the system looking harmless until the agent uses it as fact.

That is not a theoretical problem anymore.

In the last couple of days, the agent world has been quietly circling the same issue from different angles.

The Hermes chatter is moving towards Obsidian-backed memory: use a vault as a cheap, persistent brain for the agent so it can keep goals, notes, tasks and business context across sessions. That is exactly the kind of thing I like. It is practical. It is cheap. It uses files you can inspect. It avoids stuffing everything into a bloated prompt and hoping the model reads the right bit.

At the same time, Hermes Tool Search is pointing at another part of the same problem. Instead of loading every tool upfront and wasting context, the agent searches for the right tool when it needs it. Less bloat. Better retrieval. More operational discipline.

Then OWASP turns up with Agent Memory Guard, which is much less fluffy. Their line is blunt: modern agents persist memory across sessions, and anything that writes into that memory becomes a privileged input. They are treating memory poisoning as its own threat surface, not just a variation of prompt injection.

That matters.

Prompt injection is usually framed as something a user or web page says to the model right now. Memory poisoning is nastier because it can survive. A bad instruction, leaked secret, false fact or attacker-planted note can get stored, retrieved later, and quietly treated as trusted context.

OWASP’s README gives a very human version of the problem: long-running agents can re-ingest their own previous output, elaborate it slightly, write it back, and then treat the new version as established fact on the next pass. After a few loops, the agent has created its own folklore.

Anyone who has worked inside a company knows this pattern. Bad assumptions become process. Nobody remembers where they came from. Six months later everyone is defending them because “that is how we do it”.

Now give that habit to software.

This is why businesses need to be much more serious about AI memory than they currently are.

Most people still talk about it like it is a convenience feature.

“Great, the agent remembers me.”

Fine. But what exactly does it remember? Who wrote it? Was it from a human, a tool, a scraped page, an email, a transcript, another agent, or the model’s own guess? Is it still true? Is it allowed to influence external actions? Can the agent overwrite it? Can anyone see the change history? Can you roll it back?

These are boring questions. They are also the questions that decide whether the system is useful or a liability.

The same pattern is showing up in bigger infrastructure. NVIDIA and Microsoft are now talking about Windows machines built for personal AI agents, with local large models, huge context and new security primitives. Strip out the launch language and the direction is clear: agents are moving closer to the machine, closer to the files, closer to the user’s working context.

That will make them more useful.

It will also make sloppy memory much more expensive.

OpenAI’s recent output points in the same direction from the enterprise side. Alongside Codex customer stories, they published a playbook for trustworthy third-party evaluations. The interesting bit is not the corporate gloss. It is that serious AI adoption is starting to need evidence, evaluation and governance at the same time as agents are getting better at doing actual work.

That is the useful tension.

Businesses want agents that remember. Regulators, security teams and sensible operators want to know why the agent believes what it believes.

Those are not opposing goals. They are the same product requirement.

If an AI agent is going to be part of a commercial workflow, memory needs to have receipts.

Not vibes. Receipts.

Source. Date. Owner. Confidence. Permissions. Expiry. Change history. Approval status. A way to quarantine suspect memories. A way to delete things that should not have been stored. A way to roll back after the system gets polluted.

For a marketing team, this is not abstract security theatre.

Imagine an agent that helps with client content. It remembers the client’s tone, products, proof points, compliance rules, offers and banned claims. Brilliant. That could save hours every week.

Now imagine it also remembers an unverified performance claim from a brainstorm. Or a one-off discount that expired. Or a private internal frustration from a meeting transcript. Or a competitor comparison nobody approved. If that memory gets reused in a proposal, landing page or LinkedIn post, the mistake no longer looks like an AI hallucination. It looks like your business being careless.

Same in sales. An agent that remembers buying signals, objection history and follow-up preferences is valuable. An agent that remembers a rumour as fact, stores sensitive data in the wrong place, or keeps using stale CRM context is dangerous in a very boring, very real way.

This is where the post-agency opportunity sits.

The lazy offer is “we build AI agents”.

The better offer is “we build supervised AI workflows with controlled memory”.

That means taking a repeated commercial process and designing the memory layer properly:

what the agent is allowed to know
where that knowledge comes from
what can be written automatically
what needs human approval
what expires
what gets logged
what gets reviewed
what can trigger external action
what can never leave the system

There is nothing glamorous about this. Good.

The glamour is where people get stupid.

The value is in the dull machinery underneath: clean vaults, labelled sources, approval gates, narrow permissions, retrieval rules, memory hygiene, audit trails, and people with enough judgement to say “do not automate that yet”.

My practical view is simple.

Do not give an agent permanent memory until you know how memory is created, trusted, updated and removed.

Start small. Give it one workflow. Give it a narrow vault or knowledge base. Label the sources. Keep human approval on anything external. Review what it writes back. Watch for drift. Delete bad memories quickly. Keep a change log. Test what happens when the memory is wrong.

That last bit matters. Everyone tests the happy path. Hardly anyone tests the poisoned path.

But business systems are built in the poisoned path.

Messy inputs. Half-true notes. Old decisions. Weird client exceptions. Someone pasting something stupid into a shared doc. A transcript that misheard a product name. A sales note that sounded certain but was actually a guess.

That is the real world these agents are about to inherit.

So yes, I want agents with memory. I want them reading the vault, remembering client preferences, carrying context across days, spotting open loops and reducing the amount of repeated briefing humans have to do.

But I do not want magical memory.

I want memory with receipts.

Because the next stage of AI adoption is not just about what agents can do. It is about what they are allowed to remember, what they are allowed to trust, and who is responsible when the memory is wrong.

That is where the serious work starts.

And, predictably, it is not the bit that gets the shiny demo.