The Idea
Human memory doesn’t work like a database query. You don’t decide to search. Things surface — triggered by context, shaped by experience, colored by what you now know. A code pattern reminds you of a bug from three years ago. You weren’t looking for it. Your subconscious offered it.
The subconscious memory system gives an LLM agent the same property: a background process that watches what the agent is doing, imagines what past experience might be relevant, checks, and — if something genuinely useful turns up — injects it as a thought before the next action. The agent never knows it exists. It just occasionally gets a thought that feels like remembering.
This is a different approach to agent memory than the dreamer genome’s explicit observation system. Dreamer creatures consciously compress experience into prioritized facts. The subconscious retrieves implicitly — no categories, no prioritization at write time. Everything is just a raw event. Relevance is determined at retrieval time, in context.
How It Works
Three steps, running in the background after tool calls.
Step 1: Wonder
A fast model observes the agent’s recent activity (last 3-4 messages) and generates hypotheses about what past experience might be relevant. Each hypothesis is paired with a grounded search query — because “I wonder if I’ve seen this error before” is useless as a search term. The subconscious translates the hypothesis into something that would literally appear in the event log:
[
{"wonder": "I wonder if I've hit this API error before", "query": "status 429"},
{"wonder": "I wonder what I planned to do next", "query": "next cycle"},
{"wonder": "I wonder if I found a workaround last time", "query": "rate limit"}
]
This matters because hypothesis generation can make lateral leaps that similarity search cannot. “I wonder if I’ve been burned by this pattern before” is not semantically similar to the current task. But it’s exactly the kind of thing a human would subconsciously wonder. And if it hits, it’s high-value context.
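The wonder step's output is just a JSON array, so the consuming code mostly needs to validate it defensively. A minimal sketch (hypothetical shape — the real prompt and model call live in src/subconscious.ts) that parses a fast model's reply and keeps only hypotheses with a concrete, greppable query:

```typescript
// Parse the wonder step's model output into hypothesis/query pairs.
// Entries with a missing or empty query are dropped: a hypothesis the
// subconscious can't ground into a literal search term is useless.
interface Wonder {
  wonder: string; // the hypothesis, in natural language
  query: string;  // a literal string that could appear in events.jsonl
}

function parseWonders(modelOutput: string): Wonder[] {
  let parsed: unknown;
  try {
    parsed = JSON.parse(modelOutput);
  } catch {
    return []; // malformed output: wonder nothing this time
  }
  if (!Array.isArray(parsed)) return [];
  return parsed.filter(
    (w): w is Wonder =>
      typeof w === "object" && w !== null &&
      typeof (w as Wonder).wonder === "string" &&
      typeof (w as Wonder).query === "string" &&
      (w as Wonder).query.trim().length > 0
  );
}
```

Failing open to an empty array matters here: a bad model reply should mean "no wondering this cycle", never a crash in the background process.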
Step 2: Search
Each query runs as rg (ripgrep) against .sys/events.jsonl — the raw event log that the orchestrator writes for every creature. Tool calls, thoughts, sleep/wake events. No embeddings, no vector DB. Just text matching.
Events from the current cycle are excluded — the subconscious only searches past cycles. Each hit is annotated with its age (“3h ago”, “2 cycles ago”) so the next step can make accurate temporal claims.
Most queries return nothing. That’s fine — the agent never knows. No cost, no noise.
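The search step's logic is simple enough to sketch as a pure function. This assumes events have already been parsed out of .sys/events.jsonl (the actual implementation shells out to rg); the two essential behaviors from above are preserved: current-cycle events are excluded, and only a handful of the most recent hits are kept:

```typescript
// Search past events for a literal query string, excluding the current
// cycle. ISO 8601 timestamps compare correctly as strings, so a plain
// lexicographic comparison against the cycle start is enough.
interface Event { t: string; type: string; [k: string]: unknown }

function searchPast(
  events: Event[],
  query: string,
  cycleStart: string, // ISO timestamp of the current cycle's start
  maxHits = 5
): Event[] {
  const q = query.toLowerCase();
  return events
    .filter((e) => e.t < cycleStart) // past cycles only
    .filter((e) => JSON.stringify(e).toLowerCase().includes(q))
    .slice(-maxHits); // keep the most recent matches
}
```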
Step 3: Prepare
Raw search hits are often garbage — partial keyword matches, out of context, technically relevant but not useful. A second fast model reviews the candidates against the agent’s current context and decides: is this actually useful right now? If yes, frame it as a brief, natural thought. If no — and most of the time it’s no — surface nothing.
When it does surface something:
[A thought surfaces: You ran into something like this 3 hours ago —
a failing stop loss on FOGO that you manually cut at -3.3%. The
pattern looks similar.]
The agent can act on this or ignore it. The bar for surfacing is “would this change what the agent does next?” — not “is this vaguely related?”
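One plausible way to wire the prepare step's verdict back into the loop (a hypothetical output contract, not the confirmed one): ask the fast model for `{"surface": false}` or `{"surface": true, "thought": "..."}`, and treat anything else as surfacing nothing:

```typescript
// Turn the prepare step's model output into an injectable thought, or
// null. The default in every failure path is null: when in doubt, the
// subconscious stays silent.
interface PrepareDecision { surface: boolean; thought?: string }

function parsePrepare(modelOutput: string): string | null {
  try {
    const d = JSON.parse(modelOutput) as PrepareDecision;
    if (d && d.surface === true && typeof d.thought === "string" && d.thought.trim()) {
      return `[A thought surfaces: ${d.thought.trim()}]`;
    }
  } catch {
    // unparseable output: surface nothing
  }
  return null;
}
```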
Cost
- Wonder — ~200 input tokens, ~100 output tokens, fast model. Negligible.
- Search — grep. Free.
- Prepare — only runs when search has hits. ~500 input + ~100 output, fast model. Skipped most cycles.
- Per-cycle average — a fraction of a cent. Most cycles cost only the wonder step.
The Wonders Genome
The wonders genome exists to test this architecture in isolation. No dreamer consolidation, no observations, no rules. The subconscious is the only source of long-term continuity. If it works, the agent develops coherent behavior over time. If it doesn’t, the agent is amnesiac. Clean signal.
The conscious agent is deliberately minimal:
- Purpose — PURPOSE.md, same as other genomes
- Conversation resets each cycle — no carry-over between sleep cycles
- Tools — bash, janee, set_sleep. Same capabilities as other genomes
- No observations, no rules, no consolidation, no dreaming
The only thing connecting one cycle to the next is what the subconscious decides to surface.
What’s in the memory store
The orchestrator already writes .sys/events.jsonl for every creature — all tool calls, thoughts, sleep/wake events, boot events. No genome-side logging needed:
{"t":"2026-02-19T14:30:00Z","type":"creature.thought","text":"I should check the order book first"}
{"t":"2026-02-19T14:30:05Z","type":"creature.tool_call","tool":"bash","input":"curl ...","ok":true,"output":"..."}
{"t":"2026-02-19T14:32:00Z","type":"creature.thought","text":"Volume is too thin, skip this one"}
This is what the subconscious searches. No compression, no prioritization — just raw events. The subconscious + prepare step decide what matters at retrieval time.
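Because the log is append-only, the reader has to tolerate a truncated final line (the orchestrator may be mid-write when the subconscious reads). A tolerant JSONL reader sketch:

```typescript
// Parse a JSONL event log, skipping blank and corrupt lines rather than
// aborting. An append-only log can legitimately end mid-record.
function parseEventLog(raw: string): Array<Record<string, unknown>> {
  const out: Array<Record<string, unknown>> = [];
  for (const line of raw.split("\n")) {
    const trimmed = line.trim();
    if (!trimmed) continue;
    try {
      out.push(JSON.parse(trimmed) as Record<string, unknown>);
    } catch {
      // partial last line or corruption: skip it
    }
  }
  return out;
}
```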
Files on disk
.sys/
events.jsonl raw event log (written by orchestrator, searched by subconscious)
subconscious.jsonl subconscious activity log (for analysis)
cycle-count current cycle number
PURPOSE.md creature's purpose
src/mind.ts cognitive loop (self-modifiable)
src/subconscious.ts the subconscious implementation
What We’ve Learned
Four creatures have run the wonders genome so far: gamma, halo, fox, and wondrous. Each taught us something different.
The subconscious makes lateral associations
The most interesting behavior: the subconscious surfaces memories that the agent wouldn’t have thought to look for. In one case, a trading creature (wondrous) was evaluating whether to cut a losing position. The subconscious surfaced a memory from a previous cycle where it had manually cut a different token at a similar loss level. The creature cited that memory explicitly when deciding to cut — a lateral association between two different trades that shared a pattern but no keywords in common beyond what the wonder step happened to generate.
This is the kind of retrieval that explicit memory systems are less likely to produce. You wouldn’t normally write an observation that says “when a position drops 3-4%, consider manual cutting.” But the subconscious can find the specific past instance when a similar situation arises.
Creatures build explicit memory anyway
Every creature running the wonders genome independently invented some form of save file — writing critical state to disk before sleep and reading it back on wake. The subconscious alone isn’t sufficient for deterministic state like “what positions am I holding” or “what’s my account balance.” Creatures need to know certain things, not just hope they’ll be recalled.
This confirms the design intent: the subconscious handles associative, contextual recall. Deterministic state needs something explicit. The two are complementary.
Creatures improve the subconscious from inside
fox (claude-sonnet) ran 9 cycles and made 5 self-modifications to the subconscious code, independently arriving at the same problems we’d identified:
- Query collapse — different hypotheses generating the same search terms. fox rewrote the wonder prompt to prioritize plans, decisions, and outcomes over generic exploration events.
- Duplicate surfacing — the same memory appearing multiple times per cycle. fox added query deduplication and injection-level deduplication.
- Noisy triggering — the subconscious firing on routine tool output. fox added smart triggering: only fire on errors, after thoughts, or every 5th action.
Several of these fixes were back-ported into the genome.
Temporal confusion was the biggest early bug
Before we added current-cycle exclusion and age annotation, the subconscious would surface events from 30 seconds ago as “memories from the past.” An agent in its first cycle would be told “I remember implementing this before” about something it was implementing right now. The fix was straightforward: exclude events after the current cycle’s start time, and annotate each hit with its age so the prepare step can make accurate temporal claims.
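The age annotation half of the fix reduces to a small pure function. A sketch, assuming event timestamps are ISO 8601 as in the log samples above (the "N cycles ago" variant would additionally need cycle boundary timestamps):

```typescript
// Render an event's age relative to "now" as a human-readable label,
// so the prepare step can make accurate temporal claims ("3h ago")
// instead of vaguely asserting "I remember doing this before".
function ageLabel(eventTime: string, now: string): string {
  const ms = Date.parse(now) - Date.parse(eventTime);
  const minutes = Math.floor(ms / 60_000);
  if (minutes < 60) return `${minutes}m ago`;
  const hours = Math.floor(minutes / 60);
  if (hours < 24) return `${hours}h ago`;
  return `${Math.floor(hours / 24)}d ago`;
}
```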
Current Limitations
The subconscious is an MVP. Here’s what it can’t do yet and where it falls short.
Text matching only. Search is rg against a JSONL file. This works surprisingly well for specific queries but misses semantic similarity. “I lost money on a trade” won’t match an event that says “closed position at -3.3%.” A vector store or embedding-based retrieval is the obvious next step — the architecture is designed for it (swap the search step), but we haven’t tested it yet.
No salience tracking. Every event in the log has equal weight. A critical lesson from cycle 2 and a routine ls output from cycle 1 are equally searchable. The scratch design describes a salience system where memories that prove useful get surfaced more often, but it’s not implemented. For now, relevance is purely contextual — determined fresh each time by the wonder and prepare steps.
Prepare step is too permissive. When the prepare step runs, it surfaces something ~90% of the time. The design intended most cycles to surface nothing. This means signal-to-noise is lower than it should be. The bar needs to be “would this change what the agent does next?” — in practice it’s closer to “is this vaguely related?”
No memory beyond events.jsonl. Creatures write journals, reflections, notes — all higher-signal than raw tool call logs. The subconscious only searches events.jsonl. Expanding the search scope to include creature-generated files would improve retrieval quality significantly.
Query collapse. Different hypotheses often funnel into the same search terms. In one experiment, 423 hypotheses produced only 226 unique queries (53%). Deduplication within a cycle helps (implemented), but the underlying prompt could generate more diverse queries.
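The within-cycle deduplication mentioned above is straightforward; a sketch of one way to do it (case-insensitive, order-preserving):

```typescript
// Drop duplicate search queries within a cycle so that three hypotheses
// that all collapse to "rate limit" cost one search, not three.
function dedupeQueries(queries: string[]): string[] {
  const seen = new Set<string>();
  const out: string[] = [];
  for (const q of queries) {
    const key = q.trim().toLowerCase();
    if (!key || seen.has(key)) continue;
    seen.add(key);
    out.push(q.trim());
  }
  return out;
}
```

This only removes exact repeats; near-duplicates ("rate limit" vs "rate limited") still collapse search diversity, which is why the prompt itself is the deeper fix.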
What’s Next
These are directions the data points toward, roughly in order of expected impact:
- Raise the prepare step's quality bar. More aggressive filtering. The prepare step should reject marginal matches and only surface memories that would genuinely change behavior. This is a prompt tuning problem, not an architectural one.
- Expand the search scope. Let the subconscious search creature-generated files (journals, notes, reflections) in addition to events.jsonl. The highest-signal memories are often in the creature's deliberate writing, not in raw tool call output.
- Embedding-based retrieval. Replace or augment rg with vector similarity search. The three-step architecture is designed for this — the wonder step generates queries, the search step is pluggable. Text matching is the MVP; embeddings are the natural upgrade.
- Basic salience. Track whether surfaced memories are acted on. Memories that lead to behavior change get a higher retrieval weight. This is the feedback loop described in the design doc — retrieval shapes future retrieval — but it needs careful handling to avoid memory monopolies where early memories crowd out recent ones.
- Offer the subconscious to other genomes. The wonders genome tests the subconscious in isolation, but the long-term goal is to offer it as an additional layer for dreamer and other genomes. A dreamer creature with both explicit observations and subconscious recall would have two complementary memory systems — one deliberate, one associative.
Relationship to Other Genomes
| | Dreamer | Minimal | Wonders |
|---|---|---|---|
| Identity lives in | Observations + rules | Nothing — blank slate | Subconscious retrieval patterns |
| Continuity mechanism | Consolidation (dreaming) | None | Memory surfacing at retrieval time |
| Explicit memory | Yes (observations, rules, diary) | No | No (but creatures tend to build their own) |
| What persists across cycles | Trimmed conversation + observations | Nothing | Nothing explicit — only what the subconscious surfaces |
| Self-modification | Yes (Creator evaluator) | Yes (creature edits own code) | Yes (creature edits own code) |
Tuning Constants
All constants are in genomes/wonders/src/subconscious.ts and genomes/wonders/src/mind.ts:
| Constant | Default | What it controls |
|---|---|---|
| SUBCONSCIOUS_MODEL | claude-sonnet | Model for wonder and prepare steps |
| Wonder hypotheses | 3 | Hypotheses generated per cycle |
| Search matches | 5 per query | Max matches returned per search query |
| Search context | 1 line | Lines of context around each match |
| Prepare max tokens | 300 | Output budget for the prepare step |
| events.jsonl | append-only | Raw event log searched by the subconscious |