The problem with plaintext memory
Plaintext agent memory stores facts by position. Context compression destroys position. A file like:

Encoding
Each fact is encoded as an amplitude-shift keyed (ASK) signal on a dedicated carrier frequency. All 40 channels are superimposed into a single composite waveform, then quantized to 512 token IDs drawn from the Hermes3 vocabulary. The result is a [MEM]...[/MEM] block: 512 tokens of opaque signal.
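The encoding step can be illustrated with a minimal sketch. It is not the project's implementation: the amplitude range of [0, 1], the choice of integer-cycle sine carriers, the `LEVELS` quantization depth, and the `TOKEN_BASE` vocabulary offset are all assumptions made for illustration. The explicit correlation decoder is likewise only a sanity check that the amplitudes survive quantization; it is a conventional signal-processing technique, not what the reader model does.

```python
import math

N_TOKENS = 512      # length of the [MEM] block (from the text)
N_CHANNELS = 40     # one carrier per fact (from the text)
LEVELS = 4096       # assumed number of quantization levels
TOKEN_BASE = 1000   # hypothetical offset into the vocabulary


def carrier(channel: int, n: int) -> float:
    """Sine carrier with (channel + 1) whole cycles across the block."""
    return math.sin(2 * math.pi * (channel + 1) * n / N_TOKENS)


def encode(amplitudes: list[float]) -> list[int]:
    """Superimpose the ASK carriers and quantize each sample to a token ID."""
    assert len(amplitudes) == N_CHANNELS
    token_ids = []
    for n in range(N_TOKENS):
        sample = sum(a * carrier(k, n) for k, a in enumerate(amplitudes))
        # Amplitudes in [0, 1] keep the composite within [-N_CHANNELS, N_CHANNELS].
        unit = (sample + N_CHANNELS) / (2 * N_CHANNELS)
        level = min(LEVELS - 1, int(unit * LEVELS))
        token_ids.append(TOKEN_BASE + level)
    return token_ids


def decode_channel(token_ids: list[int], channel: int) -> float:
    """Estimate one amplitude by correlating against that channel's carrier."""
    total = 0.0
    for n, tok in enumerate(token_ids):
        # Map the token back to the midpoint of its quantization bin.
        unit = (tok - TOKEN_BASE + 0.5) / LEVELS
        sample = unit * 2 * N_CHANNELS - N_CHANNELS
        total += sample * carrier(channel, n)
    # Orthogonal sine carriers each contribute N_TOKENS / 2 to their own sum.
    return 2 * total / N_TOKENS
```

Because the carriers complete whole numbers of cycles over the 512 samples, they are mutually orthogonal, which is what lets a single channel be read back out of the composite without interference from the other 39.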
Decoding
When the agent needs a fact, the fine-tuned Hermes3-3B reader model performs implicit frequency decomposition: given the 512-token signal and a natural-language question, it returns the encoded value directly.

Fast path vs. slow path
The /decode endpoint has two paths:
- Fast path (default): reads from persistent server-side state. ~0ms. Used for routine agent queries.
- Slow path (use_model=true): sends the .mem block to the Hermes3 inference endpoint. ~400ms. Used to verify spectral encoding integrity or when state is unavailable.
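
The choice between the two paths can be sketched as a small dispatch function. This is a hypothetical illustration rather than the server's code: the `State` mapping, the `Reader` callable, the function signature, and the returned path label are all invented here, and treating a missing key as "state unavailable" is an assumption. Only the `use_model` flag and the fast/slow distinction come from the description above.

```python
from typing import Callable, Optional

# Hypothetical stand-in for the persistent server-side state: fact key -> value.
State = dict[str, str]
# Hypothetical reader model interface: (mem_block, question) -> decoded value.
Reader = Callable[[str, str], str]


def decode(key: str, state: Optional[State], mem_block: str,
           reader: Reader, use_model: bool = False) -> tuple[str, str]:
    """Return (value, path_used), taking the fast path unless told otherwise."""
    if not use_model and state is not None and key in state:
        # Fast path: answer straight from stored state, no model call.
        return state[key], "fast"
    # Slow path: requested explicitly, or the state cannot answer.
    return reader(mem_block, key), "slow"
```

Calling `decode(..., use_model=True)` on a key that is also present in the state gives a way to compare the model's reading of the block against the stored value, which matches the integrity-check use of the slow path.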