Core Concepts

EngramDB is built around several key concepts that work together to provide a comprehensive memory management system for agents. This document explains these core concepts and how they relate to each other.

Memory Node

The MemoryNode is the fundamental unit of storage in EngramDB. It combines multiple aspects of memory representation:

Vector Embeddings: Numerical vectors that represent the semantic content of the memory, enabling similarity search
Attributes: Key-value pairs for structured data associated with the memory
Connections: Graph-like relationships to other memory nodes
Temporal Layers: Historical versions of the memory that track how it evolves over time

A memory node is identified by a unique UUID and includes metadata such as creation timestamp and access patterns.

Vector Embeddings

Vector embeddings are numerical representations of semantic content. In EngramDB, these embeddings enable similarity search, allowing you to find memories that are semantically related even if they don’t share the same attributes. EngramDB supports two types of embeddings:

Single Vector Embeddings: Traditional embeddings where a document is represented as a single vector
Multi-Vector Embeddings: ColBERT/ColPali-style embeddings where a document is represented as multiple vectors

The vector index maintains these embeddings and provides efficient similarity search using cosine similarity. EngramDB supports multiple vector index implementations:

Linear Index: Simple brute-force search, suitable for small datasets
HNSW Index: Fast approximate search using Hierarchical Navigable Small World graphs, suitable for large datasets
HNSW Multi-Vector Index: Specialized index for multi-vector embeddings

EngramDB also includes built-in support for generating embeddings directly from text using machine learning models. This means you can:

Create memory nodes directly from text content
Generate embeddings for semantic search queries
Utilize state-of-the-art embedding models:
- E5 Multilingual Large Instruct (default)
- GTE Modern BERT Base
- Jina Embeddings V3
- Custom models
Fall back to deterministic mock embeddings when ML dependencies aren’t available

See the Embeddings documentation and Vector Index API Reference for more details.

Attributes

Attributes are flexible key-value pairs associated with a memory node. They can store various types of data:

Strings
Integers
Floating-point numbers
Booleans
Lists
Maps (nested key-value pairs)

Attributes provide a way to add structured metadata to memories and enable filtering in queries.

Connections

Connections represent relationships between memory nodes, forming a graph structure. Each connection has:

A target memory node ID
A relationship type (e.g., Association, Causation, Sequence)
A strength value (between 0.0 and 1.0)
A creation timestamp

Connections allow memories to reference and relate to each other, enabling graph-based queries and traversals.

Temporal Layers

Temporal layers track how memories evolve over time. Each layer represents a version of the memory at a specific point in time and includes:

A timestamp
Optionally modified embeddings
Optionally modified attributes
A reason for the change

This temporal dimension allows EngramDB to maintain a history of how memories change and evolve, which is crucial for agent systems that learn and update their knowledge over time.

Database

The Database is the main interface for interacting with EngramDB. It provides methods for:

Saving and loading memory nodes
Searching for similar memories
Querying with complex filters
Managing connections between memories

The database combines a storage engine (for persistence) with a vector index (for similarity search).

Storage Engines

EngramDB supports multiple storage backends:

Memory Storage: Volatile storage for testing and development
File Storage: Persistent storage for production use

The storage engine is responsible for saving and loading memory nodes, while the database coordinates between storage and indexing.

Query System

The query system provides a flexible way to search for memories based on multiple criteria:

Vector similarity
Attribute filters
Temporal filters
Connection filters

Queries can combine these different dimensions to find memories that match complex criteria.

Background Processing

EngramDB includes a background processing system that performs tasks during idle periods. This “sleep-time compute” capability allows the database to autonomously organize and enrich memories. Key components of the background processing system include:

Activity Tracker: Monitors database usage to detect idle periods
Task Manager: Schedules, prioritizes, and executes background tasks
Task Types: Various operations the system can perform:
- Summarization: Create summaries of related memories
- Connection Inference: Discover relationships between memories
- Node Enrichment: Add additional context to memories
- Query Prediction: Anticipate future queries and pre-compute results

Background tasks can be configured with various parameters:

Idle threshold (when to start processing)
Token usage limits
Cost constraints
Concurrency limits

See the Background Processing documentation for more details.

Putting It All Together

In EngramDB, these concepts work together to provide a comprehensive memory system:

You create MemoryNode instances with embeddings and attributes
You save them to the Database
The database stores them using the StorageEngine and indexes their embeddings in the VectorIndex
You can create Connections between nodes to form a memory graph
You can update nodes over time, creating TemporalLayers that track their evolution
You can query the database using vector similarity, attribute filters, and temporal constraints
The background processing system organizes and enriches memories during idle periods

This unified approach allows for rich, multi-dimensional representation and querying of agent memories, with autonomous organization capabilities.

Getting Started

API Reference

Examples

Advanced Topics

Contributing

Core Concepts

Core Concepts

Memory Node

Vector Embeddings

Attributes

Connections

Temporal Layers

Database

Storage Engines

Query System

Background Processing

Putting It All Together

Getting Started

API Reference

Examples

Advanced Topics

Contributing

Documentation Index

​Core Concepts

​Memory Node

​Vector Embeddings

​Attributes

​Connections

​Temporal Layers

​Database

​Storage Engines

​Query System

​Background Processing

​Putting It All Together

Core Concepts

Memory Node

Vector Embeddings

Attributes

Connections

Temporal Layers

Database

Storage Engines

Query System

Background Processing

Putting It All Together