Working Memory - Mastra Agent Persistence

Trust: ★★★☆☆ (0.90) · 0 validations · developer_reference

Published: 2026-05-10 · Source: crawler_authoritative

Tình huống

Mastra Agent SDK documentation for implementing working memory, which allows agents to maintain persistent user information across interactions using Markdown templates or JSON schemas.

Insight

Working memory in Mastra is a block of Markdown or JSON that agents can update over time to store continuously relevant information. It operates at two scopes: resource-scoped (default, persists across all conversation threads for the same user) and thread-scoped (isolated per conversation thread). Resource-scoped memory requires storage adapters supporting the mastra_resources table: libSQL (@mastra/libsql), PostgreSQL (@mastra/pg), Upstash (@mastra/upstash), and MongoDB (@mastra/mongodb). Working memory can be defined using either a Markdown template (with replace semantics where the agent provides complete content on each update) or a JSON schema compatible with Standard JSON Schema (Zod, Valibot, ArkType) using merge semantics where only fields to add/update are provided. Schema-based memory uses deep merge for objects, sets fields to null to delete, and replaces arrays entirely. The readOnly: true option allows agents to access working memory without modifying it. Switching between scopes means the agent won’t see memory from the other scope.

Hành động

Configure working memory in the Memory constructor options. Enable with workingMemory: { enabled: true }. Set scope with scope: 'resource' (default) or scope: 'thread'. Define a Markdown template with the template key or a JSON schema with the schema key (not both). For resource-scoped memory, pass resource parameter in memory options when calling agent.generate(). Set initial working memory via thread metadata’s workingMemory key when creating threads, or use memory.updateThread() to modify existing. Use memory.updateWorkingMemory({ threadId, resourceId, workingMemory }) for direct updates. Enable read-only mode with options: { readOnly: true } when calling agent.generate().

Kết quả

Agents maintain persistent working memory as Markdown or JSON, updating it over time with user information. Resource-scoped memory persists across all threads for the same userId, while thread-scoped memory remains isolated per conversation. Schema-based updates preserve existing fields through merge semantics.

Điều kiện áp dụng

Resource-scoped working memory requires storage adapters that support the mastra_resources table (libSQL, PostgreSQL, Upstash, MongoDB). You must specify either template or schema, but not both.

Nội dung gốc (Original)

Working Memory

While message history and semantic recall help agents remember conversations, working memory allows them to maintain persistent information about users across interactions.

Think of it as the agent’s active thoughts or scratchpad – the key information they keep available about the user or task. It’s similar to how a person would naturally remember someone’s name, preferences, or important details during a conversation.

This is useful for maintaining ongoing state that’s always relevant and should always be available to the agent.

Working memory can persist at two different scopes:

Resource-scoped (default): Memory persists across all conversation threads for the same user
Thread-scoped: Memory is isolated per conversation thread

Important: Switching between scopes means the agent won’t see memory from the other scope - thread-scoped memory is completely separate from resource-scoped memory.

Quickstart

Here’s a minimal example of setting up an agent with working memory:

import { Agent } from '@mastra/core/agent'
import { Memory } from '@mastra/memory'
 
// Create agent with working memory enabled
const agent = new Agent({
  id: 'personal-assistant',
  name: 'PersonalAssistant',
  instructions: 'You are a helpful personal assistant.',
  model: 'openai/gpt-5.4',
  memory: new Memory({
    options: {
      workingMemory: {
        enabled: true,
      },
    },
  }),
})

How it works

Working memory is a block of Markdown text that the agent is able to update over time to store continuously relevant information:

YouTube video player

Memory persistence scopes

Working memory can operate in two different scopes, allowing you to choose how memory persists across conversations:

Resource-Scoped Memory (Default)

By default, working memory persists across all conversation threads for the same user (resourceId), enabling persistent user memory:

const memory = new Memory({
  storage,
  options: {
    workingMemory: {
      enabled: true,
      scope: 'resource', // Memory persists across all user threads
      template: `# User Profile
- **Name**:
- **Location**:
- **Interests**:
- **Preferences**:
- **Long-term Goals**:
`,
    },
  },
})

Use cases:

Personal assistants that remember user preferences
Customer service bots that maintain customer context
Educational applications that track student progress

Usage with Agents

When using resource-scoped memory, make sure to pass the resource parameter in the memory options:

// Resource-scoped memory requires resource
const response = await agent.generate('Hello!', {
  memory: {
    thread: 'conversation-123',
    resource: 'user-alice-456', // Same user across different threads
  },
})

Thread-Scoped Memory

Thread-scoped memory isolates working memory to individual conversation threads. Each thread maintains its own isolated memory:

const memory = new Memory({
  storage,
  options: {
    workingMemory: {
      enabled: true,
      scope: 'thread', // Memory is isolated per thread
      template: `# User Profile
- **Name**:
- **Interests**:
- **Current Goal**:
`,
    },
  },
})

Use cases:

Different conversations about separate topics
Temporary or session-specific information
Workflows where each thread needs working memory but threads are ephemeral and not related to each other

Storage adapter support

Resource-scoped working memory requires specific storage adapters that support the mastra_resources table:

Supported Storage Adapters

libSQL (@mastra/libsql)
PostgreSQL (@mastra/pg)
Upstash (@mastra/upstash)
MongoDB (@mastra/mongodb)

Custom templates

Templates guide the agent on what information to track and update in working memory. While a default template is used if none is provided, you’ll typically want to define a custom template tailored to your agent’s specific use case to ensure it remembers the most relevant information.

Here’s an example of a custom template. In this example the agent will store the users name, location, timezone, etc as soon as the user sends a message containing any of the info:

const memory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      template: `
# User Profile
 
## Personal info
 
- Name:
- Location:
- Timezone:
 
## Preferences
 
- Communication Style: [e.g., Formal, Casual]
- Project Goal:
- Key Deadlines:
  - [Deadline 1]: [Date]
  - [Deadline 2]: [Date]
 
## Session state
 
- Last Task Discussed:
- Open Questions:
  - [Question 1]
  - [Question 2]
`,
    },
  },
})

Designing effective templates

A well-structured template keeps the information straightforward for the agent to parse and update. Treat the template as a short form that you want the assistant to keep up to date.

Short, focused labels. Avoid paragraphs or very long headings. Keep labels brief (for example ## Personal Info or - Name:) so updates are readable and less likely to be truncated.
Use consistent casing. Inconsistent capitalization (Timezone: vs timezone:) can cause messy updates. Stick to Title Case or lower case for headings and bullet labels.
Keep placeholder text minimal. Use hints such as [e.g., Formal] or [Date] to help the LLM fill in the correct spots.
Abbreviate very long values. If you only need a short form, include guidance like - Name: [First name or nickname] or - Address (short): rather than the full legal text.
Mention update rules in instructions. You can instruct how and when to fill or clear parts of the template directly in the agent’s instructions field.

Alternative Template Styles

Use a shorter single block if you only need a few items:

const basicMemory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      template: `User Facts:\n- Name:\n- Favorite Color:\n- Current Topic:`,
    },
  },
})

You can also store the key facts in a short paragraph format if you prefer a more narrative style:

const paragraphMemory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      template: `Important Details:\n\nKeep a short paragraph capturing the user's important facts (name, main goal, current task).`,
    },
  },
})

Structured working memory

Working memory can also be defined using a structured schema instead of a Markdown template. This allows you to specify the exact fields and types that should be tracked, using a Standard JSON Schema (Zod, Valibot, ArkType, etc.). When using a schema, the agent will see and update working memory as a JSON object matching your schema.

Important: You must specify either template or schema, but not both.

Example: Schema-Based Working Memory

import { z } from 'zod'
import { Memory } from '@mastra/memory'
 
const userProfileSchema = z.object({
  name: z.string().optional(),
  location: z.string().optional(),
  timezone: z.string().optional(),
  preferences: z
    .object({
      communicationStyle: z.string().optional(),
      projectGoal: z.string().optional(),
      deadlines: z.array(z.string()).optional(),
    })
    .optional(),
})
 
const memory = new Memory({
  options: {
    workingMemory: {
      enabled: true,
      schema: userProfileSchema,
      // template: ... (do not set)
    },
  },
})

When a schema is provided, the agent receives the working memory as a JSON object. For example:

{
  "name": "Sam",
  "location": "Berlin",
  "timezone": "CET",
  "preferences": {
    "communicationStyle": "Formal",
    "projectGoal": "Launch MVP",
    "deadlines": ["2025-07-01"]
  }
}

Merge Semantics for Schema-Based Memory

Schema-based working memory uses merge semantics, meaning the agent only needs to include fields it wants to add or update. Existing fields are preserved automatically.

Object fields are deep merged: Only provided fields are updated; others remain unchanged
Set a field to null to delete it: This explicitly removes the field from memory
Arrays are replaced entirely: When an array field is provided, it replaces the existing array (arrays aren’t merged element-by-element)

Choosing between template and schema

Use a template (Markdown) if you want the agent to maintain memory as a free-form text block, such as a user profile or scratchpad. Templates use replace semantics — the agent must provide the complete memory content on each update.
Use a schema if you need structured, type-safe data that can be validated and programmatically accessed as JSON. The workingMemory.schema field accepts any PublicSchema-compatible schema (including Zod v3, Zod v4, JSON Schema, or already-standard schemas). Schemas use merge semantics — the agent only provides fields to update, and existing fields are preserved.
Only one mode can be active at a time: setting both template and schema isn’t supported.

Example: Multi-step retention

Below is a simplified view of how the User Profile template updates across a short user conversation:

# User Profile
 
## Personal info
 
- Name:
- Location:
- Timezone:
 
--- After user says "My name is **Sam** and I'm from **Berlin**" ---
 
# User Profile
- Name: Sam
- Location: Berlin
- Timezone:
 
--- After user adds "By the way I'm normally in **CET**" ---
 
# User Profile
- Name: Sam
- Location: Berlin
- Timezone: CET

The agent can now refer to Sam or Berlin in later responses without requesting the information again because it has been stored in working memory.

If your agent isn’t properly updating working memory when you expect it to, you can add system instructions on how and when to use this template in your agent’s instructions setting.

Setting initial working memory

While agents typically update working memory through the updateWorkingMemory tool, you can also set initial working memory programmatically when creating or updating threads. This is useful for injecting user data (like their name, preferences, or other info) that you want available to the agent without passing it in every request.

Setting Working Memory via Thread Metadata

When creating a thread, you can provide initial working memory through the metadata’s workingMemory key:

// Create a thread with initial working memory
const thread = await memory.createThread({
  threadId: 'thread-123',
  resourceId: 'user-456',
  title: 'Medical Consultation',
  metadata: {
    workingMemory: `# Patient Profile
- Name: John Doe
- Blood Type: O+
- Allergies: Penicillin
- Current Medications: None
- Medical History: Hypertension (controlled)
`,
  },
})
 
// The agent will now have access to this information in all messages
await agent.generate("What's my blood type?", {
  memory: {
    thread: thread.id,
    resource: 'user-456',
  },
})
// Response: "Your blood type is O+."

Updating Working Memory Programmatically

You can also update an existing thread’s working memory:

// Update thread metadata to add/modify working memory
await memory.updateThread({
  id: 'thread-123',
  title: thread.title,
  metadata: {
    ...thread.metadata,
    workingMemory: `# Patient Profile
- Name: John Doe
- Blood Type: O+
- Allergies: Penicillin, Ibuprofen  // Updated
- Current Medications: Lisinopril 10mg daily  // Added
- Medical History: Hypertension (controlled)
`,
  },
})

Direct Memory Update

Alternatively, use the updateWorkingMemory method directly:

await memory.updateWorkingMemory({
  threadId: 'thread-123',
  resourceId: 'user-456', // Required for resource-scoped memory
  workingMemory: 'Updated memory content...',
})

Read-only working memory

In some scenarios, you may want an agent to have access to working memory data without the ability to modify it. This is useful for:

Routing agents that need context but shouldn’t update user profiles
Sub agents in a multi-agent system that should reference but not own the memory

To enable read-only mode, set readOnly: true in the memory options:

const response = await agent.generate('What do you know about me?', {
  memory: {
    thread: 'conversation-123',
    resource: 'user-alice-456',
    options: {
      readOnly: true, // Working memory is provided but cannot be updated
    },
  },
})

Examples

Working memory with template
Working memory with schema
Per-resource working memory: Complete example showing resource-scoped memory persistence

Liên kết

Nền tảng: Dev Framework · Mastra
Nguồn: https://mastra.ai/docs/memory/working-memory

Xem thêm:

KakaFlow KB

Nội dung

Working Memory - Mastra Agent Persistence

Working Memory - Mastra Agent Persistence

Tình huống

Insight

Hành động

Kết quả

Điều kiện áp dụng

Nội dung gốc (Original)

Working Memory

Quickstart

How it works

Memory persistence scopes

Resource-Scoped Memory (Default)

Usage with Agents

Thread-Scoped Memory

Storage adapter support

Supported Storage Adapters

Custom templates

Designing effective templates

Alternative Template Styles

Structured working memory

Example: Schema-Based Working Memory

Merge Semantics for Schema-Based Memory

Choosing between template and schema

Example: Multi-step retention

Setting initial working memory

Setting Working Memory via Thread Metadata

Updating Working Memory Programmatically

Direct Memory Update

Read-only working memory

Examples

Liên kết

Sơ đồ

Mục lục

Liên kết ngược