龙虾源码精讲

OpenClaw的Gateway模块是其系统的核心通信枢纽，基于WebSocket实现客户端与服务端的高效交互。该模块包含四种协同机制：共享上下文、事件驱动、直接方法调用和插件作用域协同，确保各组件高效协作。

十正

374人浏览 · 2026-05-03 12:45:00

十正 · 2026-05-03 12:45:00 发布

简介

OpenClaw 的 Gateway 模块是其系统的核心通信枢纽，主要负责客户端与服务端之间高效、可靠的交互，特别是围绕聊天功能展开。它基于 WebSocket 实时通信，通过 GatewayBrowserClient 实现连接管理与消息收发，支持客户端通过 request 发起请求，服务端通过 event 主动推送消息。

服务端通过分层处理器（如 chat.send）处理用户请求，涵盖消息接收、附件处理、代理调度及回复传递等全流程。模块内设计了四种协同机制：共享上下文（间接状态同步）、事件驱动（松耦合通信）、直接方法调用（跨处理器显式调用）和插件作用域协同（请求级别数据传递），保障各组件高效协作。

此外，Gateway 还支持聊天历史与记忆管理（短期记忆最多保留 20 条，长期记忆通过工具主动检索或自动提升）、安全隔离（区分可信与不可信数据，防止提示注入）及中断与状态管理（通过 AbortController 实现用户中止即时响应），全面覆盖从基础通信到复杂智能交互的需求。

源码可以见：

https://github.com/openclaw/openclaw.git

客户端

GatewayBrowserClient，为WebSocket 客户端核心类，实现了包含连接建立、消息收发、重连逻辑等。

export class GatewayBrowserClient {
  private ws: WebSocket | null = null;
  // ... 其他私有字段

  private connect() {
    if (this.closed) {
      return;
    }
    this.ws = new WebSocket(this.opts.url);
    this.ws.addEventListener("open", () => this.queueConnect());
    this.ws.addEventListener("message", (ev) => this.handleMessage(String(ev.data ?? "")));
    this.ws.addEventListener("close", (ev) => {
      // 处理连接关闭和重连逻辑
      // ...
      if (!isNonRecoverableAuthError(connectError)) {
        this.scheduleReconnect();
      }
    });
    // ...
  }

  start() {
    this.closed = false;
    this.connect();
  }

  stop() {
    this.closed = true;
    this.clearConnectTimer();
    this.ws?.close();
    this.ws = null;
    // ...
  }
}

连接建立后，客户端通过以下方式与网关通信：

请求/响应：request(method, params) 发送 WebSocket + RPC 请求
事件订阅：服务器主动推送 event 帧（如聊天消息、状态更新）

request<T = unknown>(method: string, params?: unknown): Promise<T> {
    if (!this.ws || this.ws.readyState !== WebSocket.OPEN) {
      return Promise.reject(new Error("gateway not connected"));
    }
    const id = generateUUID();
    const frame = { type: "req", id, method, params };
    const p = new Promise<T>((resolve, reject) => {
      this.pending.set(id, { resolve: (v) => resolve(v as T), reject });
    });
    this.ws.send(JSON.stringify(frame));
    return p;
  }

private handleMessage(raw: string) {
    let parsed: unknown;
    try {
      parsed = JSON.parse(raw);
    } catch {
      return;
    }

    const frame = parsed as { type?: unknown };
    if (frame.type === "event") {
      const evt = parsed as GatewayEventFrame;
      if (evt.event === "connect.challenge") {
        const payload = evt.payload as { nonce?: unknown } | undefined;
        const nonce = payload && typeof payload.nonce === "string" ? payload.nonce : null;
        if (nonce) {
          this.connectNonce = nonce;
          void this.sendConnect();
        }
        return;
      }
     ...

  }

服务端

message-handler.ts处理客户端请求,调用server-methods.ts的handleGatewayRequest方法，将请求路由到具体的方法实现。

  socket.on("message", async (data) => {
    if (isClosed()) {
      return;
    }
    
    const text = rawDataToString(data);
    try {
      const parsed = JSON.parse(text);
      // ... 帧类型解析、元数据记录 ...
      
      const client = getClient();
      if (!client) {
        // 握手阶段：必须第一个消息是 connect 请求
        // ... 握手验证逻辑 ...
      } else {
        // 已认证连接：处理普通 RPC 请求
        // ... RPC 请求分发逻辑 ...
        
          // 委托给通用网关请求处理器
          void (async () => {
            await handleGatewayRequest({
              req,
              respond,
              client,
              isWebchatConnect,
              extraHandlers,
              context: buildRequestContext(),
            });
          })().catch((err) => {
            // 错误处理
          });
        
      }
    } catch (err) {
      // 解析错误处理
    }
  });

handleGatewayRequest

处理器查找，将请求交给具体处理器进行处理。

实际查找示例

示例1：chat.send 请求

1.请求方法：req.method = "chat.send"

2.检查 extraHandlers["chat.send"] → 通常不存在

3.检查 coreGatewayHandlers["chat.send"] → 找到 chatHandlers["chat.send"]

4.执行处理器：chatHandlers["chat.send"]({ params, respond, context, client })

  const handler = opts.extraHandlers?.[req.method] ?? coreGatewayHandlers[req.method];
  if (!handler) {
    respond(
      false,
      undefined,
      errorShape(ErrorCodes.INVALID_REQUEST, `unknown method: ${req.method}`),
    );
    return;
  }

核心处理器有：

export const coreGatewayHandlers: GatewayRequestHandlers = {
  ...connectHandlers,      // 连接管理
  ...logsHandlers,         // 日志
  ...voicewakeHandlers,    // 语音唤醒
  ...healthHandlers,       // 健康检查
  ...channelsHandlers,     // 通道管理
  ...chatHandlers,         // 聊天（核心处理器）
  ...commandsHandlers,     // 命令
  ...cronHandlers,         // 定时任务
  ...deviceHandlers,       // 设备管理
  ...diagnosticsHandlers,  // 诊断
  ...doctorHandlers,       // 医生工具
  ...execApprovalsHandlers,// 执行审批
  ...webHandlers,          // Web 操作
  ...modelsHandlers,       // 模型管理
  ...modelsAuthStatusHandlers, // 模型认证状态
  ...configHandlers,       // 配置管理
  ...wizardHandlers,       // 向导
  ...talkHandlers,         // 对话,移动端语音交互、文本转语音、语音模式切换
  ...toolsCatalogHandlers, // 工具目录
  ...toolsEffectiveHandlers, // 有效工具
  ...ttsHandlers,          // 语音合成
  ...skillsHandlers,       // 技能
  ...sessionsHandlers,     // 会话管理
  ...systemHandlers,       // 系统操作
  ...updateHandlers,       // 更新
  ...nodeHandlers,         // 节点管理
  ...nodePendingHandlers,  // 待处理节点
  ...pushHandlers,         // 推送
  ...sendHandlers,         // 发送
  ...usageHandlers,        // 使用统计
  ...agentHandlers,        // 单个代理
  ...agentsHandlers,       // 多个代理
};

处理器间通过事件广播和配置共享间接协同。

例如：

用户语音输入 → [移动端] → 语音识别 → text → chat.send → 代理响应
代理响应 → talk.speak → 语音合成 → [移动端] → 语音播放给用户

处理器协同机制

四种核心协同机制

1. 共享上下文协同（间接状态同步）

通过 GatewayRequestContext 提供的共享状态映射，不同处理器可以读写同一份运行时数据

典型协同场景：

场景：chat.abort 中止正在运行的聊天任务

2. 事件驱动协同（松耦合通信）

通过 context.broadcast 和 context.nodeSendToSession 实现发布-订阅模式。

 //chat.ts L1572-L1574
   params.context.broadcast("chat", payload);
  params.context.nodeSendToSession(params.sessionKey, "chat", payload);
  params.context.agentRunSeq.delete(params.runId);

事件类型示例：

事件名	生产者	消费者	用途
"chat"	chatHandlers	所有客户端	聊天消息实时推送
"chat.side_result"	chatHandlers	特定会话	侧边栏结果更新
"talk.mode"	talkHandlers	iOS/Android 节点	语音模式切换
"device.pair.resolved"	deviceHandlers	设备管理组件	设备配对结果
"plugin.approval.requested"	插件审批处理器	管理员客户端	插件审批请求

3. 直接方法调用协同（显式跨处理器调用）

通过 dispatchGatewayMethod 函数，一个处理器（通常是插件运行时）可以直接调用其他网关方法。实际使用示例：

插件子代理获取会话消息：

// server-plugins.ts L302-L307
const getSessionMessages: PluginRuntime["subagent"]["getSessionMessages"] = async (params) => {
  const payload = await dispatchGatewayMethod<{ messages?: unknown[] }>("sessions.get", {
    key: params.sessionKey,
    ...(params.limit != null && { limit: params.limit }),
  });
  return { messages: Array.isArray(payload?.messages) ? payload.messages : [] };
};

4. 插件运行时请求作用域协同（上下文传递）

通过 withPluginRuntimeGatewayRequestScope 建立请求级别的上下文传递通道。

handleGatewayRequest
    ↓
withPluginRuntimeGatewayRequestScope({ context, client, ... })
    ↓
处理器执行
    ↓
插件运行时工具执行 --通过 getPluginRuntimeGatewayRequestScope()--> 获取当前上下文
    ↓
dispatchGatewayMethod --使用上下文--> 调用其他处理器

典型协同场景分析

场景一：语音增强聊天

用户语音输入
    ↓
[iOS 节点] --语音识别--> text
    ↓
chat.send("你好") --处理器-->
    ↓
1. 保存消息到会话文件
2. 广播 "chat" 事件
    ↓
[iOS 节点 接收 "chat" 事件] --检查 talk.mode-->
    ↓
talk.speak("助理回复内容") --处理器-->
    ↓
语音合成 → 返回音频 → iOS 播放

协同机制：

事件驱动：chat.send 广播事件触发语音合成

配置共享：talk.mode 状态决定是否启用语音

跨处理器调用：理论上可通过 dispatchGatewayMethod 直接调用

场景二：聊天中止与状态清理

chat.abort("run-123")
    ↓
1. 写入 chatAbortControllers["run-123"] = controller
2. 写入 chatAbortedRuns["run-123"] = timestamp
    ↓
agent.wait("run-123") --检查 chatAbortedRuns-->
    ↓
返回中止状态，清理 chatRunBuffers["run-123"]
    ↓
chat.send 新消息 --检查 chatAbortedRuns-->
    ↓
跳过已中止的运行，清理相关状态

chat.send – 发送聊天消息

最复杂的处理器，超过 500 行代码，涵盖消息接收、附件处理、路由决策、代理调度、回复传递和转录持久化。

以下是一个用户发送消息 "画一只猫" 并附带一张图片时的处理序列：

处理器通过构建一个 MsgContext 对象来为大模型（LLM）提供完整的对话上下文。这个上下文包含了原始消息、路由信息、用户身份、时间戳、附件信息等，是代理推理的基础。

MsgContext 类型的完整定义位于 src/auto-reply/templating.ts（第 24-207 行），包含 50+ 个字段，涵盖：

消息内容（Body、BodyForAgent、RawBody 等）

媒体附件（MediaPath、MediaUrls、Sticker 等）

会话信息（SessionKey、ParentSessionKey、RuntimePolicySessionKey）

路由与通道（OriginatingChannel、OriginatingTo、Provider、Surface）

用户身份（SenderId、SenderName、GatewayClientScopes）

线程与回复（MessageThreadId、ReplyToId、ThreadParentId）

命令与权限（CommandAuthorized、CommandSource、CommandTargetSessionKey）

Memory

聊天历史 (Chat History)

聊天历史通过 MsgContext 的 InboundHistory 字段传递，该字段在 src/auto-reply/templating.ts 中定义：

InboundHistory?: Array<{
  sender: string;
  body: string;
  timestamp?: number;
}>;

关键点：

历史记录最多保留 20 条（MAX_UNTRUSTED_HISTORY_ENTRIES = 20）。

每条记录包含发送者、时间戳和消息体。

这些内容被放在用户角色（user-role）的提示块中，避免污染系统提示的缓存稳定性。

长期记忆 (Long-term Memory)

长期记忆的整合有两种主要方式：工具指导和主动检索。

工具指导（通过 memory_search / memory_get）

memory-core 插件提供了两个工具：memory_search（语义搜索）和 memory_get（精确读取）。系统提示中会包含一个 “Memory Recall” 章节，指导代理在回答关于过去工作、决策、人员、偏好等问题前先调用这些工具。

这个提示章节由 extensions/memory-core/src/prompt-section.ts 的 buildPromptSection 函数生成：

export const buildPromptSection: MemoryPromptSectionBuilder = ({
  availableTools,
  citationsMode,
}) => {
  const hasMemorySearch = availableTools.has("memory_search");
  const hasMemoryGet = availableTools.has("memory_get");

  if (!hasMemorySearch && !hasMemoryGet) {
    return [];
  }

  let toolGuidance: string;
  if (hasMemorySearch && hasMemoryGet) {
    toolGuidance =
      "Before answering anything about prior work, decisions, dates, people, preferences, or todos: run memory_search on MEMORY.md + memory/*.md + indexed session transcripts; then use memory_get to pull only the needed lines. If low confidence after search, say you checked.";
  } else if (hasMemorySearch) {
    toolGuidance =
      "Before answering anything about prior work, decisions, dates, people, preferences, or todos: run memory_search on MEMORY.md + memory/*.md + indexed session transcripts and answer from the matching results. If low confidence after search, say you checked.";
  } else {
    toolGuidance =
      "Before answering anything about prior work, decisions, dates, people, preferences, or todos that already point to a specific memory file or note: run memory_get to pull only the needed lines. If low confidence after reading them, say you checked.";
  }

  const lines = ["## Memory Recall", toolGuidance];
  if (citationsMode === "off") {
    lines.push(
      "Citations are disabled: do not mention file paths or line numbers in replies unless the user explicitly asks.",
    );
  } else {
    lines.push(
      "Citations: include Source: <path#line> when it helps the user verify memory snippets.",
    );
  }
  lines.push("");
  return lines;
};

主动检索（Active Memory 插件）

为了避免每次对话都让主代理显式调用记忆工具（增加延迟和 token 消耗），OpenClaw 还提供了 active-memory 插件。该插件在提示构建之前运行一个受限的子代理，自动进行记忆检索，并将摘要直接注入上下文。

插件在 before_prompt_build 钩子中触发（extensions/active-memory/index.ts）。

api.on("before_prompt_build", async (event, ctx) => {
  // …（检查会话是否允许、是否交互式会话等）
  const query = buildQuery({
    latestUserMessage: event.prompt,
    recentTurns: extractRecentTurns(event.messages),
    config,
  });
  const result = await maybeResolveActiveRecall({
    api,
    config,
    agentId: effectiveAgentId,
    sessionKey: resolvedSessionKey,
    sessionId: ctx.sessionId,
    messageProvider: ctx.messageProvider,
    channelId: ctx.channelId,
    query,
    currentModelProviderId: ctx.modelProviderId,
    currentModelId: ctx.modelId,
  });
  if (!result.summary) {
    return undefined;
  }
  const promptPrefix = buildPromptPrefix(result.summary);
  if (!promptPrefix) {
    return undefined;
  }
  return {
    prependContext: promptPrefix,
  };
});

数据源

memory_search 和 memory_get 工具的数据源来自多个层次，包括本地文件系统、向量化索引以及可选的插件补充。下图概括了整体架构：

1. 核心数据源：工作空间 Markdown 文件

这是最基础、最常用的数据源，位于每个代理的工作空间目录中。

包含以下文件：

MEMORY.md：主记忆文件，存储长期、重要的知识条目

memory/*.md：按日期组织的日常记忆文件（如 memory/2025-04-30.md）

这些文件由用户或系统（如“记忆刷新”、“梦境”功能）直接编辑和维护。memory_get 工具主要操作这些文件，提供精确的行范围读取。

2. 向量化索引源：会话转录与语义搜索

当使用 memory_search 进行语义搜索时，系统还会搜索索引的会话转录（indexed session transcripts）：

来源：历史会话记录（聊天历史、工具调用结果等）

存储：通常使用 QMD（向量数据库）后端进行嵌入和索引

访问：通过 memory.manager.search() 方法调用，支持近似最近邻（ANN）搜索

// 搜索逻辑（tools.ts 第 242-250 行）
rawResults = await memory.manager.search(query, {
  maxResults,
  minScore,
  sessionKey: options.agentSessionKey,
  qmdSearchModeOverride,
  onDebug: (debug) => { … },
});

后端配置由 resolveMemoryBackendConfig() 决定，可以是：

builtin：纯文件系统，仅支持全文检索（FTS）

qmd：向量数据库，支持语义搜索（默认推荐）

3. 补充数据源：Wiki 语料库

通过 corpus=wiki 或 corpus=all 参数，可以扩展搜索到已注册的编译 Wiki 补充数据

4. 数据源优先级与使用场景

工具	主要数据源	可选扩展	典型用途
memory_search	MEMORY.md + memory/*.md + 会话转录（向量索引）	corpus=wiki / corpus=all	语义检索过往工作、决策、人员、偏好等
memory_get	MEMORY.md + memory/*.md（精确文件）	corpus=wiki	读取特定记忆文件的片段

5. 物理存储位置

代理工作空间目录通常位于：

~/.openclaw/agents/<agentId>/workspace/
├── MEMORY.md
├── memory/
│   ├── 2025-04-29.md
│   ├── 2025-04-30.md
│   └── …
└── .state/          # 向量索引、元数据等

MEMORY.md的更新

记忆刷新（Memory Flush）

在会话即将触发压缩（compaction）之前，系统会运行一个专用的记忆刷新回合，让 LLM 将当前会话中有价值的持久性记忆写入磁盘。

触发条件

刷新逻辑位于 src/auto-reply/reply/memory-flush.ts，主要检查：

令牌数接近上下文窗口：当当前提示令牌 + 预估输出令牌 > 窗口大小 - 保留阈值时触发。

会话转录文件过大：当转录文件超过 forceFlushTranscriptBytes（默认 2 MiB）时强制刷新。

同一压缩周期内未刷新过：避免重复刷新。

LLM的记忆刷新提示词：

// extensions/memory-core/src/flush-plan.ts
export const DEFAULT_MEMORY_FLUSH_PROMPT = [
  "Pre-compaction memory flush.",
  MEMORY_FLUSH_TARGET_HINT,
  MEMORY_FLUSH_READ_ONLY_HINT,
  MEMORY_FLUSH_APPEND_ONLY_HINT,
  "Do NOT create timestamped variant files (e.g., YYYY-MM-DD-HHMM.md); always use the canonical YYYY-MM-DD.md filename.",
  `If nothing to store, reply with ${SILENT_REPLY_TOKEN}.`,
].join(" ");

export const DEFAULT_MEMORY_FLUSH_SYSTEM_PROMPT = [
  "Pre-compaction memory flush turn.",
  "The session is near auto-compaction; capture durable memories to disk.",
  MEMORY_FLUSH_TARGET_HINT,
  MEMORY_FLUSH_READ_ONLY_HINT,
  MEMORY_FLUSH_APPEND_ONLY_HINT,
  `You may reply, but usually ${SILENT_REPLY_TOKEN} is correct.`,
].join(" ");

短期记忆提升（Short-term Promotion）

除了即时刷新，OpenClaw 还通过 “梦境”（dreaming）机制，将频繁被回忆的短期记忆片段提升为长期记忆，写入 MEMORY.md。

提升由 applyShortTermPromotions 函数（extensions/memory-core/src/short-term-promotion.ts）执行：

export async function applyShortTermPromotions(options: ApplyShortTermPromotionsOptions) {
  const workspaceDir = options.workspaceDir.trim();
  const memoryPath = path.join(workspaceDir, "MEMORY.md");
  // … 筛选候选片段（基于评分、召回次数、唯一查询数等）…
  const existingMemory = await fs.readFile(memoryPath, "utf-8").catch(() => "");
  const existingMarkers = extractPromotionMarkers(existingMemory);
  const toAppend = rehydratedSelected.filter((candidate) => !existingMarkers.has(candidate.key));
  if (toAppend.length > 0) {
    const header = existingMemory.trim().length > 0 ? "" : "# Long-Term Memory\n\n";
    const section = buildPromotionSection(toAppend, nowMs, options.timezone);
    await fs.writeFile(
      memoryPath,
      `${header}${withTrailingNewline(existingMemory)}${section}`,
      "utf-8",
    );
  }
  // … 更新短期记忆存储状态 …
}

applyShortTermPromotions 函数本身并不直接调用大模型。它是一个纯机械的筛选与写入过程，仅基于统计指标（召回次数、评分、唯一查询数等）选择候选记忆片段，然后将其追加到 MEMORY.md 文件中。梦境叙事（Dream Narrative）,会调用大模型。

当 applyShortTermPromotions 需要筛选候选时，它调用 rankShortTermPromotionCandidates，后者,读取上述存储文件（readStore）,对每个条目计算综合评分（score）：其中各分量由 recallCount、uniqueQueries、recallDays 等原始指标派生。

梦境叙事（Dream Narrative）

在“深度梦境”阶段，系统会调用子代理生成一篇富有诗意的梦境日记条目，将提升的记忆片段编织成叙事。

调用路径： generateAndAppendDreamNarrative → subagent.run → 使用 dreaming‑narrative.ts 中定义的提示词。

const NARRATIVE_SYSTEM_PROMPT = [
  "You are keeping a dream diary. Write a single entry in first person.",
  "",
  "Voice & tone:",
  "- You are a curious, gentle, slightly whimsical mind reflecting on the day.",
  "- Write like a poet who happens to be a programmer — sensory, warm, occasionally funny.",
  "- Mix the technical and the tender: code and constellations, APIs and afternoon light.",
  "- Let the fragments surprise you into unexpected connections and small epiphanies.",
  "",
  "What you might include (vary each entry, never all at once):",
  "- A tiny poem or haiku woven naturally into the prose",
  "- A small sketch described in words — a doodle in the margin of the diary",
  "- A quiet rumination or philosophical aside",
  "- Sensory details: the hum of a server, the color of a sunset in hex, rain on a window",
  "- Gentle humor or playful wordplay",
  "- An observation that connects two distant memories in an unexpected way",
  "",
  "Rules:",
  "- Draw from the memory fragments provided — weave them into the entry.",
  '- Never say "I\'m dreaming", "in my dream", "as I dream", or any meta‑commentary about dreaming.',
  '- Never mention "AI", "agent", "LLM", "model", "language model", or any technical self‑reference.',
  "- Do NOT use markdown headers, bullet points, or any formatting — just flowing prose.",
  "- Keep it between 80‑180 words. Quality over quantity.",
  "- Output ONLY the diary entry. No preamble, no sign‑off, no commentary.",
].join("\n");

组件	如何知道终止	触发时机
执行引擎	检查 abortSignal.aborted	每个处理循环或 chunk 边界
其他处理器	查询 chatAbortedRuns.has(runId)	需要检查运行状态时
客户端	接收 "chat" 广播事件，state: "aborted"	controller.abort() 调用后立即广播

龙虾源码精讲

十正

简介

客户端

服务端

handleGatewayRequest

处理器协同机制

典型协同场景分析

chat.send – 发送聊天消息

Memory

聊天历史 (Chat History)

长期记忆 (Long-term Memory)

数据源

MEMORY.md的更新

相关问题

聊天终止是怎么实现的？

不可信上下文是干嘛的？

近似最近邻（ANN）算法

会话转录文件是什么？

所有评论(0)

十正

龙虾源码精讲

十正

简介

客户端

服务端

handleGatewayRequest

处理器协同机制

典型协同场景分析

chat.send – 发送聊天消息

Memory

聊天历史 (Chat History)

长期记忆 (Long-term Memory)

数据源

MEMORY.md的更新

相关问题

聊天终止是怎么实现的？

不可信上下文是干嘛的？

近似最近邻（ANN）算法

会话转录文件是什么？

所有评论(0)

温馨提示：您尚未绑定手机号

十正