返回首页

对比阅读

对比阅读:MiniMax Launches MaxHermes: Self-Evolving Agent Builds Own Skills 与 MiniMax Launches MaxHermes: Self-Evolving Agent Builds Own Skills

AZH
MiniMaxMaxHermesM2.7·

MiniMax Launches MaxHermes: Self-Evolving Agent Builds Own Skills

事件概述

中国 AI 公司 MiniMax 正 式推出 MaxHermes——一款云端托管 Agent,官 方将其定位为全球首个云沙箱 Hermes 级 系统。产品已上线,访问地址为 agent .minimax.io/max-hermes。该消息最 初通过微信渠道发布,并经稀土 掘金旗下 AI 频道跟进报道。来 源未披露具体上线日期,仅表示" 本周"已正式推出。

MiniMax 的核 心主张是:MaxHermes 能够在任务执行完 成后,自主提取可复用的 Skills(技能),将其以 独立文档的形式存入持久化存储,并在后续处理相 似任务时按需加载——同时根据新的使 用反馈对这些 Skills 进行迭代优化。整个过程无 需人工介入来定义或更新技能集。

该 系统基于 Hermes Agent 框架构建,底层运行 M iniMax 自研的 M2.7 模型,并支持跨会话持久化记忆。这 意味着在某一会话中积累的知识可以延 续到后续会话——MiniMax 特别将这一 特性与大多数托管 LLM 部署默认的无状态模式 进行对比,后者在每次对话结束后会重 置上下文。

为何值得关注

Hermes Agent 这 一产品类别由 OpenClaw 在今年早 些时候带入大众视野,其采 用的是人工预定义 Skill 架构。这种设计模 式的能力上限,完全取决于开发者事先编 写了多少功能。MaxHermes 则提出了截 然不同的架构逻辑:能力上限由实际使用深度决 定,而非开发者的预判范围。

若自动 技能生成机制在规模化场景下依然有效,其 对企业级部署的影响不容忽视:

  • 使 用量带来的复利效应:高频用户将 随时间推移获得能力显著提升的 Agent,由此形成自然积累的 迁移成本。
  • 降低提示工程投入:目前需 要花费大量工程资源搭建 Skill 脚手架的 团队,可以将这部分工作转移给 Agent 自行完成。
  • 在拥 挤赛道中建立差异化:据悉,国内各 大 AI 实验室均在布局 Hermes 类产品。MiniMax 正试图在 窗口期关闭之前,率先锁定"自我进化"的市场定位。

零 代码部署同样具有重要的企业销售价值。MaxHermes 原 生集成 飞书(Lark)、钉钉和企业微信—— 国内三大主流企业即时通讯平台——无需任何环境配置或代 码开发即可接入。对于中型企业中的非技术决策者而言,这 直接消除了采用 AI Agent 的主要门槛。

注意事项

本文信息来源的原始文章带 有明显的宣传色彩,且仅来自单一中文开发者社区帖子。目 前尚无独立的基准测试数据、第三方评估报告,也未披 露任何用户数量数据。关于自我进化和跨会话记忆的相关声 明,在经独立验证之前,均应视为厂商自 述,需保持审慎态度。

技术细节

根据现有描述,其架构由三个核 心组件构成:

  • 任务执行器:Hermes Agent 框架负责多步骤任务拆解与 执行,底层调用 M2.7 主干模型。
  • 技能提取器 :任务完成后,系统自动运行一次提取流程,识别可复用的流 程化模式,并以独立技能文档的形式写入持久化存储。
  • 技能加载器:接收新任务时,系统执行检索步骤,将传 入的任务上下文与已存储的技能库进行匹配,并将相关文档加载至 当前活跃的上下文窗口中。

从结构上看,这与 将 Retrieval-Augmented Generation(RAG)应用于程序性记 忆(而非事实性知识)的思路高度相似——并在 此基础上加入了根据后续执行结果更新技能文档的反 馈循环。来源未说明技能更新机制是基于规则还 是模型驱动,也未描述如何评估技能质量或剔除低 质量技能。

跨会话记忆被列为一项核 心特性,与大多数托管 LLM API 默认的无状态模式形成对照 。具体实现机制(向量存储、键值数据库抑 或结构化数据库)在来源中未予披露。

部署方面 ,官方描述为 10 秒内完成云实例启动,无需本 地环境配置。与飞书、钉钉和企业微信的集成为原生接 入方式,而非基于 Webhook,但技术细节暂未公 开。

后续看点

  • 独立评测( 未来 2–4 周):关注中文开发者社 区针对任务完成率和技能留存准确性的基 准测试。自我进化是整个产品主张的核心支撑——亟需第三方进 行压力测试加以验证。
  • OpenClaw 的竞争回应:若 MaxHermes 凭借"无预定义技能"的定位获得市场认 可,预计 OpenClaw 及同类西方 Hermes 类产品将加 速推进动态技能生成功能。
  • M2.7 模型信息披 露:MiniMax 目前尚未公开 M2.7 的能力详情、参 数规模或基准测试表现。未来 30 天内若有任何技术层面的信息披露,将从 根本上改变外界对 MaxHermes 能力上限的评估。
  • 企业 试点公告:飞书/钉钉/企业微信的集成方 向表明 MiniMax 正积极拓展企业试点。若 有具名客户公告落地,将为规模化部 署能力提供有效背书。
  • 监管环境:国 家互联网信息办公室持续更新生成式 AI 服务相关法规。任何针对具备持久化记 忆的 Agent 系统的新合规要求,都可能对 MaxHermes 的产品路线图产生影响。
来源: juejin.cn
BEN
MiniMaxMaxHermesM2.7·

MiniMax Launches MaxHermes: Self-Evolving Agent Builds Own Skills

What Happened

MiniMax, the Chinese AI company, has launched MaxHermes — a cloud-hosted agent it describes as the world's first cloud-sandbox Hermes-class system . The product is live at agent.minimax.io/max-hermes, according to a MiniMax post circulated via WeChat and covered by Juejin's AI vertical. No launch date beyond " this week" was specified in the source.

The core claim: MaxHermes autonomously extracts reusable Skills from completed task executions, stores them as discrete documents, and loads them on demand for future similar tasks — improving those Skills iteratively based on new usage feedback. No human instruction is required to define or update the skill set.

The system is built on the Hermes Agent framework and runs on MiniMax's own M2.7 model. It supports cross-session persistent memory, meaning knowledge acquired in one session carries forward — a property MiniMax explicitly contr asts with stateless LLM deployments that reset context between conversations.

Why It Matters

The Hermes agent category was popularized earlier this year by OpenClaw, which uses a manually pre -defined Skill architecture. That design hard-caps capability at whatever a developer has explicitly programmed. MaxHermes is a direct architectural counter-argument: capability ceiling is determined by usage depth rather than developer foresight.

If the self-skill -generation mechanism holds at scale, the implications for enterprise deployment are significant:

  • Compounding returns on usage : Heavier users get a materially more capable agent over time, creating switching costs that accumulate organically.
  • Reduced prompt engineering overhead: Teams that currently invest engineering hours in skill scaffolding could offload that work to the agent itself.
  • Differentiation in a crowded market: Every major Chinese AI lab is reportedly building in the Hermes category. M iniMax is attempting to claim the "self-evolving" positioning before the window closes.

The zero-code deployment story also has enterprise sales implications. MaxHermes connects natively to Fe ishu (Lark), DingTalk, and WeCom — the three dominant Chinese enterprise messaging platforms — without requiring environment setup or code. For non -technical buyers in mid-market Chinese companies, that removes the primary adoption friction for agentic AI.

Caveats

The source article is promotional in tone and originates from a single Chinese developer community post. No independent benchmark data, no third-party evaluation, and no user count figures are provided. Claims about self -evolution and cross-session memory should be treated as vendor assertions until independently verified.

The Technical Detail

The architecture as described breaks down into three components :

  • Task executor: Hermes Agent framework handles multi -step task decomposition and execution against the M2.7 backbone model.
  • Skill ext ractor: Post-task, the system runs an extraction pass to identify reusable procedural patterns and writes them to persistent storage as discrete skill documents.
  • Skill loader: On new task intake, a retrieval step matches incoming task context against the stored skill library and loads relevant documents into the active context window.

This is structurally similar to Retri eval-Augmented Generation (RAG) applied to procedural memory rather than factual knowledge — with the addition of a feedback loop that updates skill documents based on subsequent executions. The source does not specify whether the skill update mechanism is rule-based or model-driven, nor does it describe how skill quality is evaluated or bad skills are pru ned.

Cross-session memory is described as a first-class feature, contrasting with the stateless default of most hosted LLM APIs. The implementation mechanism (vector store , key-value, structured DB) is not disclosed in the source.

Deployment is described as a 10-second cloud instance spin-up with no local environment requirements. Integration with Feishu, DingTalk, and WeCom appears to be native rather than webhook-based, though technical specifics are not provided.

What To Watch

  • Independent evals (next 2 -4 weeks): Watch for Chinese developer community benchmarks on task completion rates and skill retention accuracy. The self-evolution claim is the load-bearing assertion here — it needs third-party stress testing.
  • OpenClaw competitive response: If MaxHermes gains traction on the "no pre-defined skills" positioning , expect OpenClaw and comparable Western Hermes-category products to accelerate dynamic skill-generation features.
  • M2.7 model disclosure : MiniMax has not publicly detailed M2.7's capabilities, parameter count, or benchmark performance. Any technical disclosure in the next 30 days would materially change the evaluation of MaxHermes's capability ceiling.
  • Enterprise pilot announcements: The Feishu/DingTalk/WeCom integrations suggest MiniMax is targeting enterprise pilots aggressively. Named customer announcements would validate the deployment claim at scale.
  • Regulatory context: China's Cyberspace Administration continues to update generative AI service regulations. Any new compliance requirements affecting agentic systems with persistent memory could affect MaxHermes's road map.
来源: juejin.cn