MarkTechPost AI | Model Updates

Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with Streaming Tool Output

June 3, 2026, 09:38

Key Summary

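Nous Research has launched a public preview of Hermes Desktop, a native application for macOS, Windows, and Linux that gives the open-source Hermes Agent a graphical interface. Previously, users could run Hermes only through a command-line interface (CLI) and messaging gateways. The current version is Hermes Agent v0.15.2. According to Nous Research's documentation, the desktop app reuses the same agent core and shares configuration, API keys, sessions, skills, and memory with the CLI and gateway. The desktop app is simply another surface over the same agent, not a fork.

What is Hermes Desktop?

Hermes Agent is an autonomous AI agent, not a coding assistant tied to an editor. It runs tasks, calls tools, and keeps state across sessions. An "agent" here means a model that plans, acts, and observes in a loop. Hermes Desktop is a graphical front end built on top of that agent.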

Site AI-Compiled Version

Nous Research has released Hermes Desktop in public preview. It is a native application for macOS, Windows, and Linux that gives the open-source Hermes Agent a graphical interface. Until now, users ran Hermes through a CLI and messaging gateways. The current build is Hermes Agent v0.15.2. Per Nous Research's documentation, the desktop reuses the same agent core and shares configuration, API keys, sessions, skills, and memory with the CLI and gateway. The desktop is another surface over one agent, not a fork.

What is Hermes Desktop?

Hermes Agent is an autonomous AI agent, not a coding copilot tied to an editor. It runs tasks, calls tools, and keeps state across sessions. An agent here means a model that plans, acts, and observes in a loop. Hermes Desktop is a GUI on top of that same agent core, and it needs no terminal to use.

The window shows streaming responses and live tool activity. A right-hand pane previews web pages, files, and tool outputs. The app also includes a file browser, voice input and output, and a settings UI.

Sessions are shared across surfaces. A conversation started in the desktop resumes in the CLI or TUI, and the reverse also works, because state is not duplicated.

macOS and Windows offer direct installers. Linux installs from the terminal on any distribution. An install script with an --include-desktop flag builds the app against an existing install.

The Closed Learning Loop

The Nous Research team describes Hermes as having a closed learning loop, which is what separates it from a simple chat wrapper. After a complex task, the agent writes a reusable skill, and those skills then self-improve during later use. Memory is persistent and agent-curated, with periodic nudges to save knowledge. Cross-session recall uses FTS5 session search with LLM summarization. User modeling is handled by Honcho dialectic user modeling. In practice, longer use means more retained context and reuse. Skills follow the agentskills.io open standard.
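To make the FTS5 session search mentioned above more concrete, here is a minimal sketch of full-text recall over past sessions using SQLite's FTS5 extension from Python. It is not Hermes's actual implementation: the table name `session_index`, the functions `build_session_index` and `recall`, and the sample sessions are all hypothetical, and the LLM summarization step is left out. It also assumes the local SQLite build ships with FTS5 enabled.

```python
import sqlite3


def build_session_index(conn, sessions):
    """Create an FTS5 table (hypothetical schema) and load (session_id, text) pairs."""
    conn.execute(
        "CREATE VIRTUAL TABLE IF NOT EXISTS session_index "
        "USING fts5(session_id UNINDEXED, content)"
    )
    conn.executemany(
        "INSERT INTO session_index (session_id, content) VALUES (?, ?)",
        sessions,
    )
    conn.commit()


def recall(conn, query, limit=3):
    """Return (session_id, snippet) rows, best BM25 match first."""
    # Quote each term so punctuation is not parsed as FTS5 query syntax.
    terms = " ".join('"' + t.replace('"', '""') + '"' for t in query.split())
    if not terms:
        return []
    return conn.execute(
        "SELECT session_id, snippet(session_index, 1, '[', ']', '...', 8) "
        "FROM session_index WHERE session_index MATCH ? "
        "ORDER BY bm25(session_index) LIMIT ?",
        (terms, limit),
    ).fetchall()


# Illustrative data only; these sessions are invented for the example.
conn = sqlite3.connect(":memory:")
build_session_index(
    conn,
    [
        ("s1", "Set up a nightly database backup"),
        ("s2", "Drafted the weekly sales report"),
        ("s3", "Debugged Docker sandbox networking"),
    ],
)
for session_id, snippet in recall(conn, "backup"):
    print(session_id, snippet)
```

In a design like this, the matched snippets would presumably be passed to a model for summarization before being placed back into context, which keeps recall cheap relative to replaying whole sessions.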
How It Connects, Schedules, and Sandboxes

Hermes runs across messaging platforms from one gateway. The desktop lists Telegram, Discord, Slack, WhatsApp, Signal, Email, and CLI. You can start a task on one platform and continue on another.

Scheduling uses natural language for reports, backups, and briefings. These jobs run unattended through the gateway on a built-in cron scheduler.

Delegation spawns isolated subagents with their own conversations and terminals. A subagent is a separate worker that handles one job. Python RPC scripts collapse multi-step pipelines into zero-context-cost turns.

Execution is sandboxed. The desktop lists five backends: local, Docker, SSH, Singularity, and Modal. It applies container hardening and namespace isolation, which limits what a running process can see or touch.

Built-in tools include web search, browser automation, vision, image generation, text-to-speech, and multi-model reasoning. Hermes also connects external tools through MCP, the Model Context Protocol, a standard for tool integration.

Nous Portal and the Tool Gateway

Hermes works with any provider, so API keys are optional. Nous Portal bundles them under one subscription instead. Portal tiers are Free, Plus, Super, and Ultra. Paid tiers include monthly credits, access to 300+ models, and built-in tool use.

The Tool Gateway routes several tools through one account. Web search uses Firecrawl, image generation uses FAL, text-to-speech uses OpenAI, and the cloud browser uses Browser Use.

"The next evolution of Hermes Agent is here! Introducing Hermes Desktop: everything you love about Hermes, now native on your machine. First demoed in Jensen's GTC keynote, it's now in public preview."
Source: Nous Research (@NousResearch), June 2, 2026, pic.twitter.com/8ND1k8hyaz

Strengths and Questions

Strengths:
Native installers remove the terminal requirement for most users.
Streaming output and previews make tool calls easier to inspect.
Persistent memory and self-improving skills reduce repeated instructions.
Model-agnostic design avoids lock-in to a single provider.
The MIT license allows audit, self-hosting, and modification.

Questions:
The product is in public preview, so expect rough edges.
Autonomous memory and scheduling raise oversight and review questions.
The Linux desktop still installs through the terminal.
Broad capability means a steeper learning curve for beginners.

Key Takeaways

Nous Research released Hermes Desktop in public preview, a native macOS, Windows, and Linux app for its open-source Hermes Agent.
The GUI shares one agent core, configuration, API keys, sessions, skills, and memory with the CLI and gateway; sessions resume across surfaces.
It runs without a terminal and offers streaming tool output, a side-by-side preview pane, a file browser, voice I/O, and a settings UI.
Hermes is model-agnostic and MIT-licensed, working with Nous Portal, OpenRouter, OpenAI, or any compatible endpoint.
The current build is Hermes Agent v0.15.2, backed by a closed learning loop, MCP tool support, and five sandbox backends.

The post Nous Research Releases Hermes Desktop: A Native Cross-Platform Front End for Hermes Agent v0.15.2 with Streaming Tool Output appeared first on MarkTechPost.
