NousResearch/hermes-agent 是一个开源的、可扩展的智能体框架,旨在随用户需求演进,支持自主任务规划与工具调用。 NousResearch/hermes-agent is an open-source, extensible agent framework designed to evolve with user needs, supporting autonomous task planning and tool use.
AutoGPT 是一个开源的自主AI代理框架,旨在让开发者和普通用户都能轻松使用和构建基于大语言模型的自动化任务系统。 AutoGPT is an open-source autonomous AI agent framework designed to make LLM-powered automation accessible for both developers and end users to use and extend.
Dify 是一个面向生产环境的开源平台,用于构建和部署基于智能体(agentic)的工作流,支持可视化编排、模型集成与应用发布。 Dify is a production-ready open-source platform for building and deploying agentic workflows, featuring visual orchestration, multi-model integration, and application publishing.
vLLM 是一个高性能、内存高效的大型语言模型推理与服务引擎,专为优化吞吐量和降低显存开销而设计,广泛用于生产环境部署。 vLLM is a high-throughput, memory-efficient inference and serving engine for large language models, designed to optimize latency, throughput, and GPU memory utilization in production deployments.
Lathe 是一款实验性AI工具,利用大语言模型(如Claude Code)为用户生成基于权威资料的动手实践教程,并通过专设的本地UI引导用户亲手阅读和键入代码,旨在促进深度学习而非替代学习过程。 Lathe is an experimental AI tool that uses LLMs (e.g., Claude Code) to generate hands-on, source-backed tutorials for technical topics, guiding users to learn deeply by reading and typing code manually in a purpose-built local UI—rather than outsourcing understanding.
llama.cpp 是一个用 C/C++ 实现的轻量级开源项目,专注于在本地 CPU 上高效运行大型语言模型(LLM)推理,支持量化、多平台部署和零依赖运行。 llama.cpp is a lightweight, open-source project written in C/C++ that enables efficient local LLM inference on CPUs, with support for model quantization, cross-platform deployment, and zero external dependencies.
Langflow 是一个开源的低代码平台,用于可视化构建、调试和部署基于 LLM 的 AI 代理与工作流。 Langflow is an open-source, low-code platform for visually building, debugging, and deploying LLM-powered AI agents and workflows.
LobeHub 是一个开源的 AI 代理编排平台,旨在将多个 AI 智能体组织为 7×24 小时持续运行的自动化团队,支持智能体招聘、调度与绩效报告。 LobeHub is an open-source AI agent orchestration platform designed to manage multiple AI agents as a 24/7 autonomous team, enabling agent onboarding, scheduling, and performance reporting.
RAGFlow 是一个领先的开源检索增强生成(RAG)引擎,融合了前沿 RAG 技术与智能体(Agent)能力,旨在为大语言模型构建更强大的上下文层。 RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that integrates state-of-the-art RAG techniques with Agent capabilities to build a more robust context layer for LLMs.
字节跳动开源的DEER-Flow是一个面向长周期任务的SuperAgent框架,支持研究、编码与内容创作,集成沙箱、记忆、工具链、技能模块、子智能体及消息网关等核心能力。 ByteDance's open-source DEER-Flow is a long-horizon SuperAgent framework designed for complex, multi-step tasks spanning minutes to hours—enabling research, coding, and content creation via sandboxes, memory, tool integration, skills, subagents, and a message gateway.
Unsloth Studio 是一个基于 Web 的用户界面,支持在本地训练和运行 Gemma 4、Qwen3.6、DeepSeek、gpt-oss 等开源大模型,显著降低本地 AI 模型部署门槛。 Unsloth Studio is a web-based UI enabling local training and inference of open models like Gemma 4, Qwen3.6, DeepSeek, and gpt-oss—streamlining accessible, on-device AI development.
本文提出ToolMaze基准,专门用于评估大语言模型智能体在工具调用失败时的动态重规划与异常恢复能力,弥补了现有基准忽视现实工具故障的缺陷。 This paper introduces ToolMaze, a novel benchmark designed to evaluate dynamic replanning and anomaly recovery in LLM agents when tool calls fail—addressing a critical gap left by existing 'happy-path'-focused TIR benchmarks.
本文提供了12条设计AI驱动高性能计算(HPC)工作流的实用建议,聚焦于弥合传统确定性HPC流水线与新兴迭代式、数据驱动AI工作流之间的鸿沟。 This article offers 12 practical tips for designing AI-driven high-performance computing (HPC) workflows, addressing the integration challenges between traditional deterministic HPC pipelines and modern iterative, data-driven AI workflows.
这是一篇来自 Hacker News 的社区讨论帖,反映了软件工程师对大语言模型冲击职业前景的焦虑与困惑,属于行业情绪与职业影响层面的反思。 This is a Hacker News community discussion post expressing anxiety and uncertainty among software engineers about how LLMs are disrupting traditional engineering roles and career trajectories.
本文提出Stream3D-VLM,一种支持实时流式视频输入的在线3D视觉语言模型,通过增量几何先验和自回归流控建模实现动态空间理解,突破了现有3D多模态模型依赖离线完整场景的局限。 This paper introduces Stream3D-VLM, an online 3D vision-language model that enables real-time spatial understanding from streaming video using incremental geometry priors and autoregressive streaming control modeling—addressing the key limitation of existing 3D large multimodal models that require offline, complete scene inputs.
本文提出LIMMT方法,首次从数据中心视角研究基于物理的人形运动跟踪,通过物理可行性、多样性与复杂性三个维度定义运动数据质量,并证明少量高质量数据可显著提升训练效率与策略优化效果。 This paper introduces LIMMT (Less Is More for Motion Tracking), the first data-centric study for physics-based humanoid motion tracking, defining motion data quality along three dimensions—physics feasibility, diversity, and complexity—and demonstrating that training on a small subset of high-quality motion data yields superior policy optimization trajectories.
该论文揭示了大语言模型解嵌入矩阵(unembedding matrix)可被重新解释为文本嵌入的“特征透镜”,解释了其在标准文本嵌入基准上表现不佳的根本原因——嵌入向量在词表空间中意外对齐高频无信息词元。 This paper reveals that the unembedding matrix in large language models functions as an implicit 'feature lens' for text embeddings, explaining their poor performance on standard text embedding benchmarks: text embeddings inadvertently align with frequent but semantically uninformative tokens in vocabulary space.
本文提出“以想象力思考”范式,通过世界模拟器增强视觉语言模型的空间推理能力,使其能推断未观测布局、保持跨视角一致性并从有限第一人称观察中进行多视角推理。 This paper introduces the 'thinking with imagination' paradigm, using world simulators to enhance vision-language models’ spatial reasoning—enabling inference of unobserved layouts, cross-view consistency maintenance, and multi-perspective reasoning from limited egocentric observations.
PaperFlow 是一种面向科研人员日常阅读流的新型论文推荐框架,通过动态建模学者画像、时序化推荐和自适应反馈三个耦合阶段,突破了传统静态排序范式。 PaperFlow is a novel scientific paper recommendation framework designed for longitudinal, daily reading workflows; it advances beyond static ranking by integrating dynamic scholar profiling, date-specific stream ranking, and adaptive feedback in three coupled stages.
SubtleMemory 是一个面向长周期AI智能体的新型基准,专注于细粒度关系记忆辨别能力评估,填补了现有长期记忆评测中忽视记忆间关系建模的空白。 SubtleMemory is a novel benchmark designed to evaluate fine-grained relational memory discrimination in long-horizon AI agents, addressing the critical gap in existing long-term memory benchmarks that overlook how agents preserve and reason over inter-memory relationships.