NousResearch/hermes-agent 是一个开源的、可扩展的智能体框架,旨在随用户需求演进,支持动态工具调用与上下文自适应。该项目提供代码仓库、配置示例和基础文档,便于开发者快速集成与定制。 NousResearch/hermes-agent is an open-source, extensible agent framework designed to evolve with user needs, supporting dynamic tool use and context-aware adaptation. The repository includes code, configuration examples, and foundational documentation for developer integration and customization.
Dify 是一个面向生产环境的开源平台,用于构建和部署基于智能体(agentic)的工作流,支持可视化编排、模型集成与应用发布。 Dify is a production-ready open-source platform for building and deploying agentic workflows, featuring visual orchestration, multi-model integration, and one-click application publishing.
vLLM 是一个高性能、内存高效的大型语言模型推理与服务引擎,专为加速 LLM 部署而设计,支持 PagedAttention 等创新技术。 vLLM is a high-throughput, memory-efficient inference and serving engine for large language models, featuring innovations like PagedAttention to significantly improve decoding speed and GPU memory utilization.
这是一个基于大语言模型的开源股票分析系统,支持A股、港股和美股,整合多源行情数据、实时新闻与LLM决策仪表盘,并提供零成本定时运行和多渠道推送功能。 This is an open-source, LLM-powered stock analysis system supporting A-share, H-share, and US markets, integrating multi-source market data, real-time news, an LLM-driven decision dashboard, and zero-cost scheduled execution with multi-channel notifications.
Sem 是一种面向代码理解的新基础构件,它不依赖语言服务器协议(LSP),而是基于 Git 提取和建模代码实体(如函数、类型、依赖关系),旨在提供更稳定、版本感知的语义索引能力。 Sem is a new primitive for code understanding that moves beyond LSPs by modeling code entities (e.g., functions, types, dependencies) directly on top of Git history—enabling version-aware, stable semantic indexing.
Langflow 是一个开源的低代码平台,用于可视化构建、调试和部署基于 LLM 的 AI 代理与工作流。它支持与 LangChain 等主流框架集成,适合快速原型开发。 Langflow is an open-source, low-code platform for visually building, debugging, and deploying LLM-powered AI agents and workflows, with native integration for frameworks like LangChain, enabling rapid prototyping.
litellm 是一个开源的 Python SDK 和 AI 网关代理服务器,支持以统一 OpenAI 兼容格式调用 100+ 种大语言模型 API,并内置成本追踪、安全护栏、负载均衡和日志功能。 litellm is an open-source Python SDK and AI gateway proxy server that enables unified OpenAI-compatible calls to 100+ LLM APIs (including Bedrock, Azure, Vertex AI, Anthropic, etc.), with built-in features like cost tracking, guardrails, load balancing, and logging.
LobeHub 是一个开源的 AI 代理编排平台,旨在将多个 AI 智能体组织为 7×24 小时持续运行的自动化团队,提供招聘、调度与报告等类管理功能。 LobeHub is an open-source AI agent orchestration platform designed to organize multiple AI agents into a 24/7 operational team, offering agent 'hiring', scheduling, and performance reporting capabilities.
字节跳动开源的DEER-Flow是一个面向长周期任务的超级智能体(SuperAgent)框架,支持研究、编码与内容创作,集成沙盒、记忆、工具链、技能模块、子智能体和消息网关等核心能力。 ByteDance's open-source DEER-Flow is a long-horizon SuperAgent framework designed for research, coding, and content creation, featuring integrated sandboxes, memory, tool orchestration, skill modules, subagents, and a message gateway.
本文基于实证基准测试,质疑主流代码健康度指标(如SonarQube、CodeClimate分数)对真实软件缺陷的预测能力,指出其多为‘直觉性’指标而缺乏严谨验证。 This article empirically benchmarks whether popular code health scores (e.g., from SonarQube or CodeClimate) meaningfully predict actual bugs—finding they often reflect 'vibes' rather than validated, predictive signals.
Unsloth Studio 是一个基于 Web 的用户界面,支持在本地训练和运行 Gemma 4、Qwen3.6、DeepSeek、gpt-oss 等开源大模型,降低了本地 AI 模型部署与微调的门槛。 Unsloth Studio is a web-based UI that enables local training and inference of open large language models—including Gemma 4, Qwen3.6, DeepSeek, and gpt-oss—making local LLM fine-tuning and deployment more accessible.
MindPal 是一款面向个人知识管理的 AI 支持应用,主打长期记忆与上下文感知能力,旨在帮助用户持续追踪重要信息;该项目参与了 GitHub Finish-Up-A-Thon 挑战赛。 MindPal is a personal AI assistant app focused on long-term memory and context-aware support for knowledge continuity; it was developed as an entry for the GitHub Finish-Up-A-Thon challenge.
本文探讨AI公司高价收购Reddit历史数据引发的隐私与伦理隐忧,批评当前AI训练数据获取方式缺乏透明度和用户授权。 This article discusses privacy and ethical concerns arising from AI companies paying millions for historical Reddit posts, criticizing the lack of transparency and user consent in AI training data acquisition.
LibreChat 是一个功能丰富的开源 ChatGPT 克隆项目,支持多模型接入(如 GPT-5、Gemini、DeepSeek、o1 等)、智能体(Agents)、MCP 协议、技能系统、代码解释器、DALL-E-3 图像生成及多用户安全认证,可本地自托管。 LibreChat is a feature-rich, open-source ChatGPT alternative supporting multi-model integration (including GPT-5, Gemini, DeepSeek, o1), agents, MCP protocol, skills, code interpreter, DALL-E-3 image generation, and secure multi-user authentication — fully self-hostable.
SABER 是一个面向状态化项目工作空间的新型基准,用于评估大语言模型编码代理在多步操作序列下的运行时安全性,弥补了现有安全评测忽视环境状态演变的空白。 SABER is a novel benchmark for evaluating the operational safety of LLM-based coding agents in realistic, stateful project environments—shifting focus from single-response refusal to safety assessment based on final environment states after action sequences.
Code2LoRA是一种新型超网络框架,可为代码语言模型动态生成仓库级LoRA适配器,在不增加推理时token开销的前提下,高效注入项目上下文知识,应对软件演化挑战。 Code2LoRA is a novel hypernetwork framework that dynamically generates repository-specific LoRA adapters for code language models, enabling efficient injection of project-level context—such as imports, APIs, and conventions—without inference-time token overhead, addressing challenges posed by software evolution.
本文提出了BRepCLIP,首个针对CAD边界表示(BRep)的对比多模态预训练框架,旨在将精确的参数化几何、语言和图像表征对齐,填补了CAD原生格式表征学习的研究空白。 This paper introduces BRepCLIP, the first contrastive multimodal pretraining framework that aligns CAD boundary representations (BReps)—the native, parametric, topology-aware format—with language and image embeddings, addressing a critical gap in CAD representation learning.
本文是面向AI工程师的FastAPI系列教程第三部分,详细讲解如何在FastAPI应用中连接和操作数据库(如SQLAlchemy),包含可直接复用的代码示例与配置步骤。 This is Part 3 of a tutorial series on FastAPI for AI engineers, providing a step-by-step, hands-on guide to integrating databases (e.g., SQLAlchemy) into FastAPI applications, with production-ready code snippets and configuration best practices.
KITScenes是一个面向自动驾驶的新型多模态欧洲数据集,具备高保真传感器(如400米以上长距激光雷达、4D成像雷达)和目前最完整的经验证高清地图,旨在弥补现有数据集在传感器精度、地图完备性和地理多样性上的不足。 KITScenes is a new multimodal European autonomous driving dataset featuring high-fidelity sensors—including long-range lidar (>400m), 4D imaging radar, global-shutter cameras—and the most complete validated HD maps to date, addressing key gaps in sensor fidelity, map completeness, and geographic diversity of existing datasets.
MAOAM 是一种基于视觉-语言模型的统一框架,支持通过文本或点击交互实现对象与材质级别的图像区域选择,填补了现有VLM选择方法在材质感知能力上的空白。 MAOAM is a unified vision-language model framework for interactive image selection that supports both object-level and material-level targeting via text or click inputs—addressing a key gap in existing VLM-based selection methods.