docs: fuzzy numbers in README (3K/~100/9 tools)

This commit is contained in:
Liang Jiaqing
2026-04-07 22:18:18 +08:00
parent 1e39032c67
commit f4eec277c5

View File

@@ -10,18 +10,18 @@
<a name="english"></a> <a name="english"></a>
## 🌟 Overview ## 🌟 Overview
**GenericAgent** is a minimal, self-evolving autonomous agent framework. Its core is just **~3,300 lines of code**. Through **7 atomic tools + a 92-line Agent Loop**, it grants any LLM system-level control over a local computer — covering browser, terminal, filesystem, keyboard/mouse input, screen vision, and mobile devices (ADB). **GenericAgent** is a minimal, self-evolving autonomous agent framework. Its core is just **~3K lines of code**. Through **9 atomic tools + a ~100-line Agent Loop**, it grants any LLM system-level control over a local computer — covering browser, terminal, filesystem, keyboard/mouse input, screen vision, and mobile devices (ADB).
Its design philosophy: **don't preload skills — evolve them.** Its design philosophy: **don't preload skills — evolve them.**
Every time GenericAgent solves a new task, it automatically crystallizes the execution path into an skill for direct reuse later. The longer you use it, the more skills accumulate — forming a skill tree that belongs entirely to you, grown from 3,300 lines of seed code. Every time GenericAgent solves a new task, it automatically crystallizes the execution path into an skill for direct reuse later. The longer you use it, the more skills accumulate — forming a skill tree that belongs entirely to you, grown from 3K lines of seed code.
> **🤖 Self-Bootstrap Proof** — Everything in this repository, from installing Git and running `git init` to every commit message, was completed autonomously by GenericAgent. The author never opened a terminal once. > **🤖 Self-Bootstrap Proof** — Everything in this repository, from installing Git and running `git init` to every commit message, was completed autonomously by GenericAgent. The author never opened a terminal once.
## 📋 Core Features ## 📋 Core Features
- **Self-Evolving**: Automatically crystallizes each task into an skill. Capabilities grow with every use, forming your personal skill tree. - **Self-Evolving**: Automatically crystallizes each task into an skill. Capabilities grow with every use, forming your personal skill tree.
- **Minimal Architecture**: ~3,300 lines of core code. Agent Loop is just 92 lines. No complex dependencies, zero deployment overhead. - **Minimal Architecture**: ~3K lines of core code. Agent Loop is ~100 lines. No complex dependencies, zero deployment overhead.
- **Strong Execution**: Injects into a real browser (preserving login sessions). 7 atomic tools take direct control of the system. - **Strong Execution**: Injects into a real browser (preserving login sessions). 9 atomic tools take direct control of the system.
- **High Compatibility**: Supports Claude / Gemini / Kimi / MiniMax and other major models. Cross-platform. - **High Compatibility**: Supports Claude / Gemini / Kimi / MiniMax and other major models. Cross-platform.
@@ -40,7 +40,7 @@ This is what fundamentally distinguishes GenericAgent from every other agent fra
| *"Monitor stocks and alert me"* | Install mootdx → build selection flow → configure cron → save skill | **one-line start** | | *"Monitor stocks and alert me"* | Install mootdx → build selection flow → configure cron → save skill | **one-line start** |
| *"Send this file via Gmail"* | Configure OAuth → write send script → save skill | **ready to use** | | *"Send this file via Gmail"* | Configure OAuth → write send script → save skill | **ready to use** |
After a few weeks, your agent instance will have a skill tree no one else in the world has — all grown from 3,300 lines of seed code. After a few weeks, your agent instance will have a skill tree no one else in the world has — all grown from 3K lines of seed code.
##### 🎯 Demo Showcase ##### 🎯 Demo Showcase
@@ -115,12 +115,12 @@ streamlit run frontends/stapp2.py # Alternative Streamlit UI
| Feature | GenericAgent | OpenClaw | Claude Code | | Feature | GenericAgent | OpenClaw | Claude Code |
|------|:---:|:---:|:---:| |------|:---:|:---:|:---:|
| **Codebase** | ~3,300 lines | ~530,000 lines | Open-sourced (large) | | **Codebase** | ~3K lines | ~530,000 lines | Open-sourced (large) |
| **Deployment** | `pip install` + API Key | Multi-service orchestration | CLI + subscription | | **Deployment** | `pip install` + API Key | Multi-service orchestration | CLI + subscription |
| **Browser Control** | Real browser (session preserved) | Sandbox / headless browser | Via MCP plugin | | **Browser Control** | Real browser (session preserved) | Sandbox / headless browser | Via MCP plugin |
| **OS Control** | Mouse/kbd, vision, ADB | Multi-agent delegation | File + terminal | | **OS Control** | Mouse/kbd, vision, ADB | Multi-agent delegation | File + terminal |
| **Self-Evolution** | Autonomous skill growth | Plugin ecosystem | Stateless between sessions | | **Self-Evolution** | Autonomous skill growth | Plugin ecosystem | Stateless between sessions |
| **Out of the Box** | 10 .py files + 5 skills | Hundreds of modules | Rich CLI toolset | | **Out of the Box** | A few core files + starter skills | Hundreds of modules | Rich CLI toolset |
## 🧠 How It Works ## 🧠 How It Works
@@ -138,10 +138,10 @@ GenericAgent accomplishes complex tasks through **Layered Memory × Minimal Tool
> _Perceive environment state → Task reasoning → Execute tools → Write experience to memory → Loop_ > _Perceive environment state → Task reasoning → Execute tools → Write experience to memory → Loop_
The entire core loop is just **92 lines of code** (`agent_loop.py`). The entire core loop is just **~100 lines of code** (`agent_loop.py`).
3**Minimal Toolset** 3**Minimal Toolset**
> _GenericAgent provides only **7 atomic tools**, forming the foundational capabilities for interacting with the outside world._ > _GenericAgent provides only **9 atomic tools**, forming the foundational capabilities for interacting with the outside world._
| Tool | Function | | Tool | Function |
|------|------| |------|------|
@@ -187,18 +187,18 @@ MIT License — see [LICENSE](LICENSE)
<a name="chinese"></a> <a name="chinese"></a>
## 🌟 项目简介 ## 🌟 项目简介
**GenericAgent** 是一个极简、可自我进化的自主 Agent 框架。核心仅 **~3,300 行代码**,通过 **7 个原子工具 + 92 行 Agent Loop**,赋予任意 LLM 对本地计算机的系统级控制能力,覆盖浏览器、终端、文件系统、键鼠输入、屏幕视觉及移动设备。 **GenericAgent** 是一个极简、可自我进化的自主 Agent 框架。核心仅 **~3K 行代码**,通过 **9 个原子工具 + ~100 行 Agent Loop**,赋予任意 LLM 对本地计算机的系统级控制能力,覆盖浏览器、终端、文件系统、键鼠输入、屏幕视觉及移动设备。
它的设计哲学是:**不预设技能,靠进化获得能力。** 它的设计哲学是:**不预设技能,靠进化获得能力。**
每解决一个新任务GenericAgent 就将执行路径自动固化为 Skill供后续直接调用。使用时间越长沉淀的技能越多形成一棵完全属于你、从 3,300 行种子代码生长出来的专属技能树。 每解决一个新任务GenericAgent 就将执行路径自动固化为 Skill供后续直接调用。使用时间越长沉淀的技能越多形成一棵完全属于你、从 3K 行种子代码生长出来的专属技能树。
> **🤖 自举实证** — 本仓库的一切,从安装 Git、`git init` 到每一条 commit message均由 GenericAgent 自主完成。作者全程未打开过一次终端。 > **🤖 自举实证** — 本仓库的一切,从安装 Git、`git init` 到每一条 commit message均由 GenericAgent 自主完成。作者全程未打开过一次终端。
## 📋 核心特性 ## 📋 核心特性
- **自我进化**: 每次任务自动沉淀 Skill能力随使用持续增长形成专属技能树 - **自我进化**: 每次任务自动沉淀 Skill能力随使用持续增长形成专属技能树
- **极简架构**: ~3,300 行核心代码Agent Loop 仅 92 行,无复杂依赖,部署零负担 - **极简架构**: ~3K 行核心代码Agent Loop 约百行,无复杂依赖,部署零负担
- **强执行力**: 注入真实浏览器(保留登录态),7 个原子工具直接接管系统 - **强执行力**: 注入真实浏览器(保留登录态),9 个原子工具直接接管系统
- **高兼容性**: 支持 Claude / Gemini / Kimi / MiniMax 等主流模型,跨平台运行 - **高兼容性**: 支持 Claude / Gemini / Kimi / MiniMax 等主流模型,跨平台运行
## 🧬 自我进化机制 ## 🧬 自我进化机制
@@ -215,7 +215,7 @@ MIT License — see [LICENSE](LICENSE)
| *"监控股票并提醒我"* | 安装 mootdx → 构建选股流程 → 配置定时任务 → 保存 Skill | **一句话启动** | | *"监控股票并提醒我"* | 安装 mootdx → 构建选股流程 → 配置定时任务 → 保存 Skill | **一句话启动** |
| *"用 Gmail 发这个文件"* | 配置 OAuth → 编写发送脚本 → 保存 Skill | **直接可用** | | *"用 Gmail 发这个文件"* | 配置 OAuth → 编写发送脚本 → 保存 Skill | **直接可用** |
用几周后,你的 Agent 实例将拥有一套任何人都没有的专属技能树,全部从 3,300 行种子代码中生长而来。 用几周后,你的 Agent 实例将拥有一套任何人都没有的专属技能树,全部从 3K 行种子代码中生长而来。
<!-- | *"帮我读取微信消息"* | 安装依赖 → 逆向数据库 → 写读取脚本 → 保存 Skill | **一句话调用** | --> <!-- | *"帮我读取微信消息"* | 安装依赖 → 逆向数据库 → 写读取脚本 → 保存 Skill | **一句话调用** | -->
@@ -361,12 +361,12 @@ streamlit run frontends/stapp2.py # 另一种 Streamlit 风格 UI
| 特性 | GenericAgent | OpenClaw | Claude Code | | 特性 | GenericAgent | OpenClaw | Claude Code |
|------|:---:|:---:|:---:| |------|:---:|:---:|:---:|
| **代码量** | ~3,300 行 | ~530,000 行 | 已开源(体量大) | | **代码量** | ~3K 行 | ~530,000 行 | 已开源(体量大) |
| **部署方式** | `pip install` + API Key | 多服务编排 | CLI + 订阅 | | **部署方式** | `pip install` + API Key | 多服务编排 | CLI + 订阅 |
| **浏览器控制** | 注入真实浏览器(保留登录态) | 沙箱 / 无头浏览器 | 通过 MCP 插件 | | **浏览器控制** | 注入真实浏览器(保留登录态) | 沙箱 / 无头浏览器 | 通过 MCP 插件 |
| **OS 控制** | 键鼠、视觉、ADB | 多 Agent 委派 | 文件 + 终端 | | **OS 控制** | 键鼠、视觉、ADB | 多 Agent 委派 | 文件 + 终端 |
| **自我进化** | 自主生长 Skill 和工具 | 插件生态 | 会话间无状态 | | **自我进化** | 自主生长 Skill 和工具 | 插件生态 | 会话间无状态 |
| **出厂配置** | 10 个 .py + 5 个 Skills | 数百模块 | 丰富 CLI 工具集 | | **出厂配置** | 几个核心文件 + 少量初始 Skills | 数百模块 | 丰富 CLI 工具集 |
## 🧠 工作机制 ## 🧠 工作机制
@@ -385,10 +385,10 @@ GenericAgent 通过**分层记忆 × 最小工具集 × 自主执行循环**完
> 感知环境状态 → 任务推理 → 调用工具执行 → 经验写入记忆 → 循环 > 感知环境状态 → 任务推理 → 调用工具执行 → 经验写入记忆 → 循环
整个核心循环仅 **92 行代码**`agent_loop.py`)。 整个核心循环仅 **约百行代码**`agent_loop.py`)。
3**最小工具集** 3**最小工具集**
>GenericAgent 仅提供 **7 个原子工具**,构成与外部世界交互的基础能力 >GenericAgent 仅提供 **9 个原子工具**,构成与外部世界交互的基础能力
| 工具 | 功能 | | 工具 | 功能 |
|------|------| |------|------|