diff --git a/README.md b/README.md index c2ef6d2..aab7a11 100644 --- a/README.md +++ b/README.md @@ -10,18 +10,18 @@ ## 🌟 Overview -**GenericAgent** is a minimal, self-evolving autonomous agent framework. Its core is just **~3,300 lines of code**. Through **7 atomic tools + a 92-line Agent Loop**, it grants any LLM system-level control over a local computer — covering browser, terminal, filesystem, keyboard/mouse input, screen vision, and mobile devices (ADB). +**GenericAgent** is a minimal, self-evolving autonomous agent framework. Its core is just **~3K lines of code**. Through **9 atomic tools + a ~100-line Agent Loop**, it grants any LLM system-level control over a local computer — covering browser, terminal, filesystem, keyboard/mouse input, screen vision, and mobile devices (ADB). Its design philosophy: **don't preload skills — evolve them.** -Every time GenericAgent solves a new task, it automatically crystallizes the execution path into an skill for direct reuse later. The longer you use it, the more skills accumulate — forming a skill tree that belongs entirely to you, grown from 3,300 lines of seed code. +Every time GenericAgent solves a new task, it automatically crystallizes the execution path into an skill for direct reuse later. The longer you use it, the more skills accumulate — forming a skill tree that belongs entirely to you, grown from 3K lines of seed code. > **🤖 Self-Bootstrap Proof** — Everything in this repository, from installing Git and running `git init` to every commit message, was completed autonomously by GenericAgent. The author never opened a terminal once. ## 📋 Core Features - **Self-Evolving**: Automatically crystallizes each task into an skill. Capabilities grow with every use, forming your personal skill tree. -- **Minimal Architecture**: ~3,300 lines of core code. Agent Loop is just 92 lines. No complex dependencies, zero deployment overhead. -- **Strong Execution**: Injects into a real browser (preserving login sessions). 7 atomic tools take direct control of the system. +- **Minimal Architecture**: ~3K lines of core code. Agent Loop is ~100 lines. No complex dependencies, zero deployment overhead. +- **Strong Execution**: Injects into a real browser (preserving login sessions). 9 atomic tools take direct control of the system. - **High Compatibility**: Supports Claude / Gemini / Kimi / MiniMax and other major models. Cross-platform. @@ -40,7 +40,7 @@ This is what fundamentally distinguishes GenericAgent from every other agent fra | *"Monitor stocks and alert me"* | Install mootdx → build selection flow → configure cron → save skill | **one-line start** | | *"Send this file via Gmail"* | Configure OAuth → write send script → save skill | **ready to use** | -After a few weeks, your agent instance will have a skill tree no one else in the world has — all grown from 3,300 lines of seed code. +After a few weeks, your agent instance will have a skill tree no one else in the world has — all grown from 3K lines of seed code. ##### 🎯 Demo Showcase @@ -115,12 +115,12 @@ streamlit run frontends/stapp2.py # Alternative Streamlit UI | Feature | GenericAgent | OpenClaw | Claude Code | |------|:---:|:---:|:---:| -| **Codebase** | ~3,300 lines | ~530,000 lines | Open-sourced (large) | +| **Codebase** | ~3K lines | ~530,000 lines | Open-sourced (large) | | **Deployment** | `pip install` + API Key | Multi-service orchestration | CLI + subscription | | **Browser Control** | Real browser (session preserved) | Sandbox / headless browser | Via MCP plugin | | **OS Control** | Mouse/kbd, vision, ADB | Multi-agent delegation | File + terminal | | **Self-Evolution** | Autonomous skill growth | Plugin ecosystem | Stateless between sessions | -| **Out of the Box** | 10 .py files + 5 skills | Hundreds of modules | Rich CLI toolset | +| **Out of the Box** | A few core files + starter skills | Hundreds of modules | Rich CLI toolset | ## 🧠 How It Works @@ -138,10 +138,10 @@ GenericAgent accomplishes complex tasks through **Layered Memory × Minimal Tool > _Perceive environment state → Task reasoning → Execute tools → Write experience to memory → Loop_ -The entire core loop is just **92 lines of code** (`agent_loop.py`). +The entire core loop is just **~100 lines of code** (`agent_loop.py`). 3️⃣ **Minimal Toolset** -> _GenericAgent provides only **7 atomic tools**, forming the foundational capabilities for interacting with the outside world._ +> _GenericAgent provides only **9 atomic tools**, forming the foundational capabilities for interacting with the outside world._ | Tool | Function | |------|------| @@ -187,18 +187,18 @@ MIT License — see [LICENSE](LICENSE) ## 🌟 项目简介 -**GenericAgent** 是一个极简、可自我进化的自主 Agent 框架。核心仅 **~3,300 行代码**,通过 **7 个原子工具 + 92 行 Agent Loop**,赋予任意 LLM 对本地计算机的系统级控制能力,覆盖浏览器、终端、文件系统、键鼠输入、屏幕视觉及移动设备。 +**GenericAgent** 是一个极简、可自我进化的自主 Agent 框架。核心仅 **~3K 行代码**,通过 **9 个原子工具 + ~100 行 Agent Loop**,赋予任意 LLM 对本地计算机的系统级控制能力,覆盖浏览器、终端、文件系统、键鼠输入、屏幕视觉及移动设备。 它的设计哲学是:**不预设技能,靠进化获得能力。** -每解决一个新任务,GenericAgent 就将执行路径自动固化为 Skill,供后续直接调用。使用时间越长,沉淀的技能越多,形成一棵完全属于你、从 3,300 行种子代码生长出来的专属技能树。 +每解决一个新任务,GenericAgent 就将执行路径自动固化为 Skill,供后续直接调用。使用时间越长,沉淀的技能越多,形成一棵完全属于你、从 3K 行种子代码生长出来的专属技能树。 > **🤖 自举实证** — 本仓库的一切,从安装 Git、`git init` 到每一条 commit message,均由 GenericAgent 自主完成。作者全程未打开过一次终端。 ## 📋 核心特性 - **自我进化**: 每次任务自动沉淀 Skill,能力随使用持续增长,形成专属技能树 -- **极简架构**: ~3,300 行核心代码,Agent Loop 仅 92 行,无复杂依赖,部署零负担 -- **强执行力**: 注入真实浏览器(保留登录态),7 个原子工具直接接管系统 +- **极简架构**: ~3K 行核心代码,Agent Loop 约百行,无复杂依赖,部署零负担 +- **强执行力**: 注入真实浏览器(保留登录态),9 个原子工具直接接管系统 - **高兼容性**: 支持 Claude / Gemini / Kimi / MiniMax 等主流模型,跨平台运行 ## 🧬 自我进化机制 @@ -215,7 +215,7 @@ MIT License — see [LICENSE](LICENSE) | *"监控股票并提醒我"* | 安装 mootdx → 构建选股流程 → 配置定时任务 → 保存 Skill | **一句话启动** | | *"用 Gmail 发这个文件"* | 配置 OAuth → 编写发送脚本 → 保存 Skill | **直接可用** | -用几周后,你的 Agent 实例将拥有一套任何人都没有的专属技能树,全部从 3,300 行种子代码中生长而来。 +用几周后,你的 Agent 实例将拥有一套任何人都没有的专属技能树,全部从 3K 行种子代码中生长而来。 @@ -361,12 +361,12 @@ streamlit run frontends/stapp2.py # 另一种 Streamlit 风格 UI | 特性 | GenericAgent | OpenClaw | Claude Code | |------|:---:|:---:|:---:| -| **代码量** | ~3,300 行 | ~530,000 行 | 已开源(体量大) | +| **代码量** | ~3K 行 | ~530,000 行 | 已开源(体量大) | | **部署方式** | `pip install` + API Key | 多服务编排 | CLI + 订阅 | | **浏览器控制** | 注入真实浏览器(保留登录态) | 沙箱 / 无头浏览器 | 通过 MCP 插件 | | **OS 控制** | 键鼠、视觉、ADB | 多 Agent 委派 | 文件 + 终端 | | **自我进化** | 自主生长 Skill 和工具 | 插件生态 | 会话间无状态 | -| **出厂配置** | 10 个 .py + 5 个 Skills | 数百模块 | 丰富 CLI 工具集 | +| **出厂配置** | 几个核心文件 + 少量初始 Skills | 数百模块 | 丰富 CLI 工具集 | ## 🧠 工作机制 @@ -385,10 +385,10 @@ GenericAgent 通过**分层记忆 × 最小工具集 × 自主执行循环**完 > 感知环境状态 → 任务推理 → 调用工具执行 → 经验写入记忆 → 循环 -整个核心循环仅 **92 行代码**(`agent_loop.py`)。 +整个核心循环仅 **约百行代码**(`agent_loop.py`)。 3️⃣ **最小工具集** ->GenericAgent 仅提供 **7 个原子工具**,构成与外部世界交互的基础能力 +>GenericAgent 仅提供 **9 个原子工具**,构成与外部世界交互的基础能力 | 工具 | 功能 | |------|------|