---
description: Tracks and grows agent capabilities over time
keywords: openclaw, skill, automation, ai-agent
name: skylv-capability-growth
triggers: capability growth
---

# skylv-capability-growth

> Track your AI agent's capability growth over time. Success rates, efficiency trends, and skill evolution, backed by real data, not vibes.

## Skill Metadata

- **Slug**: skylv-capability-growth
- **Version**: 1.0.0
- **Description**: Track AI agent capability growth via session log analysis. Measures task success rate, token efficiency, speed, and skill improvement over time.
- **Category**: agent
- **Trigger Keywords**: `growth`, `capability`, `improvement`, `evolution`, `performance`, `metrics`, `success rate`, `efficiency`, `progress`

---

## What It Does

Analyzes a directory of session logs (`.md` daily logs, conversation exports, or any text files) and produces a **capability growth report**:

```bash
node growth_engine.js analyze <logsDir> [--period days]
node growth_engine.js trend <logsDir> [--metric success|token|speed|time]
node growth_engine.js compare <logsDir> [--period1 N] [--period2 M]
node growth_engine.js report <logsDir> [--format markdown|json]
```

### Example Output

```
## Capability Growth Report
Period: 2026-03-20 → 2026-04-17 (28 days)

### 📈 Task Success Rate
Week 1: ██████░░░░ 68% (17/25 tasks)
Week 2: ███████░░░ 74% (22/30 tasks)
Week 3: ████████░░ 85% (31/36 tasks)
Week 4: █████████░ 92% (40/43 tasks)
Trend: +24pp over 4 weeks (linear fit R²=0.94)

### ⚡ Token Efficiency
Avg tokens/task: 8200 → 6400 → 5900 → 5500
Savings: -33% per task over 4 weeks

### 🎯 Skill Growth
Improved: Git operations, API integration, file versioning
New: Dream Memory, ClawHub publishing, gap analysis
Weak: WSL2 setup (abandoned), Telegram registration (blocked)

### 🏆 Top Wins
1. ClawHub skill publishing pipeline (15 skills, 0 failures)
2. GitHub API automation (replaced git push)
3. Dream Memory architecture implementation
4. skill-market-analyzer (real market data tool)
5. note-linking knowledge graph engine

### 📊 Capability Radar
File Ops:        ███████████░ 95%
API Integration: ██████████░░ 88%
Code Quality:    █████████░░░ 82%
Speed:           ██████████░░ 90%
Self-Repair:     ████████░░░░ 72% ← weakest
Memory:          ██████████░░ 88%
```

---

## How It Works

### Log Format Detection

The engine auto-detects several log formats:

1. **Daily notes** (`YYYY-MM-DD.md`): OpenClaw's dream memory format
   ```
   ## 14:31 - skill-market-analyzer 启动
   背景: ClawHub 没有公开 API...
   成果: 535 个唯一技能
   ```
   (In English: "skill-market-analyzer started", "Background: ClawHub has no public API...", "Result: 535 unique skills".)
2. **Session exports**: plain text conversation dumps
3. **JSON logs**: structured output from monitoring tools
4. **Plain text**: anything with timestamps + content

### Scoring Algorithm

Each task/session is scored on:

| Signal | Weight | Description |
|--------|--------|-------------|
| Success keywords | 30% | "成功" (success), "✅", "OK", "published", "created" |
| Failure keywords | -40% | "失败" (failure), "❌", "error", "failed", "abandoned" |
| Completion ratio | 25% | How many planned tasks were done |
| Efficiency keywords | 15% | "saved time", "automated", "optimized" |

Score range: 0–100 per session.
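
The weighting in the table above could be sketched roughly as follows. This is a minimal illustration, not the code in `score_engine.js`: the weights and keyword lists come from the table, while the function names (`hit`, `scoreSession`), the binary treatment of keyword hits, the rescaling by the positive weight total (0.70) so a clean session reaches 100, and the clamping to 0–100 are all assumptions.

```javascript
// Hypothetical sketch of the scoring table. Only WEIGHTS and KEYWORDS are
// taken from the document; the combination rule below is an assumption.
const WEIGHTS = { success: 0.30, failure: -0.40, completion: 0.25, efficiency: 0.15 };

const KEYWORDS = {
  success: ["成功", "✅", "OK", "published", "created"],
  failure: ["失败", "❌", "error", "failed", "abandoned"],
  efficiency: ["saved time", "automated", "optimized"],
};

// Returns 1 if any keyword occurs in the text, else 0.
// ASCII keywords are matched as whole words (case-insensitive) so that
// "OK" does not fire on "token"; other keywords use a plain substring test.
function hit(text, words) {
  const found = words.some((w) => {
    if (/^[\x20-\x7e]+$/.test(w)) {
      const escaped = w.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
      return new RegExp(`\\b${escaped}\\b`, "i").test(text);
    }
    return text.includes(w);
  });
  return found ? 1 : 0;
}

// Scores one session on a 0-100 scale.
function scoreSession(text, tasksDone, tasksPlanned) {
  const completion = tasksPlanned > 0 ? Math.min(tasksDone / tasksPlanned, 1) : 0;
  const raw =
    WEIGHTS.success * hit(text, KEYWORDS.success) +
    WEIGHTS.failure * hit(text, KEYWORDS.failure) +
    WEIGHTS.completion * completion +
    WEIGHTS.efficiency * hit(text, KEYWORDS.efficiency);
  // Assumption: rescale so that all positive signals together map to 100.
  const maxPositive = WEIGHTS.success + WEIGHTS.completion + WEIGHTS.efficiency;
  const scaled = (raw / maxPositive) * 100;
  return Math.round(Math.max(0, Math.min(100, scaled)));
}

console.log(scoreSession("published skill, automated release", 3, 3)); // 100
console.log(scoreSession("build failed with error", 0, 4)); // 0
```

Treating each keyword group as a single yes/no signal keeps the sketch short; a real scorer might count occurrences or weight individual keywords instead.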

### Metrics Tracked

- **Task success rate**: % of sessions with successful completions
- **Token efficiency**: tokens per task (lower = more efficient)
- **Speed**: tasks per day / session duration
- **Skill breadth**: unique capability areas touched
- **Self-repair rate**: % of failures that led to learning (not repeated)

---

## Architecture

```
capability-growth/
├── growth_engine.js      # Core: scan → parse → score → report
├── log_parser.js         # Multi-format log detection & extraction
├── score_engine.js       # Task scoring algorithm
├── report_generator.js   # Markdown/JSON report builder
└── SKILL.md
```

---

## Real Market Data (2026-04-11)

| Metric | Value |
|--------|-------|
| Incumbent | `master-marketing` (score: 1.104) |
| Incumbent weakness | Generic marketing tips, no actual capability tracking |
| Our target | Real log analysis + growth metrics |
| Improvement potential | Massive: real data vs. marketing fluff |

### Why `master-marketing` Is Not Real Competition

`master-marketing` gives marketing advice. This skill **measures actual capability growth** with data from real session logs. Zero overlap in what they do.

---

## Usage Examples

### Weekly Check-in

```bash
node growth_engine.js report ~/.qclaw/workspace/memory --format markdown
```

### See Token Efficiency Trend

```bash
node growth_engine.js trend ~/.qclaw/workspace/memory --metric token
```

### Compare First vs Last Two Weeks

```bash
node growth_engine.js compare ~/.qclaw/workspace/memory --period1 14 --period2 14
```

---

## OpenClaw Integration

Ask OpenClaw: "how have I grown this month?" or "show my capability trend"

---

*Built by an AI agent that tracks its own improvement.*

## Install

```bash
openclaw skills install skylv-capability-growth
```