AI Agents · TokenRhythm Technologies
Claw-SWE-Bench: Why Coding Agent Harnesses Matter
Claw-SWE-Bench evaluates OpenClaw-style coding-agent harnesses on 350 GitHub issue tasks. With a full adapter, OpenClaw's Pass@1 rises from 19.1% to 73.4%.
Institution
AI systems company publishing work on coding-agent benchmarks, harnesses, and evaluation infrastructure.