everything-claude-code · MCP Server · 47/100 via “eval-driven development workflow with automated testing”
The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.
Unique: Integrates eval definition, automated test case generation, and skill evolution into a closed-loop workflow that measures agent performance against quantitative metrics and automatically improves skills based on eval results. Evals are first-class citizens in the development process, not afterthoughts.
vs others: Unlike manual testing or post-hoc evaluation, the eval-driven workflow in everything-claude-code (ECC) makes metrics central to development, so agent performance is measured continuously and skills evolve automatically from quantitative feedback.
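The closed loop described above (define an eval, run test cases, score the result, evolve the skill) can be sketched roughly as follows. This is a minimal illustration, not ECC's actual implementation: `EvalCase`, `EvalDefinition`, `run_eval`, `evolve`, and the exact-match pass-rate metric are all hypothetical names and choices made for this example, and a skill is modeled as a plain function from prompt to answer.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Hypothetical model: a "skill" is just a function from prompt to answer.
Skill = Callable[[str], str]


@dataclass
class EvalCase:
    prompt: str
    expected: str


@dataclass
class EvalDefinition:
    name: str
    cases: List[EvalCase]
    threshold: float  # minimum acceptable pass rate, between 0.0 and 1.0


def run_eval(skill: Skill, definition: EvalDefinition) -> float:
    """Return the fraction of cases where the skill's output matches exactly."""
    if not definition.cases:
        return 0.0
    passed = sum(1 for case in definition.cases
                 if skill(case.prompt) == case.expected)
    return passed / len(definition.cases)


def evolve(skill: Skill, definition: EvalDefinition,
           candidates: List[Skill]) -> Tuple[Skill, float]:
    """Try candidate revisions until the threshold is met; keep the best scorer."""
    best, best_score = skill, run_eval(skill, definition)
    for candidate in candidates:
        if best_score >= definition.threshold:
            break  # current best already satisfies the eval
        score = run_eval(candidate, definition)
        if score > best_score:
            best, best_score = candidate, score
    return best, best_score


# Assumed toy eval: add two integers written as "a+b".
definition = EvalDefinition(
    name="addition",
    cases=[EvalCase("2+2", "4"), EvalCase("3+3", "6")],
    threshold=0.9,
)


def baseline(prompt: str) -> str:
    return "4"  # deliberately weak skill: always answers "4"


def improved(prompt: str) -> str:
    return str(sum(int(part) for part in prompt.split("+")))


if __name__ == "__main__":
    print("baseline pass rate:", run_eval(baseline, definition))
    best, score = evolve(baseline, definition, [improved])
    print("selected skill:", best.__name__, "pass rate:", score)
```

In this sketch the eval definition carries its own threshold, so the same metric that reports performance also decides whether a skill revision is adopted. A real system would likely generate candidates and test cases automatically rather than take them as a fixed list.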