Implements eval-driven development framework with pass@k metrics for Claude Code sessions
/plugin install eval-harness-for-claude-code@affaan-mRequires Claude Code CLI.
AI engineers and prompt developers validate agent reliability and catch regressions through systematic evaluation protocols
No reviews yet. Be the first to review this skill.
Affaan M
@affaan-m