Not on Product Hunt
Claude Skills
AI Agentssoon
Workflowssoon
Creators

Not on Product Hunt

1,496 curated Claude Skills. We rejected 2,904 so you don't have to.

Categories

DevelopmentMarketingSecurityIntegrationsOperationsLegal

Resources

Submit a SkillSearch SkillsCreatorsSitemapllms.txt

Legal

Privacy PolicyTerms of Service

© 2025 Not on Product Hunt. Not affiliated with Product Hunt.

Built for the Claude community

ai-agentsintermediate

Evaluate Agent Results by Metric

Evaluate and rank agent results using metrics, LLM judge comparison, or hybrid approach for AgentHub sessions.

Install

/plugin install evaluate-agent-results-by-metric@alirezarezvani

Requires Claude Code CLI.

Use cases

AI development teams use this to automatically benchmark and rank competing agent solutions by performance metrics or qualitative assessment.

Reviews

No reviews yet. Be the first to review this skill.

Stats

Installs0
GitHub Stars7.4k
Forks887
LicenseMIT
UpdatedMar 27, 2026

Creator

A

Alireza Rezvani

@alirezarezvani

View on GitHub