developmentadvanced

LLM-as-Judge Evaluation Systems

Build production-grade LLM evaluation pipelines using direct scoring and pairwise comparison while mitigating systematic biases.

Install

/plugin install llm-as-judge-evaluation-systems@sickn33

Requires Claude Code CLI.

Use cases

ML engineers designing automated quality assessment systems for LLM outputs need practical patterns to choose evaluation approaches, detect bias, and select appropriate metrics.

Reviews

No reviews yet. Be the first to review this skill.

Stats

Installs0
GitHub Stars27.2k
Forks4595
LicenseMIT License
UpdatedMar 25, 2026