development, advanced
LLM-as-Judge Evaluation Systems
Build production-grade LLM evaluation pipelines using direct scoring and pairwise comparison while mitigating systematic biases.
Install
/plugin install llm-as-judge-evaluation-systems@sickn33
Requires Claude Code CLI.
Use cases
ML engineers designing automated quality assessment systems for LLM outputs need practical patterns to choose evaluation approaches, detect bias, and select appropriate metrics.
Stats
Installs: 0
GitHub Stars: 27.2k
Forks: 4595
License: MIT License
Updated: Mar 25, 2026
Creator
sickn33
@sickn33