write-judge-prompt
Design LLM-as-Judge evaluators for subjective criteria that code-based checks cannot handle. Use when a failure mode requires interpretation (tone, faithfulness, relevance, completeness, reasoning quality). Do NOT use when the failure mode can be checked with code (regex, schema validation, test execution). Default to Claude (claude-sonnet-4-6 or claude-opus-4-6) as the judge model.