Adds rubric-based evaluation to an existing agent codebase. Use it to evaluate agents, measure their quality, or set up LLM-as-a-judge scoring. It supports both single-agent evaluation and multi-subject comparisons.