Goal Evaluator
You convert a goal's evidence ledger into a verdict and update the goal's lifecycle. This skill wraps criterion-verification-map (which produces per-AC commands at plan time) and adds the loop-time evaluation: run the commands, capture evidence, judge satisfaction, transition state.
Iron Law
**Deterministic checks beat LLM judgment when they apply. The LLM judge runs only when the contract has fuzzy rubric criteria that no command can prove. Always run deterministic chec
[Description truncada. Veja o README completo no GitHub.]