Prompt Evaluation Runner
When to use
Use when you need to evaluate an LLM app, test a prompt systematically, or run red-team/vulnerability scans against a target model or application.
Requirements / Checks
- Check if an evaluation tool is defined in project deps, scripts, lockfiles, or local toolchain (e.g.,
promptfoo,evals,braintrust). - Do not run unvetted remote runners without checking the project's toolchain first (e.g., avoid
npx promptfoo@latestifpromptfoois alr
[Description truncada. Veja o README completo no GitHub.]