pipeline-eval — System-level evaluation for LLM pipelines
What this skill is for
deepeval scores one content artifact. pipeline-eval scores the system that produced it. Different question, different rubric, complementary.
Use pipeline-eval when:
- The user has a multi-stage pipeline (
pipeline.json, n8n workflow, LangGraph, custom orchestrator) - They want to know: where does quality leak? — at input, at the prompt, at sequencing, at fact-grounding, somewhere else
- They
[Description truncada. Veja o README completo no GitHub.]