Spec-Driven Development (SDD) Plugin: Continuous Learning + LLM-as-Judge + Agent Swarm
Comprehensive specification-driven development workflow plugin that transforms prompts into production-ready implementations through structured planning, architecture design, and quality-gated execution.
This plugin is designed to consistently and reproducibly produce working code. It was tested on real-life production projects by our team, and in 100% of cases it generated working code aligned with the initial prompt. If you find a use case it cannot handle, please report it as an issue.
Key Features
- Development as compilation — The plugin works like a "compilation" or "nightly build" for your development process:
task specs → run /sdd:implement → working code. After writing your prompt, you can launch the plugin and expect a working result when you come back. The time it takes depends on task complexity — simple tasks may finish in 30 minutes, while complex ones can take a few days. - Benchmark-level quality in real life — Model benchmarks improve with each release, yet real-world results usually stay the same. That's because benchmarks reflect the best possible output a model can achieve, whereas in practice LLMs tend to drift toward sub-optimal solutions that can be wrong or non-functional. This plugin uses a variety of patterns to keep the model working at its peak performance.
- Customizable — Balance between result quality and process speed by adjusting command parameters. Learn more in the Customization section.
- Developer time-efficient — The overall process is designed to minimize developer time and reduce the number of interactions, while still producing results better than what a model can generate from scratch. However, overall quality is highly proportional to the time you invest in iterating and refining the specification.
- Industry-standard — The plugin's specification template is based on the arc42 standard, adjusted for LLM capabilities. Arc42 is a widely adopted, high-quality standard for software development documentation used by many companies and organizations.
- Works best in complex or large codebases — While most other frameworks work best for new projects and greenfield development, this plugin is designed to perform better the more existing code and well-structured architecture you have. At each planning phase it includes a codebase impact analysis step that evaluates which files may be affected and which patterns to follow to achieve the desired result.
- Simple — This plugin avoids unnecessary complexity and mainly uses just 3 commands, offloading process complexity to the model via multi-agent orchestration.
/sdd:implementis a single command that produces working code from a task specification. To create that specification, you run/sdd:add-taskand/sdd:plan, which analyze your prompt and iteratively refine the specification until it meets the required quality.
Quick Start
/plugin marketplace add NeoLabHQ/context-engineering-kit
Enable sdd plugin in installed plugins list
/plugin
# Installed -> sdd -> Space to enable
Then run the following commands:
# create .specs/tasks/draft/design-auth-middleware.feature.md file with initial prompt
/sdd:add-task "Design and implement authentication middleware with JWT support"
# write detailed specification for the task
/sdd:plan
# will move task to .specs/tasks/todo/ folder
Restart the Claude Code session to clear context and start fresh. Then run the following command:
# implement the task
/sdd:implement @.specs/tasks/todo/design-auth-middleware.feature.md
# produces working implementation and moves the task to .specs/tasks/done/ folder
Overall Flow
End-to-end task implementation process from initial prompt to pull request, including commands from the git plugin:
/sdd:add-task→ creates a.specs/tasks/draft/<task-name>.<type>.mdfile with the initial task description./sdd:plan→ generates a.claude/skills/<skill-name>/SKILL.mdfile with skills needed to implement the task (by analyzing library and framework documentation used in the codebase), then updates the task file with a refined specification and moves it to.specs/tasks/todo/./sdd:implement→ produces a working implementation, verifies it, then moves the task to.specs/tasks/done/./git:commit→ commits changes./git:create-pr→ creates a pull request.
1. Create 2. Plan 3. Implement 4. Ship
+-------------+ +-----------+ +---------------+ +-----------------+
|/sdd:add-task| | /sdd:plan | |/sdd:implement | | /git:commit |
+------+------+ +-----+-----+ +------+--------+ | | |
| | | | v |
v v v |/git:create-pr |
+-------+---------+
|
Task Lifecycle |
+----------+ +----------+ +--------------+ +---------+
| draft/ +-->| todo/ +-->| in-progress/ +-->| done/ |
| *.md | | *.md | | *.md | | *.md |
+----------+ +----------+ +--------------+ +---------+
Commands
Core workflow commands:
- /sdd:add-task - Create task template file with initial prompt
- /sdd:plan - Analyze prompt, generate required skills and refine task specification
- /sdd:implement - Produce working implementation of the task and verify it
Additional commands useful before creating a task:
- /sdd:create-ideas - Generate diverse ideas on a given topic using creative sampling techniques
- /sdd:brainstorm - Refine vague ideas into fully-formed designs through collaborative dialogue
Available Agents
The SDD plugin uses specialized agents for different phases of development:
| Agent | Description | Used By |
|---|---|---|
researcher | Technology research, dependency analysis, best practices | /sdd:plan (Phase 2a) |
code-explorer | Codebase analysis, pattern identification, architecture mapping | /sdd:plan (Phase 2b) |
business-analyst | Requirements discovery, stakeholder analysis, specification writing | /sdd:plan (Phase 2c) |
software-architect | Architecture design, component design, implementation planning | /sdd:plan (Phase 3) |
tech-lead | Task decomposition, dependency mapping, risk analysis | /sdd:plan (Phase 4) |
team-lead | Step parallelization, agent assignment, execution planning | /sdd:plan (Phase 5) |
qa-engineer | Verification rubrics, quality gates, LLM-as-Judge definitions | /sdd:plan (Phase 6) |
developer | Code implementation, TDD execution, quality review, verification | /sdd:implement |
tech-writer | Technical documentation writing, API guides, architecture updates, lessons learned | /sdd:implement |
Patterns
Key patterns implemented in this plugin:
- Structured reasoning templates — includes Zero-shot and Few-shot Chain of Thought, Tree of Thoughts, Problem Decomposition, and Self-Critique. Each is tailored to a specific agent and task, enabling sufficiently detailed decomposition so that isolated sub-agents can implement each step independently.
- Multi-agent orchestration for context management — Context isolation of independent agents prevents the context rot problem, essentially keeping LLMs at optimal performance at each step of the process. The main agent acts as an orchestrator that launches sub-agents and controls their work.
- Quality gates based on LLM-as-Judge — Evaluate the quality of each planning and implementation s