Draft Shells
Decompose a specification file into shells at .turbo/shells/<spec-slug>-NN-<title>.md. Each shell represents one unit of work for a separate Claude Code session.
Task Tracking
At the start, use TaskCreate to create a task for each step:
- Resolve the source spec
- Decompose into shells
- Resolve open questions
- Write shell files
- Present summary
If the confirmed shell count is one, the Single-Shell Bail-out at the end of Step 2 marks tasks 3-5 deleted via TaskUpdate and exits.
Step 1: Resolve the Source Spec
Determine which spec to decompose using these rules in order:
- Explicit path — If the user passed a file path, use it
- Explicit slug — If a slug was passed, resolve to
.turbo/specs/<slug>.md - Single file — Glob
.turbo/specs/*.md. If exactly one file exists, use it - Most recent — If multiple files exist, use the most recently modified
- Legacy fallback — If
.turbo/specs/does not exist but.turbo/spec.mdexists, use it - Nothing found — If no spec exists, nothing to decompose; stop
The slug of the resolved spec becomes the prefix for shell file names: a spec at .turbo/specs/<slug>.md produces shells at .turbo/shells/<slug>-NN-<title>.md. For the legacy fallback, use slug legacy.
State the resolved spec path and target shell directory before continuing.
Read the spec and identify:
- Scope — total surface area of work
- Work categories — UI, backend, data layer, infrastructure, tests, documentation, tooling
- Spec requirements — enumerate the
R<N>IDs from the spec's## Requirementssection. Every R-id must be tracked in at least one shell's Covers field. - Dependencies — which pieces must exist before others can start
- Greenfield vs existing — is there an established codebase to work within
- Open questions — decisions the spec deferred that will need to be answered at implementation time
If the spec has no ## Requirements section or contains no R<N>-numbered items, use AskUserQuestion with two options: re-run /draft-spec (then restart Step 1 with the resulting spec) or stop so the user can add a ## Requirements section with enumerated R<N> IDs manually. Shells depend on stable R-ids for coverage tracking.
Step 2: Decompose Into Shells
Split the spec into shells, each a unit of work for a separate Claude Code session. The user sets the final count in the gate at the end of this step. The analysis here makes that choice informed: find where the work can be cut, name what must stay together, then recommend a count with options.
Find the Seams
Identify where the work can be cut and the order pieces must land:
- Dependency order — foundational work before dependent work: setup and scaffolding (project init, config, CI), then the data and domain layer (models, schemas, types), then core business logic, then the API and service layer, then UI and frontend, then integration and end-to-end concerns. A hard dependency is a strong seam: a later piece cannot be drafted or expanded until an earlier piece's concrete output exists (generated types, framework wiring, patterns later sessions survey against).
- Natural boundaries — candidate cut points where one piece's output is another's input. A spec's suggested groupings are a starting point; treat them as candidate seams the count gate may regroup.
A seam is weak when cutting it buys nothing: the two sides share no ordering dependency and would sit comfortably in one session. Shared-nothing independence alone is a weak seam. A seam is strong when one side must exist before the other, or when keeping both sides in one session would overload it: too much code to read in full, too many distinct conventions to absorb, or too much output for one window.
Keep Combined
Some pieces must share a shell regardless of the count the user picks:
- Tightly-coupled pieces — when UI, API, and tests are inseparable, keep them in one shell.
- Atomic ripple — when a breaking change to a shared interface requires every consumer across modules to update in lockstep, the change and all consumer updates land in one shell regardless of size. Splitting leaves intermediate states that break dependents.
- Reachability — each shell leaves the codebase fully integrated, with no components unreachable from the project's entry points. Bundle tightly-coupled producer/consumer pairs into one shell, or have a foundation shell include a minimal integration point (a single working endpoint or CLI command) that proves the code is reachable. When a shell builds infrastructure a later shell consumes, name that consumer in the Produces field.
These set the ceiling on the count: the work cannot split past the point where a combined piece would break.
Items folded into a shell go into that shell's Implementation Steps. If several folded items have no clear home, group them into a single "minor fixes" shell at the end.
Wiring Invariants
For each shell, identify the structural contract with the rest of the decomposition:
- Produces — What this shell creates that other shells (or the final system) can use. List concrete artifacts at the conceptual level: modules, types, endpoints, data models, UI screens, migration files. File paths are filled in at expansion time.
- Consumes — What this shell depends on that must already exist. Either listed in a prior shell's Produces (and that producing shell named directly in this shell's frontmatter
depends_on), or marked "from existing codebase" if it predates this decomposition. Every Consumes entry must be traceable to a source. - Covers spec requirements — Which
R<N>IDs from the spec's## Requirementssection this shell implements. The union of Covers across all shells must equal the full set of R-ids in the spec. Every R-id must appear in at least one shell's Covers. Write one R-id per bullet in the Step 4 template. For partial coverage of a single R-id, mark the entryR<N> (partial: <what's deferred>)and name the deferred work in that shell's Open Questions. A bareR<N>for partial coverage breaks the invariant. When a single R-id spans two shells — typically when one shell ships scaffolding or placeholders that a later shell fills — neither shell may claim it bare. Both shells useR<N> (partial: <what's deferred to the other shell>)with non-overlapping deferred slices. The bare form is reserved for an R-id that is fully satisfied as of the end of one shell. Do not invent variant annotations such as(finished: ...),(closing: ...),(completes: ...), or any other annotation that tries to convey "this is the shell that ships the rest"; use the two-partials pattern instead.
Shell Slug
Each shell gets a slug derived from its title using spec slug rules (lowercase, hyphenated, ≤40 chars), prefixed with the shell number: <spec-slug>-NN-<title-slug>. The shell keeps this file name when /expand-shell fills it in.
Example: spec slug photo-sorter-v2, Shell 3 titled "Build duplicate detection" → slug photo-sorter-v2-03-build-duplicate-detection, written to .turbo/shells/photo-sorter-v2-03-build-duplicate-detection.md.
Recommend and Confirm Shell Count
Form a recommended count from the seams and combination constraints above. The trade-off: more shells each cost a fresh-session handoff (lost in-memory context, a repeated pattern survey, an extra /pick-next-shell round); fewer shells risk overloading a session. Land the recommendation where that balance falls: lean toward fewer when the seams are weak, toward more when a strong seam or session overload pushes the work apart.
Output the recommendation as text: the recommended count, a one-line scope for each proposed shell, and a line or two on why that count over its neighbors. Then use AskUserQuestion to have the user set the final count. Offer the recommended count first, marked "(Recommended)", alongside 2-3 alternative counts;