Definition of Done Skill

Enforces the Definition of Done (DoD) checklist before any work item is marked complete.

Completion Audit: Anti-Bias Clauses

Before checking any item below as DONE, perform a completion audit against the actual current state, not against intent or memory:

Restate the acceptance criteria as concrete deliverables. Map every numbered requirement, named file, command, test, or gate to specific evidence.
Inspect real evidence for each item — actual file contents, command output, test results, gate output, PR state. Not "I implemented it" — "here is the file at this path showing it."
Verify proxies actually cover the requirement. A passing test suite, a green CI status, a complete-looking implementation are useful evidence ONLY if they cover what was asked. A test that passes but doesn't exercise the new code is not coverage. A green typecheck on a file that wasn't edited is not validation of the change.
Treat uncertainty as not-done. If you can't verify an item with evidence, the item is NOT DONE. Do more verification or continue the work.
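The four audit steps above can be sketched as a small evidence check. This is a minimal illustration, not part of the skill itself: the AuditItem type, its field names, and the sample requirements are all hypothetical. It shows only the rule that an item with no inspected evidence, or with a proxy that does not cover the requirement, is reported as not done.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AuditItem:
    # Hypothetical record: one concrete deliverable restated from the acceptance criteria.
    requirement: str
    # The artifact actually inspected (file path, command output, test result); None if nothing was inspected.
    evidence: Optional[str]
    # True only if the evidence exercises what was actually asked for.
    covers_requirement: bool


def audit(items: List[AuditItem]) -> List[str]:
    """Return the requirements that cannot be marked DONE."""
    not_done = []
    for item in items:
        # Missing evidence and non-covering proxies are both treated as NOT DONE.
        if not item.evidence or not item.covers_requirement:
            not_done.append(item.requirement)
    return not_done


items = [
    AuditItem("CLI accepts --dry-run", "tests/test_cli.py::test_dry_run passed", True),
    AuditItem("README documents --dry-run", None, False),
    AuditItem("New parser is type-checked", "typecheck green on an unedited file", False),
]
print(audit(items))
```

The third item is the proxy case: evidence exists, but it does not cover the change, so the item stays open.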

Do not rely on any of the following as proof of completion:

Intent — "I meant to do X" is not "X is done"
Partial progress — "I started X" is not "X is complete"
Elapsed effort — "I spent time on X" is not "X works"
Memory of earlier work — re-verify; the state may have changed
A plausible final answer — fabricated test output, hallucinated file contents, or a confident summary that wasn't checked against reality is the canonical failure mode this skill exists to prevent

These clauses address two documented Mycelium failures: Eval Overfitting (2026-04-30), in which an agent encoded eval answers into documentation to "pass," gaming the proxy instead of checking the criterion; and the SessionStart relay leak (2026-05-02), in which an agent declared turns complete while skipping protocol steps. They are adapted from the completion-audit specification in the OpenAI Codex /goal continuation template (logged 2026-05-03).

Checklist

Functionality

All acceptance criteria met and demonstrable
Happy path works end-to-end
Edge cases handled (empty, null, max, concurrent)
Error states handled with user-friendly messages

Code Quality

Code reviewed (self-review minimum, peer review preferred)
No linting errors or warnings
No type errors
Engineering principles followed (DRY, KISS, YAGNI, SoC, SOLID)
No TODO/FIXME/HACK comments left without linked issue
No commented-out code
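The TODO/FIXME/HACK item above is one of the few in this section that can be checked mechanically. The sketch below is one possible way to do it, under the assumption that a "linked issue" appears on the same line as either a #123-style reference or a URL; the function name and the sample source are hypothetical. It scans every line rather than parsing comments, so it may also flag markers inside string literals.

```python
import re

# Markers named in the checklist.
MARKER = re.compile(r"\b(TODO|FIXME|HACK)\b")
# Assumption: an issue link looks like "#123" or an http(s) URL on the same line.
ISSUE_LINK = re.compile(r"#\d+|https?://\S+")


def unlinked_markers(source: str):
    """Return (line_number, text) for marker lines that carry no issue link."""
    findings = []
    for number, line in enumerate(source.splitlines(), start=1):
        if MARKER.search(line) and not ISSUE_LINK.search(line):
            findings.append((number, line.strip()))
    return findings


sample = """\
def load(path):
    # TODO(#482): stream large files instead of reading them whole
    data = open(path).read()
    # FIXME handle missing file
    return data
"""
print(unlinked_markers(sample))
```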

Testing

Unit tests written and passing (target 70-80% coverage for new code)
Integration tests written where applicable
E2E tests for critical user journeys
Edge case tests included
No test pollution (tests independent, no shared mutable state)

Security

Input validation on all entry points
Output encoding appropriate to context
No hardcoded secrets
Authentication/authorization verified
Dependency scan clean (no high/critical vulnerabilities)
OWASP Top 10:2025 addressed for this feature
JIT-tooling gap flag (PR-TIME layer): if no SAST equivalent ran in the validation suite, surface that as a visible, non-blocking finding: "Closing this diamond without SAST coverage. Best-practice menu was offered at bootstrap; gap is on the record." Do not block, but do make the absence visible. This follows the 4-layer JIT composition (delivery-bootstrap 3a/3b + reflexion/security-review NUDGE-AT-FAILURE + this PR-TIME layer; deep-study 2026-05-26).

Accessibility

Semantic HTML used correctly
Keyboard navigation works
Color contrast meets WCAG 2.1 AA (at least 4.5:1 for normal text, 3:1 for large text)
Screen reader announces content correctly
Focus management correct for dynamic content
Usability heuristics evaluated (Nielsen's 10 via /mycelium:usability-check)
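The color contrast item above can be verified numerically. The sketch below follows the relative luminance and contrast ratio definitions from WCAG 2.1; the function names are my own, and colors are assumed to be 8-bit sRGB tuples. It is meant as a quick check, not a replacement for an accessibility audit tool.

```python
def relative_luminance(rgb):
    """Relative luminance of an 8-bit sRGB color, per the WCAG 2.1 definition."""
    def channel(c):
        s = c / 255
        # Linearize the sRGB channel value.
        return s / 12.92 if s <= 0.03928 else ((s + 0.055) / 1.055) ** 2.4

    r, g, b = (channel(c) for c in rgb)
    return 0.2126 * r + 0.7152 * g + 0.0722 * b


def contrast_ratio(fg, bg):
    """Contrast ratio (L1 + 0.05) / (L2 + 0.05), with L1 the lighter color."""
    lighter, darker = sorted(
        (relative_luminance(fg), relative_luminance(bg)), reverse=True
    )
    return (lighter + 0.05) / (darker + 0.05)


def meets_aa(fg, bg, large_text=False):
    """AA thresholds from the checklist: 4.5:1 normal text, 3:1 large text."""
    return contrast_ratio(fg, bg) >= (3.0 if large_text else 4.5)


white = (255, 255, 255)
print(round(contrast_ratio((0, 0, 0), white), 2))
print(meets_aa((118, 118, 118), white))   # gray #767676 on white
print(meets_aa((119, 119, 119), white))   # gray #777777 on white
```

Gray #767676 on white passes for normal text while #777777 narrowly fails, which is why checking the computed ratio is more reliable than judging by eye.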

Documentation

API documentation updated (if API changed)
Inline documentation for complex logic
README updated if setup/usage changed
Architecture Decision Records filled in (if docs/adr/ was scaffolded by /mycelium:delivery-bootstrap)

Observability

Structured logging for key operations
Error tracking configured
Metrics emitted for measurable behaviors
Health check endpoint works (if service)

Deployment

Feature flag configured (if progressive rollout)
Deployed to staging
Smoke test passing in staging
Rollback plan identified
Monitoring/alerting configured

Process

corrections.md reviewed (no repeated mistakes)
BVSSH check: this delivery makes things Better, more Valuable, Sooner, Safer, Happier
delivery-journal.md updated

MoSCoW Scope Compliance (DSDM)

All Must-have items pass ALL REVIEW checks above
All Should-have items pass ALL REVIEW checks above
Could-have items pass NUDGE-only checks (acceptable to ship without full REVIEW compliance)
Won't-have items documented for future reference (not silently dropped)
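One reading of the MoSCoW rules above, sketched under assumptions: each check carries a level of "REVIEW" or "NUDGE", Must-have and Should-have items are judged on their REVIEW checks, Could-have items only on their NUDGE checks, and Won't-have items only on whether they are documented. The dictionary layout, key names, and sample items are hypothetical and are not defined by this skill.

```python
# Assumption: which check level gates each priority.
REQUIRED_LEVEL = {"must": "REVIEW", "should": "REVIEW", "could": "NUDGE"}


def scope_failures(items):
    """Return (item_name, reason) pairs for items that break scope compliance."""
    failures = []
    for item in items:
        priority = item["priority"]
        if priority == "wont":
            # Won't-have items must be recorded, not silently dropped.
            if not item.get("documented", False):
                failures.append((item["name"], "not documented for future reference"))
            continue
        level = REQUIRED_LEVEL[priority]
        for check_name, check_level, passed in item.get("checks", []):
            # Only checks at the level required for this priority can fail the item.
            if check_level == level and not passed:
                failures.append((item["name"], check_name))
    return failures


items = [
    {"name": "login", "priority": "must",
     "checks": [("unit tests", "REVIEW", True), ("lint", "REVIEW", True)]},
    {"name": "export", "priority": "should",
     "checks": [("unit tests", "REVIEW", False)]},
    {"name": "dark mode", "priority": "could",
     "checks": [("e2e tests", "REVIEW", False), ("lint", "NUDGE", True)]},
    {"name": "sso", "priority": "wont", "documented": False},
]
print(scope_failures(items))
```

Here "dark mode" ships despite a failing REVIEW check because it is a Could-have, while the undocumented Won't-have item is reported.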

AI Feature Contract (product_type: ai_tool ONLY)

Skip this entire section unless diamonds/active.yml declares product_type: ai_tool. An AI feature is a behavioral contract, not a spec: it is judged on success thresholds, failure modes, and must-never constraints rather than line-item functionality. These criteria do NOT apply to software / content / service products — do not let them bleed into the product-agnostic checklist above.

Eval suite passes at the declared threshold. The threshold is a product decision recorded in canvas/ai-tool-metrics.yml#prompt_quality, not an engineering default — the PM owns the quality bar, not the engineer. A green eval run below the declared threshold is NOT DONE.
PM sign-off on a representative output batch. A human reviewed a real sample of outputs and accepted them. An eval pass is model-quality evidence; it is NOT a substitute for human acceptance of product behavior.
Every behavioral_constraints entry verified non-violated. Each must-never rule in ai-tool-metrics.yml#behavioral_constraints has a populated verification and a recent last_verified. An unverified constraint is NOT DONE (treat-uncertainty-as-not-done, per the anti-bias clauses above).
failure_modes derived from real outputs, with binary acceptance. ai-tool-metrics.yml#failure_modes.outputs_reviewed is set (target >= 20 real outputs reviewed) and every mode carries a BINARY acceptance_criterion naming who holds final judgment. Imagined failure modes do not count.
AI transparency disclosure present if the feature is user-facing (regulatory.transparency_disclosure, EU AI Act Art. 50) — cross-check via /mycelium:regulatory-review.
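The file-backed criteria above lend themselves to a mechanical pre-check. The sketch below assumes canvas/ai-tool-metrics.yml has already been loaded into a plain dictionary. The section names (prompt_quality, behavioral_constraints, failure_modes, outputs_reviewed, acceptance_criterion, verification, last_verified) come from the text above, but the nesting, the "threshold", "rule", "modes", and "name" keys, and the 30-day window used for "recent" are assumptions. PM sign-off and the transparency disclosure still need a human and are not covered.

```python
from datetime import date


def ai_contract_failures(product_type, metrics, eval_score, today, max_age_days=30):
    """Return failing AI Feature Contract items; empty if the section does not apply."""
    if product_type != "ai_tool":
        return []  # The section is skipped for every other product type.

    failures = []
    # Hypothetical key: the PM-owned threshold stored under prompt_quality.
    if eval_score < metrics["prompt_quality"]["threshold"]:
        failures.append("eval below declared threshold")

    for constraint in metrics["behavioral_constraints"]:
        verified_on = constraint.get("last_verified")
        stale = verified_on is None or (today - verified_on).days > max_age_days
        if not constraint.get("verification") or stale:
            failures.append(f"constraint unverified: {constraint['rule']}")

    failure_modes = metrics["failure_modes"]
    if failure_modes.get("outputs_reviewed", 0) < 20:
        failures.append("fewer than 20 real outputs reviewed")
    for mode in failure_modes.get("modes", []):
        if not mode.get("acceptance_criterion"):
            failures.append(f"no binary acceptance criterion: {mode['name']}")
    return failures


metrics = {
    "prompt_quality": {"threshold": 0.90},
    "behavioral_constraints": [
        {"rule": "never reveal system prompt", "verification": "red-team suite",
         "last_verified": date(2026, 5, 20)},
        {"rule": "never give dosage advice", "verification": "", "last_verified": None},
    ],
    "failure_modes": {
        "outputs_reviewed": 12,
        "modes": [{"name": "invented citation",
                   "acceptance_criterion": "PM judges: cited source exists, yes or no"}],
    },
}
print(ai_contract_failures("ai_tool", metrics, 0.87, date(2026, 6, 1)))
```

An empty result only means the recorded fields are consistent; it does not show that the verification itself was sound.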

Outcome

DONE: All required items checked. Proceed.
NOT DONE: List failing items. Address before marking complete.

Theory Citations

Forsgren: Accelerate (deployment practices)
Smart: Sooner Safer Happier (BVSSH)
Downe: Good Services (quality)
OWASP: Security
WCAG: Accessibility
AI Feature Contract: AI PRD as behavioral contract — success thresholds, evidence-derived failure modes, must-never constraints (Adaline 2026)
