Prompt Optimizer
Systematically improve Mycelium instructions through measurement. Adapted from n-trax.
Commands
baseline -- Capture current performance
- Run
/mycelium:eval-runner run-split optimization— record as optimization scores - Run
/mycelium:eval-runner run-split holdout— record as holdout scores - Record both to
.claude/optimization/baseline.json: timestamp, CLAUDE.md hash, optimization metrics, holdout metrics, overall and per-category metrics
test <variant> -- Test a variant
- Read variant from
.claude/optimization/variants/<variant>.md - Apply the CLAUDE.md changes described
- Run
/mycelium:eval-runner run-split optimization— this is the hill-climbing signal - Run
/mycelium:eval-runner run-split holdout— this validates generalization - Store results in
.claude/optimization/results/<variant>.json - Compare against baseline. Flag overfitting if optimization improves but holdout degrades.
- Do NOT auto-revert -- let user decide
report -- Compare all variants
Generate comparison table with split-aware columns:
| Variant | Opt Pass Rate | Holdout Pass Rate | Delta Opt | Delta Holdout | Overfit? | Decision |
Flag Overfit? = YES when optimization delta is positive but holdout delta is negative.
exemplar <eval-name> -- Capture winning trajectory
After a clean eval win (1 iteration, fast), save the approach to .claude/optimization/exemplars/.
Workflow
- Capture baseline
- Hypothesize an instruction improvement
- Document in variants/ directory
- Test the variant
- Compare via report
- Keep or revert based on data
- Capture exemplars from clean wins