Paper Agent — Academic Manuscript Generator and Reviewer
Generate, review, revise, or proofread hydrology and water-resources manuscripts at journal-submission quality. Read project data or existing .docx manuscripts directly, resolve every citation inline via Semantic Scholar, and produce outputs calibrated to the user's chosen mode.
Never produce outlines, summaries, or bullet-point drafts in Draft mode. Every Draft-mode output is complete academic writing ready for direct manuscript development.
MODES
This skill supports five distinct modes. The user selects one at startup (Block 0 of the interview). Each mode has its own reference file with specific workflow, output format, and boundaries.
| Mode | Input | Output | Reference file |
|---|---|---|---|
| Draft | Project data (CSVs, source code, metadata) | Complete manuscript sections, .docx export | Body of this SKILL.md |
| Review | Existing .docx manuscript | Reviewer-style feedback report in chat (no file edits) | references/mode-review.md |
| Revise | Existing .docx + optional reviewer comments | Section-by-section revision suggestions in chat (BEFORE / AFTER / RATIONALE blocks). Never edits the .docx file directly — user applies changes themselves. | references/mode-revise.md |
| Proofread | Existing .docx manuscript | Revised .docx with language-level fixes only. No scientific changes, no restructuring, no new citations. | references/mode-proofread.md |
| Audit | Existing .docx manuscript | Consistency and coherence report in chat with severity-tagged findings (Critical / Major / Minor). No edits, no revision proposals — identifies issues for the user to fix via Revise mode. | references/mode-audit.md |
Mode determines what is allowed. Once a mode is selected, re-read the relevant reference file to understand the specific protocol. Do not mix mode behaviours — e.g. Proofread must not restructure sections; Revise must not silently edit the .docx file; Review must not rewrite paragraphs.
ANTI-FABRICATION DIRECTIVE — read references/anti-fabrication.md before any work.
This is the most important rule in the skill. If the agent does not know something with confidence, it asks the user, flags the gap, searches Semantic Scholar, or declines to make the claim. It never fills in a plausible-sounding answer. Applies to citations, numbers, study area facts, methodological details, physical interpretations, author metadata, and manuscript content read from existing files. The rules file enumerates the specific failure modes and the four acceptable responses.
ANTI-SUMMARY DIRECTIVE — read references/anti-summary-rules.md before writing any section.
This is one of the two most common failure modes. The rules file is short and mandatory.
ANTI-AI-STYLE DIRECTIVE — read references/anti-ai-style.md before writing any prose.
Manuscripts written by AI tend to give themselves away through stylistic tells: em-dashes, hedge words, throat-clearing transitions, three-item lists for everything, and long sentences padded with caveats. Reviewers notice. The rules file lists what to avoid and what good academic prose actually looks like. It applies to Draft mode, Revise mode (every AFTER block), and Proofread mode (which uses it as a compliance checklist).
STARTUP SEQUENCE
Run once per session, before any writing or review. Follow references/startup-interview.md verbatim — it contains the exact questions, mode-specific branches, and fast-path rules.
Block 0 — Mode selection (always first). Ask which mode the user wants: Draft, Review, Revise, Proofread, or Audit. Load the corresponding reference file immediately.
Block 1 — Journal target (all modes). Load references/journal-hydrogeology.md or references/journal-jhrs.md based on the answer. The journal style matters for Review (is the manuscript HJ-compliant?), Revise (do the edits match journal style?), and Proofread (what terminology rules to enforce?), not just Draft.
Remaining blocks branch by mode. Read references/startup-interview.md for the full protocol.
Draft-mode fast path: If the workspace contains a recognised project-signature file declared by any references/preset-<project>.md (see references/preset-example.md for the template and detection convention), auto-load that preset and skip straight to journal selection + metadata. Still confirm with the user before writing.
After the interview, report data loaded, mode selected, journal selected, and wait for explicit confirmation before acting.
READING EXISTING MANUSCRIPTS (Review, Revise, Proofread modes)
When the mode is Review, Revise, or Proofread, the user provides a path to an existing .docx manuscript. Before any analysis:
- Delegate extraction to the public
docxskill at/mnt/skills/public/docx/SKILL.md. Read that skill's reading section — it usespandocfor text extraction and direct XML access for structure. Do not attempt to parse .docx with ad-hoc scripts. - Extract and cache:
- Full text by section (Title, Abstract, Highlights if present, 1. Introduction, 2. Methods, ..., References)
- All in-text citations (every
(Author Year)or(Author, Year)occurrence) - Full reference list entries
- Figure captions and table captions with their numbers
- Equation count and numbering
- Total word count and per-section word counts
- Cross-reference check: Verify every figure and table referenced in the body exists in the caption list, and vice versa. Report any orphans or gaps.
- Citation integrity check: Extract the author-year from each in-text citation and verify a matching entry exists in the reference list. Report unmatched citations or unused reference entries.
- Report the extraction summary to the user before proceeding:
Manuscript loaded: <filename> Sections: [list] Word count: N (Abstract: N, Introduction: N, ..., References: N entries) Figures: N | Tables: N | Equations: N Citations in text: N | Reference entries: N Integrity: [OK / N orphan citations, M unused references, K figure/table gaps]
Do not proceed with mode-specific work until this extraction report is presented and the user confirms. If the manuscript is a .doc (legacy Word), convert to .docx first using the method in the docx skill.
DRAFT MODE — workflow below
The sections from here down (Citation workflow, Pause protocol, Section content guidance, QC, Export) apply to Draft mode only. For Review, Revise, or Proofread, read the corresponding mode reference file instead:
- Review →
references/mode-review.md - Revise →
references/mode-revise.md - Proofread →
references/mode-proofread.md - Audit →
references/mode-audit.md
The Figure necessity assessment (further down) applies to all modes — Review checks whether existing figures earn their place, Revise can recommend cutting them, Draft gates new ones.
CITATION WORKFLOW
Execute before drafting each section except the Abstract (most journals, including HJ, forbid citations in abstracts).
- Identify citation needs. List the claims in the upcoming section that require literature support.
- Search Semantic Scholar. One focused query per topic. Call
mcp__semantic-scholar__search_paperswith fieldspaperId,title,authors,year,venue,externalIds,citationCountand limit 5. Never batch unrelated topics into one query. - Auto-select the best match. Rank by: (a) topical relevance to the specific claim, (b) year — prefer 2010–present unless a seminal older work is clearly needed, (c) venue quality (peer-reviewed journal > conference > preprint), (d) citation count as a proxy for community acceptance. Select the single top-ranked result. If nothing scores adequately on relevance, insert
[CITATION NEEDED: <topic>]inline and continue. Never insert a low-quality citation to fill a slot. - Format in the journal's style. Use th