Peer Review
Run a full simulated peer review — dispatch the peer-reviewer and integrity-reviewer agents over the manuscript, dedupe their findings, and produce a referee report with a recommendation.
Core Rule
Simulate the referee you fear, before the journal assigns one. This skill runs an adversarial, evidence-bound review: it surfaces what a skeptical reviewer would attack — overclaim, unsupported assertions, scope creep, methods gaps, statistical sins — and produces a referee report you can act on. The review reports; it does not silently edit the manuscript. Fixes that touch the thesis, a quantity, methods, or an argument-carrying citation are Protected Claims (CLAUDE.md) and need author sign-off.
The reviewers are bound by the same cardinal rule as the writer: a critique must point at real text and a real problem. No invented weaknesses, no citations the reviewer "expects to see" that don't exist as a demand — every issue cites a section/line.
When to Use
Invoke with /peer-review when:
- A draft is complete and you want a referee report before submission.
- A major section (Discussion, Methods) is settled and you want it stress-tested.
- You are deciding readiness for a target venue (pair with
/journal-fit). - A reviser wants to know which Reviewer-2 objections are still open.
Scope it: /peer-review for the whole manuscript, or /peer-review sections/methods.tex for one section.
Process
Phase 1: Load the Manuscript and Its Contract
- Read
MANUSCRIPT_MAP.md— Thesis, Contribution, Audience, target venue, Key sources, "Claims that need extra care," and the Structure table. The review judges the manuscript against its own stated thesis and scope — a reviewer's first question is "did they deliver what they claimed?" - Read the manuscript — every section file in the Structure table (or the scoped file).
- Read
tasks/reviews/_index.md → ## Top Rules— recurring prior critiques. If a reviewer already flagged "you overclaim in the discussion," check it has been addressed; an unfixed Top Rule is a Major issue. - Note the field overlay in
agent_docs/field/if one exists — discipline-specific reporting standards the reviewer will hold you to.
Phase 2: Dispatch the Reviewer Agents
Run two specialist agents in parallel — they read the manuscript independently and return findings:
-
peer-revieweragent — the scholarly referee. Judges:- Significance & novelty — is the contribution real and clearly distinguished from prior work?
- Soundness — do the methods support the claims? Is the analysis appropriate?
- Claim calibration — does every results/discussion sentence stay within what the evidence licenses? (verb + quantifier + scope)
- Clarity & structure — does each section establish the one claim it owes (
MANUSCRIPT_MAP → Structure)? - Statistics — effect size + uncertainty reported, not just significance; N, test, and assumptions stated (
agent_docs/statistics.md).
-
integrity-revieweragent — the research-integrity referee. Judges:- Sourcing — every substantive claim is cited, the author's own, or common knowledge; no UNSUPPORTED assertions.
- No fabrication — no invented-looking citation, DOI, quantity, or quote; placeholders (
[CITE],[VALUE — verify]) surfaced, not hidden. - Quote fidelity — quotations verbatim with locators.
- Reproducibility — data/code availability stated; methods sufficient to reproduce (
agent_docs/reproducibility.md). - Scope honesty — generalization claims match the tested population/matrix.
If the agents are unavailable in this environment, run both review lenses yourself sequentially — but keep them as separate passes (scholarly soundness vs integrity), because they catch different failures.
Phase 3: Dedupe and Triage
The two agents will overlap (an overclaim is both a soundness and an integrity issue). Merge:
- Collapse duplicates — same sentence flagged by both → one issue, noting both lenses.
- Classify severity:
- Major — threatens a central claim, the contribution, soundness, or integrity. Would justify "major revision" or rejection. (Unsupported thesis-level claim, methods that don't support the result, fabrication risk, unaddressed prior Top Rule.)
- Minor — does not threaten the conclusion but should be fixed. (Local overclaim, a missing cite on a secondary claim, an undefined cross-reference, a clarity issue.)
- Order by impact — Major issues first, most central first. A reviewer leads with the objection that decides the paper.
Phase 4: Write the Referee Report
Produce a report in the shape a journal referee submits: a Summary that proves you understood the contribution, then Major and Minor issues, then a recommendation. Keep the reviewer's professional, specific voice — every point names a location and states the fix or the question.
Phase 5: Hand Off, Do Not Auto-Fix
End with a checklist of edits the author can apply. Do not apply them in this skill. Surface Protected Claims explicitly (thesis/quantity/methods/argument-citation changes) so the author decides. Offer to run /claim-check on flagged overclaims and /citation-audit on flagged bibliography issues as the follow-up.
Output Format
# Referee Report — <manuscript title> (simulated)
## Summary (reviewer's understanding)
This manuscript argues <thesis, in the reviewer's words>. The contribution is
<X>, established via <method>. [2–4 sentences showing the contribution was understood.]
## Recommendation
Major revision (Major: 3, Minor: 6)
## Major Issues
1. **[Soundness]** §Discussion ¶3 (discussion.tex:41): The claim "the gate causes
higher tool-call accuracy across task settings" is causal and general, but the
design tests one agent harness and shows association. The conclusion overreaches
the evidence.
→ Restrict to the tested harness and to associational language, or supply the
comparison/identification that licenses the stronger claim. (Protected: verb +
scope change on the central claim.)
2. **[Integrity]** §Results (results.tex:62): "23% baseline hallucination rate" carries
no citation and is not in the data. UNSUPPORTED. → Cite the source or flag
[CITE]; do not assert it bare.
3. **[Prior Top Rule]** tasks/reviews flagged overclaim in the discussion before;
§Discussion ¶5 repeats the pattern. → Apply the existing rule.
## Minor Issues
1. **[Statistics]** §Results: p-values reported without effect sizes or CIs. Add both.
2. **[Clarity]** §Intro ¶2: the gap is stated twice; the contribution once. Invert.
3. **[Cross-ref]** discussion.tex:40: \ref{fig:horizon} has no \label.
4. **[Terminology]** "tool-call accuracy" and "success rate" used interchangeably — lock one (MANUSCRIPT_MAP → Terminology).
5. **[Sourcing]** §Intro: "widely reported" needs at least one cite or reframing.
6. **[Reproducibility]** No data-availability statement.
## Edit Checklist (for the author — not auto-applied)
- [ ] Discussion ¶3: calibrate verb + scope (PROTECTED — confirm)
- [ ] Results: resolve [CITE] on baseline hallucination rate
- [ ] Results: add effect sizes + CIs
- [ ] Fix \ref{fig:horizon}
- [ ] Add data-availability statement
- [ ] Lock terminology to "tool-call accuracy"
## Suggested follow-ups
- /claim-check sections/discussion.tex (the overclaim cluster)
- /citation-audit (sourcing + cross-ref issues)Pairs With
peer-revieweragent +integrity-revieweragent — the two lenses this skill orchestrates (Phase 2). Defined in.claude/agents/; methodology inagent_docs/peer-review.md./claim-check— the targeted follow-up for flagged overclaims (verb/quantifier verification against sources)./citation-audit— the follow-up for flagged bibliography/cross-reference issues./journal-fit— run alongside when the question is also "is this the right venue?"tasks/reviews/— recurring critiques live here; an unaddressed Top Rule is automatically a Major issue.
Common Rationalizations
| Rationalization | Reality |
|---|---|
| "My co-authors already read it" | Co-authors share your blind spots and your investment. Simulate the hostile stranger. |
| "Reviewer 2 is just mean" | Reviewer 2 is reading adversarially — which is the correct way to read your strongest claims. Pre-empt them. |
| "These are nitpicks" | A pile of minor issues reads as carelessness and primes the referee to doubt the major claims too. |
| "Let the journal find the problems" | The journal finding them costs you a rejection or a revision cycle. Find them now. |
| "Just fix the issues for me" | Fixes to the thesis/quantities/methods/argument-citations are Protected — they need the author, not the agent. |
Notes
- Reviewing is judgment-heavy — run on the Reasoner model (
CLAUDE.md → Model Selection). - Feed the outcome back: a critique you keep receiving is a rule. Log it to
tasks/reviews/and promote to Top Rules if it recurs. - This is a simulation — it pre-empts likely objections; it does not guarantee acceptance. Its job is to leave Reviewer 2 with less to say.
Literature Review
Synthesize a related-work / literature-review section from the project's OWN library (references.bib + sources/ + vault) — thematic, gap-driven, calibrated, real citations only.
Response To Reviewers
Draft a point-by-point response letter — quote each reviewer comment, state the change made and its exact location, stay courteous, and NEVER claim a change not actually made.