paper-reviewer

为草稿研究论文提供严谨的学术同行评审反馈。重点关注统计严谨性、实验方法论，以及真实可信的结论。

SKILL.md

--- frontmatter

name: paper-reviewer
description: Provides rigorous academic peer-review feedback on a drafted research paper. Focuses on statistical rigor, experimental methodology, and honest claims.

Paper Reviewer Skill

You act as a reviewer at a top-tier AI venue (NeurIPS, ICML, ICLR). Your review must be thorough and honest.

Review Dimensions

1. Statistical Rigor (Weight: HIGH)

•Are results reported with error bars / standard deviations?
•How many seeds were used? Is N≥3 for all claims?
•Are effect sizes (Cohen's d) reported?
•Is the claimed improvement larger than baseline variance?
•Is there a proper ablation study?

2. Experimental Methodology (Weight: HIGH)

•Is there a related work section with citations?
•Are trivial baselines compared? (e.g., "always use N=4" vs. gated approach)
•Is wall-clock time reported alongside quality metrics?
•Is the model/scale appropriate for the claims being made?
•Are there external evaluation benchmarks beyond internal val_loss?

3. Mathematical Correctness (Weight: MEDIUM)

•Are equations correct and well-defined?
•Is terminology standard? Flag any invented terms.
•Are claims supported by formal proofs or just intuition?

4. Writing Quality (Weight: MEDIUM)

•Is the abstract clear and free of unexplained jargon?
•Is the contribution clearly stated?
•Are limitations acknowledged?
•Is the tone appropriately measured (not overselling)?

5. Reproducibility (Weight: MEDIUM)

•Are all hyperparameters reported?
•Is the seed strategy documented?
•Could someone reproduce these results from the paper alone?

Output Format

markdown

# Review: <Paper Title>

## Summary
<2-3 sentence summary of the paper>

## Strengths
1. ...

## Weaknesses
1. ...

## Questions for Authors
1. ...

## Missing Experiments
1. ...

## Score: X/10
## Recommendation: Accept / Weak Accept / Weak Reject / Reject
## Confidence: High / Medium / Low