AgentSkillsCN

paper-reviewer

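Provides rigorous academic peer-review feedback on draft research papers. Focuses on statistical rigor, experimental methodology, and honest, credible conclusions.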

SKILL.md
---
name: paper-reviewer
description: Provides rigorous academic peer-review feedback on a drafted research paper. Focuses on statistical rigor, experimental methodology, and honest claims.
---

Paper Reviewer Skill

You act as a reviewer at a top-tier AI venue (NeurIPS, ICML, ICLR). Your review must be thorough and honest.

Review Dimensions

1. Statistical Rigor (Weight: HIGH)

  • Are results reported with error bars / standard deviations?
  • How many random seeds were used? Is N ≥ 3 for every claim?
  • Are effect sizes (e.g., Cohen's d) reported?
  • Is the claimed improvement larger than the baseline's seed-to-seed variation (e.g., its standard deviation)?
  • Is there an ablation study that isolates each proposed component?
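
When a paper reports per-seed numbers, the checks above can be run by hand. The following is a minimal sketch, not a prescribed procedure: the function names (`summarize`, `cohens_d`, `rigor_checks`) and the `min_seeds=3` default are illustrative assumptions, Cohen's d is computed with the pooled sample standard deviation, and the metric is assumed to be higher-is-better (flip the sign for losses).

```python
import math
import statistics


def summarize(scores):
    """Return (mean, sample standard deviation, n) for per-seed scores."""
    if len(scores) < 2:
        raise ValueError("need at least 2 seeds to estimate a standard deviation")
    return statistics.mean(scores), statistics.stdev(scores), len(scores)


def cohens_d(treatment, baseline):
    """Cohen's d using the pooled sample standard deviation of both groups."""
    m_t, s_t, n_t = summarize(treatment)
    m_b, s_b, n_b = summarize(baseline)
    pooled_var = ((n_t - 1) * s_t**2 + (n_b - 1) * s_b**2) / (n_t + n_b - 2)
    if pooled_var == 0:
        raise ValueError("both groups have zero variance; d is undefined")
    return (m_t - m_b) / math.sqrt(pooled_var)


def rigor_checks(treatment, baseline, min_seeds=3):
    """Collect the quantities a reviewer would look for in the results table.

    Assumes a higher-is-better metric; min_seeds=3 mirrors the N >= 3 question.
    """
    m_t, _, n_t = summarize(treatment)
    m_b, s_b, n_b = summarize(baseline)
    improvement = m_t - m_b
    return {
        "enough_seeds": min(n_t, n_b) >= min_seeds,
        "improvement": improvement,
        "baseline_std": s_b,
        # A gain smaller than the baseline's own spread may be seed noise.
        "exceeds_baseline_std": abs(improvement) > s_b,
        "cohens_d": cohens_d(treatment, baseline),
    }
```

A call such as `rigor_checks(treatment_scores, baseline_scores)` takes two lists of per-seed scores. If `exceeds_baseline_std` is false or `enough_seeds` is false, that is worth raising under Weaknesses or Missing Experiments.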

2. Experimental Methodology (Weight: HIGH)

  • Is there a related work section with citations?
  • Are trivial baselines included in the comparison? (e.g., a fixed "always use N=4" setting vs. the gated approach)
  • Is wall-clock time reported alongside quality metrics?
  • Is the model/scale appropriate for the claims being made?
  • Are there external evaluation benchmarks beyond the internal validation loss (val_loss)?
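
As an illustration of what "wall-clock time alongside quality metrics" can look like in practice, here is a small sketch. The helper names `timed` and `report_row` are hypothetical and are not part of this skill. The sketch only shows the kind of paired reporting to expect from authors.

```python
import time


def timed(fn, *args, **kwargs):
    """Call fn and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    return result, time.perf_counter() - start


def report_row(method, quality, seconds):
    """Format one results row so quality and cost appear side by side."""
    return f"{method}: quality={quality:.3f}, wall_clock_s={seconds:.2f}"
```

A results table built from rows like these lets a reader see whether a quality gain was bought with a large increase in compute.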

3. Mathematical Correctness (Weight: MEDIUM)

  • Are equations correct and well-defined?
  • Is terminology standard? Flag any invented terms.
  • Are claims supported by formal proofs, or only by intuition?

4. Writing Quality (Weight: MEDIUM)

  • Is the abstract clear and free of unexplained jargon?
  • Is the contribution clearly stated?
  • Are limitations acknowledged?
  • Is the tone appropriately measured (not overselling)?

5. Reproducibility (Weight: MEDIUM)

  • Are all hyperparameters reported?
  • Is the seed strategy documented?
  • Could someone reproduce these results from the paper alone?
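
For reference, a documented seed strategy and hyperparameter record might look roughly like the sketch below. Everything here is an assumption for illustration: `make_run_record` and `toy_experiment` are hypothetical names, and the toy experiment stands in for a real training run.

```python
import json
import random


def make_run_record(hyperparams, seeds):
    """Bundle what a reader would need to rerun the experiment."""
    return {"hyperparameters": dict(hyperparams), "seeds": sorted(seeds)}


def toy_experiment(seed, lr):
    """Stand-in for a training run: deterministic given (seed, lr)."""
    rng = random.Random(seed)  # local RNG, so no hidden global state
    return lr + rng.gauss(0.0, 0.01)


record = make_run_record({"lr": 0.1, "batch_size": 32}, seeds=[2, 0, 1])
results = {
    seed: toy_experiment(seed, record["hyperparameters"]["lr"])
    for seed in record["seeds"]
}
# Serialized record that could be published with the paper.
report = json.dumps(record, sort_keys=True)
```

If a paper gives enough detail to reconstruct a record like this (every hyperparameter plus the exact seeds), the reproducibility questions above can usually be answered with "yes".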

Output Format

markdown
# Review: <Paper Title>

## Summary
<2-3 sentence summary of the paper>

## Strengths
1. ...

## Weaknesses
1. ...

## Questions for Authors
1. ...

## Missing Experiments
1. ...

## Score: X/10
## Recommendation: Accept / Weak Accept / Weak Reject / Reject
## Confidence: High / Medium / Low
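
A finished review can be checked against the template above before it is returned. The following is a minimal sketch under stated assumptions: `check_review` is a hypothetical helper, and the 1 to 10 integer score range is an assumption, since the template only says `X/10`.

```python
import re

# Section headings taken from the template above.
REQUIRED_SECTIONS = [
    "## Summary",
    "## Strengths",
    "## Weaknesses",
    "## Questions for Authors",
    "## Missing Experiments",
]
RECOMMENDATIONS = {"Accept", "Weak Accept", "Weak Reject", "Reject"}
CONFIDENCES = {"High", "Medium", "Low"}


def check_review(text):
    """Return a list of problems; an empty list means the review fits the template."""
    problems = []
    lines = [line.strip() for line in text.splitlines()]

    if not any(line.startswith("# Review:") for line in lines):
        problems.append("missing title line '# Review: <Paper Title>'")

    for heading in REQUIRED_SECTIONS:
        if heading not in lines:
            problems.append(f"missing section '{heading}'")

    # Assumption: the score is an integer from 1 to 10 written as X/10.
    score = re.search(r"^## Score:[ \t]*(\d+)/10[ \t]*$", text, re.MULTILINE)
    if score is None or not 1 <= int(score.group(1)) <= 10:
        problems.append("score must be an integer from 1 to 10 written as X/10")

    rec = re.search(r"^## Recommendation:[ \t]*(.+?)[ \t]*$", text, re.MULTILINE)
    if rec is None or rec.group(1) not in RECOMMENDATIONS:
        problems.append("recommendation must be one of: " + ", ".join(sorted(RECOMMENDATIONS)))

    conf = re.search(r"^## Confidence:[ \t]*(.+?)[ \t]*$", text, re.MULTILINE)
    if conf is None or conf.group(1) not in CONFIDENCES:
        problems.append("confidence must be one of: " + ", ".join(sorted(CONFIDENCES)))

    return problems
```

The checker only validates structure. It says nothing about whether the review's content is thorough or fair.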