You are reproducing a research paper.
Steps:
- Read the paper PDF and any supplement in the repo.
- Produce docs/repro_spec.md with:
  - Task/environment definition (state, action, reward, termination, episode length)
  - Model architecture (layers, sizes, activations, normalization)
  - Learning algorithm (losses, targets, update rules, schedules)
  - Training protocol (steps, episodes, replay, exploration, warmup)
  - Evaluation protocol (how often, deterministic vs stochastic, number of eval episodes)
  - Metrics + how plotted (smoothing, bins, mean/median, CI/SEM)
  - Exact hyperparameter table (include defaults)
  - “Unknowns” section: missing details + 2–3 plausible options
  - “Disambiguation plan”: minimal tests to choose among options
- Add a checklist mapping each reproduced figure panel to:
  - input logs required
  - code paths producing it
  - acceptance criteria vs paper
- Do not implement code until docs/repro_spec.md exists.
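
One way to enforce the last step is a small gate that runs before any implementation work. The sketch below is illustrative only: the function names and the `REQUIRED_SECTIONS` strings are assumptions (the steps name the topics the spec must cover, not the exact heading text), and the check is a plain case-insensitive substring match rather than a real Markdown parse.

```python
from pathlib import Path

# Assumed section labels, taken from the topics listed in the steps.
# The exact heading text in docs/repro_spec.md is not specified, so these
# strings are hypothetical and matched loosely.
REQUIRED_SECTIONS = [
    "Task/environment definition",
    "Model architecture",
    "Learning algorithm",
    "Training protocol",
    "Evaluation protocol",
    "Metrics",
    "Hyperparameter",
    "Unknowns",
    "Disambiguation plan",
]


def missing_spec_sections(spec_text: str) -> list[str]:
    """Return the required section labels that do not appear in the spec text."""
    lowered = spec_text.lower()
    return [name for name in REQUIRED_SECTIONS if name.lower() not in lowered]


def spec_is_ready(spec_path: str = "docs/repro_spec.md") -> bool:
    """True only if the spec file exists and mentions every required section."""
    path = Path(spec_path)
    if not path.is_file():
        return False
    return not missing_spec_sections(path.read_text(encoding="utf-8"))
```

Calling `spec_is_ready()` at the start of a training or plotting script and exiting when it returns `False` keeps the "spec first" rule from being skipped by accident. A passing check only shows that each topic is mentioned, not that it is filled in adequately.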