AgentSkillsCN

rnow-cli

使用 ReinforceNow CLI 进行 RLHF 训练。适用于运行 rnow 命令、初始化项目、提交训练任务、测试回滚部署,或下载模型时使用。触发条件包括“rnow”、“rnow init”、“rnow run”、“rnow test”、“rnow download”、“rnow login”以及“training run”。

SKILL.md
--- frontmatter
name: rnow-cli
description: Use the ReinforceNow CLI for RLHF training. Use when running rnow commands, initializing projects, submitting training runs, testing rollouts, or downloading models. Triggers on "rnow", "rnow init", "rnow run", "rnow test", "rnow download", "rnow login", "training run".
allowed-tools: Bash, Read, Grep, Glob

ReinforceNow CLI Reference

The rnow CLI manages RLHF training projects on the ReinforceNow platform.

Installation

bash
pip install rnow

Command Overview

CommandDescription
rnow loginAuthenticate with the platform
rnow logoutRemove credentials
rnow statusCheck auth and running jobs
rnow orgsManage organizations
rnow initCreate new project from template
rnow runSubmit training run
rnow stopCancel active run
rnow testTest rollouts locally
rnow downloadDownload trained model

rnow login

Authenticate using OAuth device flow.

bash
rnow login [OPTIONS]
OptionDescription
--forceForce new login even if already authenticated
--api-url URLCustom API base URL

Example:

bash
rnow login
# Opens browser for authentication
# Stores credentials in ~/.reinforcenow/credentials.json

rnow logout

Remove stored credentials.

bash
rnow logout

rnow status

Check authentication status and running jobs.

bash
rnow status

Output:

code
Logged in as: user@example.com
Organization: My Team (org_abc123)
Active runs: 2
  - run_xyz789 (running) - Math Training
  - run_def456 (queued) - Code Agent

rnow orgs

List or select organizations.

bash
# List all organizations
rnow orgs

# Select an organization
rnow orgs ORG_ID

Example:

bash
rnow orgs
# Output:
# * org_abc123 - My Team (owner)
#   org_def456 - Other Team (member)

rnow orgs org_def456
# Switched to: Other Team

rnow init

Initialize a new project from a template.

bash
rnow init [OPTIONS]
OptionDescription
--template NAMETemplate to use (see below)
--name NAMEProject name (prompts if not provided)

Available Templates

TemplateTypeDescription
startRLDefault single-turn RL (alias for rl-single)
rl-singleRLSingle-turn with math reasoning
rl-toolsRLMulti-turn with tool calling
sftSFTSupervised finetuning
tutorial-rewardRLLearn reward functions
tutorial-toolRLLearn tool functions
mcp-tavilyRLExternal MCP server (web search)
deepseek-ahaRLDeepSeek aha-moment training
finqaRLFinancial QA
convfinqaRLConversational financial QA
quantqaRLQuantitative finance
newRLMinimal template
blank-Empty (config only)

Examples:

bash
# Create SFT project
rnow init --template sft --name "my-sft-project"

# Create RL project with tools
rnow init --template rl-tools

# Create from tutorial
rnow init --template tutorial-reward

Generated Files

TemplateFiles
sftconfig.yml, train.jsonl
rl-singleconfig.yml, train.jsonl, rewards.py, requirements.txt
rl-toolsconfig.yml, train.jsonl, rewards.py, tools.py, requirements.txt
blankconfig.yml

rnow run

Submit project for training.

bash
rnow run [OPTIONS]
OptionDescription
--dir PATHProject directory (default: current)
--name NAMECustom run name

Required files:

  • config.yml - Configuration
  • train.jsonl - Training data
  • rewards.py - Reward functions (RL only)

Optional files:

  • tools.py - Tool definitions
  • requirements.txt - Python dependencies

Example:

bash
cd my-project
rnow run

# Output:
# Validating project...
# Uploading files...
# Starting run: run_abc123xyz
# View at: https://www.reinforcenow.ai/runs/run_abc123xyz

rnow stop

Cancel an active training run.

bash
rnow stop RUN_ID

Example:

bash
rnow stop run_abc123xyz
# Are you sure you want to stop run_abc123xyz? [y/N]: y
# Run stopped.
# Duration: 2h 15m
# Cost: $12.50

rnow test

Test RL rollouts locally before submitting.

bash
rnow test [OPTIONS]
OptionDefaultDescription
-d, --dir PATH.Project directory
-n, --num-rollouts N1Number of rollouts
--entry INDICESrandomTest specific entries (e.g., "0,2,5")
--model MODELconfigOverride model for testing

Examples

Basic test:

bash
rnow test
# Runs 1 rollout, shows reward breakdown

Multiple rollouts:

bash
rnow test -n 5

Test specific entries:

bash
rnow test --entry 0,3,7
# Tests entries at indices 0, 3, and 7 from train.jsonl

Override model:

bash
rnow test --model gpt-5-nano -n 3
# Uses gpt-5-nano instead of config.model.path

Test Output

code
Rollout 1/3
Entry: 0
Prompt: What is 2+2?

Turn 1:
  Assistant: The answer is 4.

Rewards:
  accuracy: 1.0
  format_check: 1.0
Total: 1.0

---
Rollout 2/3
...

rnow download

Download a trained model checkpoint.

bash
rnow download RUN_ID [OPTIONS]
OptionDefaultDescription
-o, --output DIR./modelOutput directory

Example:

bash
rnow download run_abc123xyz -o ./my-model
# Downloading checkpoint...
# Progress: 100%
# Saved to: ./my-model/