Quick Evaluation

Name: quick-eval
Author: coval-ai

Run a complete evaluation for $ARGUMENTS: launch, monitor, and summarize results.

Workflow

Step 1: Select Resources

List the available agents, test sets, and personas:

bash

coval agents list
coval test-sets list
coval personas list

Confirm with the user:

• Agent to evaluate
• Test set to use
• Persona for simulation
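
If this workflow is scripted, one way to enforce the confirmation step is to refuse to continue until all three IDs are set. The following is a minimal sketch; `require_id` and the variable names in the usage line are hypothetical helpers, not part of the coval CLI.

```bash
# Hypothetical helper: fail early if a confirmed resource ID is missing.
# Usage: require_id <label> <value>
require_id() {
  local label="$1" value="${2:-}"
  if [ -z "$value" ]; then
    echo "Missing ${label} ID; confirm it with the user before launching." >&2
    return 1
  fi
  # Echo the confirmed value so it appears in the session log.
  echo "${label}: ${value}"
}
```

For example, `require_id agent "$AGENT_ID" && require_id "test set" "$TEST_SET_ID" && require_id persona "$PERSONA_ID"` stops at the first missing value.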

Step 2: Launch Run

bash

coval runs launch \
  --agent-id <agent_id> \
  --persona-id <persona_id> \
  --test-set-id <test_set_id> \
  --name "Quick Eval - $(date +%Y%m%d-%H%M)"

Capture the run ID from the command output; Steps 3 and 4 need it.
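
The sketch below shows one way to build the timestamped run name and keep a copy of the launch output so the run ID can be read back later. `run_name` and `log_output` are hypothetical helpers. The source does not specify how the run ID appears in the output, so the sketch saves the output rather than parsing it.

```bash
# Build the run name used in the launch command. An optional argument
# overrides the timestamp, which helps when a reproducible name is needed.
run_name() {
  local stamp="${1:-$(date +%Y%m%d-%H%M)}"
  echo "Quick Eval - ${stamp}"
}

# Hypothetical wrapper: run any command, show its output, and keep a copy
# in a log file. Note that the exit status is that of tee, not the command.
# Usage: log_output <logfile> <command> [args...]
log_output() {
  local logfile="$1"
  shift
  "$@" 2>&1 | tee "$logfile"
}

# Example (not executed here):
#   log_output launch.log coval runs launch \
#     --agent-id <agent_id> \
#     --persona-id <persona_id> \
#     --test-set-id <test_set_id> \
#     --name "$(run_name)"
```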

Step 3: Watch Progress

bash

coval runs watch <run_id>

Wait for the run to complete before gathering results.
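
The summary in Step 5 reports a duration, so it can help to time the watch step. The sketch below is one way to do that, assuming `coval runs watch` blocks until the run finishes; `elapsed_minutes` is a hypothetical helper.

```bash
# Convert two epoch timestamps (in seconds) into whole elapsed minutes,
# rounding up so that a 30-second run reports 1 minute rather than 0.
elapsed_minutes() {
  local start="$1" end="$2"
  echo $(( (end - start + 59) / 60 ))
}

# Example (not executed here):
#   start=$(date +%s)
#   coval runs watch <run_id>
#   end=$(date +%s)
#   elapsed_minutes "$start" "$end"
```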

Step 4: Gather Results

bash

coval runs get <run_id> --format json
coval simulations list --run-id <run_id> --format json
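
To summarize results later, it may be convenient to save both JSON responses to files instead of reading them from the terminal. The following sketch uses a hypothetical `save_json` helper; the file names in the example are placeholders, and the helper makes no assumption about the JSON schema.

```bash
# Hypothetical helper: run a command and write its stdout to a file,
# failing if the command fails or produces no output.
# Usage: save_json <outfile> <command> [args...]
save_json() {
  local outfile="$1"
  shift
  "$@" > "$outfile" || return 1
  [ -s "$outfile" ] || { echo "No output written to ${outfile}" >&2; return 1; }
}

# Example (not executed here):
#   save_json run.json coval runs get <run_id> --format json
#   save_json simulations.json coval simulations list --run-id <run_id> --format json
```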

Step 5: Summarize

Present a summary in the following format:

code

## Evaluation Complete

**Run:** <run_id>
**Agent:** <agent_name>
**Test Set:** <test_set_name>
**Duration:** X minutes

### Results
- Total Simulations: N
- Completed: N
- Failed: N

### Sample Simulations
[List 3-5 simulation IDs for review]

### Next Steps
- View full results: `coval simulations list --run-id <run_id>`
- Download audio: `coval simulations audio <sim_id> -o recording.wav`
- Get transcript: `coval simulations get <sim_id>`
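
If the summary is produced by a script, the header and results sections of the template can be filled in from values collected in the earlier steps. The sketch below assumes those values are already known; `render_summary` and its argument order are hypothetical, and how the counts are derived from the JSON output is left open because the source does not describe the response schema.

```bash
# Hypothetical helper: fill in the summary template from known values.
# Usage: render_summary <run_id> <agent> <test_set> <minutes> <total> <completed> <failed>
render_summary() {
  cat <<EOF
## Evaluation Complete

**Run:** $1
**Agent:** $2
**Test Set:** $3
**Duration:** $4 minutes

### Results
- Total Simulations: $5
- Completed: $6
- Failed: $7
EOF
}
```

For example, `render_summary <run_id> <agent_name> <test_set_name> 4 10 9 1` prints the first two sections of the template; the sample simulations and next steps can then be appended by hand.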