Evaluate Diagram
Evaluate a generated diagram against a human reference using PaperBanana's VLM-as-Judge scoring.
Instructions
- $ARGUMENTS[0] is the path to the generated image.
- $ARGUMENTS[1] is the path to the human reference image.
- Ask the user for:
  - Source context: the methodology text, or a file path to read it from. If the user provides a file path, read that file to get the text.
  - Figure caption: a description of what the diagram communicates.
- Call the MCP tool evaluate_diagram with:
  - generated_path: the generated image path
  - reference_path: the reference image path
  - context: the methodology text content
  - caption: the figure caption
- Present the evaluation scores to the user. Scores cover four dimensions: Faithfulness, Conciseness, Readability, and Aesthetics.
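The source-context step accepts either literal text or a file path. A minimal shell sketch of that branching follows. The function name resolve_context is hypothetical and is not part of PaperBanana. It assumes that any argument naming an existing regular file should be read, and that anything else is the methodology text itself.

```shell
# Hypothetical helper (not part of PaperBanana): return the methodology text,
# reading it from disk when the argument is an existing file.
resolve_context() {
  local input="$1"
  if [ -f "$input" ]; then
    # The user supplied a file path: emit the file's contents.
    cat "$input"
  else
    # The user supplied the text directly: pass it through unchanged.
    printf '%s\n' "$input"
  fi
}
```

The output of this helper is what would be passed as the context value when calling evaluate_diagram.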
CLI Fallback
If the MCP tool is not available, fall back to the CLI:
bash
paperbanana evaluate --generated <generated-img> --reference <reference-img> --context <context-file> --caption "<caption>"
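A small wrapper can check that the input files exist before invoking the CLI. The sketch below uses only the flags shown in the command above. The function names check_inputs and run_eval and the PAPERBANANA_BIN variable are assumptions introduced for illustration, not part of PaperBanana. Setting PAPERBANANA_BIN=echo gives a dry run that prints the arguments instead of running the evaluation.

```shell
# Hypothetical wrapper around the CLI fallback (names are illustrative).

# Fail early if any of the given paths is not an existing regular file.
check_inputs() {
  local f
  for f in "$@"; do
    if [ ! -f "$f" ]; then
      echo "missing file: $f" >&2
      return 1
    fi
  done
}

# Usage: run_eval <generated-img> <reference-img> <context-file> "<caption>"
run_eval() {
  local generated="$1" reference="$2" context_file="$3" caption="$4"
  check_inputs "$generated" "$reference" "$context_file" || return 1
  # PAPERBANANA_BIN is an assumed override; it defaults to the real command.
  "${PAPERBANANA_BIN:-paperbanana}" evaluate \
    --generated "$generated" \
    --reference "$reference" \
    --context "$context_file" \
    --caption "$caption"
}
```

Unlike the MCP tool, which takes the methodology text directly, the CLI form shown here takes a context file, so text supplied inline needs to be written to a file first.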
Example
code
/evaluate-diagram output.png reference.png