Prompt Engineering Skill

Name: prompt-engineering
Rating: 88
Author: paulyokota

Optimize LLM prompts for classification and theme extraction using data-driven validation.

Workflow

•
Load Required Context
- •docs/prompts.md - Current prompt versions
- •config/theme_vocabulary.json - Theme definitions
- •docs/process-playbook/gates/functional-testing-gate.md - Testing requirements
- •Relevant classifier/extractor code
•
Research Best Practices
- •Review OpenAI prompt engineering guidelines
- •Check fixture files for failure patterns
- •Look for edge cases in test data
•
Design Change
- •Make ONE targeted change at a time
- •Document reasoning for the change
- •Predict expected impact

•
Apply Change
- •Update prompt in appropriate file
- •Maintain consistent format
- •Preserve schema structure
•
Run Functional Test (MANDATORY)
- •Execute full classification pipeline
- •Capture new accuracy metrics
- •Compare against baseline
- •Document in functional test evidence format
•
Analyze Results
- •Did accuracy improve/stay same/regress?
- •Were edge cases handled better?
- •Any new failure patterns introduced?

•
Update Prompt Documentation
- •Add entry to docs/prompts.md with version number
- •Include accuracy metrics (before/after)
- •Note what changed and why
- •Reference functional test evidence
•
Invoke /prompt-iteration command (if available)
- •Logs prompt version with metrics
- •Creates traceable history

Before claiming completion:

File	Purpose
`src/classifier_stage1.py`	Fast routing classifier
`src/classifier_stage2.py`	Refined analysis classifier
`src/theme_extractor.py`	Theme extraction with vocabulary
`config/theme_vocabulary.json`	Theme definitions and keywords
`docs/prompts.md`	Prompt version history
`data/theme_fixtures.json`	Test fixtures for accuracy
`data/labeled_fixtures.json`	Labeled test data

If you cannot proceed: