AgentSkillsCN

model-selection

在每项任务启动之初加载此模块,以评估当前模型是否为最优选择。若任务特性表明存在更优模型,则提示用户切换至其他模型。

SKILL.md
--- frontmatter
name: model-selection
description: Load at the start of any task to assess whether the current model is optimal. Prompt the user to switch models when task characteristics indicate a better choice exists.

Model Selection

At the start of each significant task, assess whether the current model is optimal for the work ahead. If a different model would be substantially better, prompt the user to switch.

When to Prompt for Model Switch

Evaluate these conditions and prompt the user if a switch is warranted:

Upgrade to Opus 4.5 When:

  • Task involves complex architecture design or multi-system reasoning
  • Critical code review where accuracy is paramount
  • Deep domain understanding required (e.g., /project-domain, /project-constitution)
  • User is struggling with a complex problem that needs deeper reasoning

Prompt template:

code
💡 **Model Recommendation**: This task involves [complex architecture/deep reasoning/critical review]. 
Consider switching to **Opus 4.5** for higher accuracy. You can switch back to Sonnet after this step.

Upgrade to GPT-5.2 Extra High When:

  • Writing security-sensitive code (lowest vulnerability rate)
  • Mathematical or algorithmic problems requiring precise reasoning
  • Long-document analysis

Switch to Gemini 3 Flash / Grok Code When:

  • User needs fast responses for interactive work
  • Simple fixes or quick iterations
  • Budget is a concern
  • "Vibe coding" or UI polish work

Prompt template:

code
💡 **Model Recommendation**: For this interactive/quick-fix work, **Gemini 3 Flash** would provide 
faster responses while maintaining sufficient quality.

Use Composer 1 (in Cursor) When:

  • Story implementation (/story-implement)
  • Multi-file editing and refactoring
  • Rapid prototyping
  • User is in Cursor IDE and speed matters

Cross-Validation Recommended When:

  • Architecture decisions are being finalized
  • Security-sensitive code has been written
  • Production deployment is imminent

Prompt template:

code
💡 **Cross-Validation Recommended**: This [architecture/critical code] was authored with [current model]. 
For additional confidence, consider having a different model review it before proceeding.

Quick Reference Table

Task TypeRecommended ModelWhy
Complex architectureOpus 4.5Deep reasoning
Critical code reviewOpus 4.5Highest accuracy
Security-sensitive codeGPT-5.2 Extra HighLowest vulns
Standard implementationSonnet 4.5Best balance
Story implementation (Cursor)Composer 14x faster
Interactive/quick fixesGemini 3 FlashSpeed
Budget-constrainedGemini 3 FlashCheapest quality
Mathematical problemsGPT-5.2 Extra High100% AIME 2025
ValidationDifferent modelFresh perspective

Speck Command Recommendations

Project Level

CommandModelReason
/project-domainOpus 4.5Deep domain understanding
/project-architectureOpus 4.5Complex system design
/project-constitutionOpus 4.5Principle extraction
/project-validateDifferent modelCross-validation
Other project commandsSonnet 4.5Good balance

Story Level

CommandModelReason
/story-implementComposer 1Speed + Cursor integration
/story-tasksComposer 1 / Gemini 3 FlashFast structured output
/story-planSonnet 4.5Reasoning needed
/story-validateDifferent from implementerCross-validation

When NOT to Prompt

Don't prompt for model switch if:

  • User just switched models (avoid flip-flopping)
  • Task is simple and current model is adequate
  • User has explicitly stated model preference
  • Switching would interrupt flow for minimal gain

MAX Mode Guidance

If you detect a task that would benefit from MAX mode (project-wide refactoring, large codebase navigation, >25 tool calls needed), inform the user:

code
💡 **MAX Mode**: This task spans multiple modules and may need >25 tool calls. 
Consider enabling MAX mode, but be aware it uses token-based pricing ($5-60+ per complex request).

Avoid MAX mode for single-file edits, small features, or bug fixes.

Cost Awareness

When budget is mentioned or implied:

  • Default to Gemini 3 Flash for routine work
  • Reserve Opus 4.5 for critical decisions only
  • Note that Opus 4.5's higher per-token cost often results in lower total cost due to 76% better token efficiency