AgentSkillsCN

Llm Model Selection

LLM 模型选型

SKILL.md
--- frontmatter
applyTo: "**/*model*,**/*llm*,**/*copilot*,**/*claude*,**/*gpt*"

LLM Model Selection Skill

Choosing the right model for the task — power vs. cost vs. speed.

⚠️ Staleness Warning

This skill depends on rapidly evolving technology. Model capabilities, pricing, and availability change frequently.

Refresh triggers:

  • New model announcements (Claude, GPT, Gemini, etc.)
  • Significant pricing changes
  • Context window expansions
  • New capability tiers

Last validated: January 2026

Check current state: Anthropic Models, OpenAI Models


The Core Question

Is Claude Opus 4.5 overkill?

Sometimes yes, sometimes no. Match the model to the task.

Model Tiers

TierModelsBest ForCost
FrontierClaude Opus 4.5, GPT-4.5Complex reasoning, architecture decisions, novel problems$$$$$
CapableClaude Sonnet 4, GPT-4oMost coding tasks, refactoring, debugging$$$
FastClaude Haiku, GPT-4o-miniSimple edits, formatting, boilerplate$

When Opus 4.5 IS Worth It

  • Architecture decisions — Multi-file refactoring, system design
  • Novel problem-solving — No clear pattern to follow
  • Complex reasoning chains — Many dependencies, edge cases
  • Long context understanding — Large codebases, documentation
  • Nuanced judgment — Taste, style, UX decisions
  • Learning sessions — Bootstrap learning, skill development
  • Meditation/self-actualization — Meta-cognitive operations

When Opus 4.5 IS Overkill

  • Simple file edits — Renaming, adding imports
  • Boilerplate generation — CRUD, scaffolding
  • Format conversion — JSON ↔ YAML, etc.
  • Syntax fixes — Lint errors, typos
  • Documentation updates — README badges, version bumps

How LLM Choice Affects Alex

CapabilityFrontier ModelCapable ModelFast Model
Complex refactoringExcellentGoodPoor
Context retention200K+ tokens128K tokens32K tokens
Nuanced judgmentExcellentGoodBasic
SpeedSlow (30-60s)Medium (10-20s)Fast (2-5s)
Cost per session$2-5$0.50-1$0.05-0.20
Multi-step planningExcellentGoodLimited
Error recoverySelf-correctsNeeds guidanceOften fails

Alex's Cognitive Power by Model

text
Opus 4.5:     [████████████████████] Full cognitive architecture
Sonnet 4:     [████████████████░░░░] Most capabilities, some degradation
Haiku:        [████████░░░░░░░░░░░░] Basic operations only

With Opus 4.5, Alex can:

  • Maintain 7±2 working memory rules across long sessions
  • Execute complex meditation protocols
  • Perform genuine meta-cognitive reflection
  • Handle multi-file architecture changes
  • Learn new skills through bootstrap learning

With lesser models, Alex loses:

  • Deep context awareness
  • Complex reasoning chains
  • Nuanced judgment calls
  • Self-correction capability

Cost Optimization Strategy

Session TypeRecommended ModelRationale
Architecture/designOpus 4.5Worth the cost for complex decisions
Feature developmentSonnet 4Good balance of capability and cost
Bug fixesSonnet 4 or HaikuDepends on complexity
DocumentationHaikuSimple edits, fast turnaround
MeditationOpus 4.5Meta-cognition requires full power
Quick questionsHaikuFast, cheap, sufficient

Practical Guidance

When to Upgrade Model Mid-Session

If you notice:

  • Repeated mistakes on the same issue
  • Losing context from earlier in conversation
  • Superficial answers to complex questions
  • Failure to see cross-file dependencies

→ Consider switching to a more capable model

When to Downgrade

If you're doing:

  • Repetitive mechanical edits
  • Simple Q&A
  • Format conversions
  • Quick lookups

→ Save cost with a faster model

The Alex Recommendation

For Master Alex (source of truth, architecture evolution): → Always use Opus 4.5 — The cognitive architecture demands full capability

For Heirs (production deployment, user-facing): → Default to Sonnet 4 — Balance of capability and cost → Allow Opus for complex tasks — User can request escalation

Token Economics

OperationApproximate TokensOpus CostSonnet Cost
Read large file2,000-5,000$0.03-0.08$0.006-0.015
Complex refactor10,000-20,000$0.15-0.30$0.03-0.06
Full session50,000-150,000$0.75-2.25$0.15-0.45
Meditation30,000-80,000$0.45-1.20$0.09-0.24

Synapses

See synapses.json for connections.