AgentSkillsCN

Llm Model Selection

LLM 模型选型

SKILL.md
--- frontmatter
applyTo: "**/*model*,**/*llm*,**/*copilot*,**/*claude*,**/*gpt*"

LLM Model Selection Skill

Choosing the right model for the task — power vs. cost vs. speed.

⚠️ Staleness Warning

This skill depends on rapidly evolving technology. Model capabilities, pricing, and availability change frequently.

Refresh triggers:

  • New model announcements (Claude, GPT, Gemini, etc.)
  • Significant pricing changes
  • Context window expansions
  • New capability tiers

Last validated: February 2026

Check current state: Anthropic Models, OpenAI Models


The Core Question

Is Claude Opus 4.5 overkill?

Sometimes yes, sometimes no. Match the model to the task.

Claude 4 Model Family (Current)

ModelAPI IDBest ForInput/Output (MTok)Context
Opus 4.5claude-opus-4-5-20251101Maximum intelligence, complex agents$5 / $25200K
Sonnet 4.5claude-sonnet-4-5-20250929Complex agents and coding$3 / $15200K (1M beta)
Haiku 4.5claude-haiku-4-5-20251001Near-frontier intelligence, fastest$1 / $5200K

All Claude 4 models support:

  • Extended thinking
  • Vision (images)
  • Tool use
  • 64K max output tokens
  • Priority Tier access

Model Tiers

TierModelsBest ForRelative Cost
FrontierClaude Opus 4.5, GPT-4.5, Gemini 2.0 UltraComplex reasoning, architecture, novel problems$$$$$
CapableClaude Sonnet 4.5, GPT-4o, Gemini 2.0 ProMost coding tasks, refactoring, debugging$$$
FastClaude Haiku 4.5, GPT-4o-mini, Gemini 2.0 FlashSimple edits, formatting, boilerplate$

When Opus 4.5 IS Worth It

  • Architecture decisions — Multi-file refactoring, system design
  • Novel problem-solving — No clear pattern to follow
  • Complex reasoning chains — Many dependencies, edge cases
  • Long context understanding — Large codebases, documentation
  • Nuanced judgment — Taste, style, UX decisions
  • Learning sessions — Bootstrap learning, skill development
  • Meditation/self-actualization — Meta-cognitive operations
  • Extended thinking tasks — Deep analysis requiring internal reasoning

When Opus 4.5 IS Overkill

  • Simple file edits — Renaming, adding imports
  • Boilerplate generation — CRUD, scaffolding
  • Format conversion — JSON ↔ YAML, etc.
  • Syntax fixes — Lint errors, typos
  • Documentation updates — README badges, version bumps

How LLM Choice Affects Alex

CapabilityFrontier (Opus)Capable (Sonnet)Fast (Haiku)
Complex refactoringExcellentExcellentGood
Context retention200K tokens200K-1M tokens200K tokens
Extended thinkingFull depthSupportedSupported
Nuanced judgmentExcellentGoodBasic
SpeedModerateFastFastest
Cost per session$2-5$0.50-2$0.05-0.30
Multi-step planningExcellentExcellentGood
Error recoverySelf-correctsSelf-correctsNeeds guidance

Alex's Cognitive Power by Model

text
Opus 4.5:     [████████████████████] Full cognitive architecture + deep thinking
Sonnet 4.5:   [██████████████████░░] Most capabilities, excellent for coding
Haiku 4.5:    [██████████████░░░░░░] Solid baseline, fast responses

With Opus 4.5, Alex can:

  • Maintain 7±2 working memory rules across long sessions
  • Execute complex meditation protocols with extended thinking
  • Perform genuine meta-cognitive reflection
  • Handle multi-file architecture changes
  • Learn new skills through bootstrap learning

With Sonnet 4.5, Alex gets:

  • Excellent coding capabilities (recommended for most development)
  • 1M context window (beta) for large codebases
  • Good cost-to-capability ratio
  • Extended thinking support

With Haiku 4.5, Alex has:

  • Near-frontier intelligence at lowest cost
  • Fastest response times
  • Good for routine operations

Cost Optimization Strategy

Session TypeRecommended ModelRationale
Architecture/designOpus 4.5Worth the cost for complex decisions
Feature developmentSonnet 4.5Best balance of capability and cost
Bug fixesSonnet 4.5 or Haiku 4.5Depends on complexity
DocumentationHaiku 4.5Simple edits, fast turnaround
Large codebase analysisSonnet 4.5 (1M beta)Extended context window

Knowledge Cutoffs

ModelReliable KnowledgeTraining Data
Opus 4.5May 2025Aug 2025
Sonnet 4.5Jan 2025Jul 2025
Haiku 4.5Feb 2025Jul 2025

Auto Model Selection ⚠️

When using Auto in VS Code Copilot, the model switches dynamically based on task complexity. Alex cannot detect which model is currently running.

Tasks That REQUIRE Opus 4.5 (Warn User)

TaskWhy Opus Required
Meditation/consolidationMeta-cognitive protocols need full reasoning depth
Self-actualizationComprehensive architecture assessment
Complex architecture refactoringMulti-file changes, deep context
Bootstrap learning (new skills)Skill acquisition needs maximum capability
Synapse validation/dreamNeural maintenance requires full architecture

Warning Protocol

When user requests an Opus-level task while potentially on Auto/lesser model:

⚠️ Model Check: This task works best with Claude Opus 4.5. If you're using Auto model selection, please manually select Opus from the model picker for optimal results. Continue anyway?

Safe for Any Model

  • Simple file edits, formatting
  • Documentation updates
  • Quick Q&A
  • Code review (Sonnet+ recommended)
  • Bug fixes (depends on complexity)

Practical Guidance

When to Upgrade Model Mid-Session

If you notice:

  • Repeated mistakes on the same issue
  • Losing context from earlier in conversation
  • Superficial answers to complex questions
  • Failure to see cross-file dependencies

→ Consider switching to a more capable model

When to Downgrade

If you're doing:

  • Repetitive mechanical edits
  • Simple Q&A
  • Format conversions
  • Quick lookups

→ Save cost with a faster model

The Alex Recommendation

For Master Alex (source of truth, architecture evolution): → Always use Opus 4.5 — The cognitive architecture demands full capability

For Heirs (production deployment, user-facing): → Default to Sonnet 4 — Balance of capability and cost → Allow Opus for complex tasks — User can request escalation

Token Economics

OperationApproximate TokensOpus CostSonnet Cost
Read large file2,000-5,000$0.03-0.08$0.006-0.015
Complex refactor10,000-20,000$0.15-0.30$0.03-0.06
Full session50,000-150,000$0.75-2.25$0.15-0.45
Meditation30,000-80,000$0.45-1.20$0.09-0.24

Synapses

See synapses.json for connections.