Minion Models
Manage Ollama models for your minion squad.
Quick commands
List installed models
bash
ollama list
Pull a model
bash
ollama pull qwen2.5-coder:1.5b
Check model info
bash
ollama show qwen2.5-coder:1.5b
Remove a model
bash
ollama rm qwen2.5-coder:0.5b
Presets
| Preset | Models | Download | RAM |
|---|---|---|---|
| nano | qwen2.5-coder:0.5b | ~350MB | ~1GB |
| small | qwen2.5-coder:1.5b | ~1GB | ~2GB |
| medium | qwen2.5-coder:7b | ~4.5GB | ~8GB |
| large | qwen2.5-coder:14b | ~9GB | ~16GB |
Pull preset models
nano:
bash
ollama pull qwen2.5-coder:0.5b
small (recommended):
bash
ollama pull qwen2.5-coder:1.5b
medium:
bash
ollama pull qwen2.5-coder:7b
large:
bash
ollama pull qwen2.5-coder:14b
Switch preset
Edit llm_gc/config/models.yaml and change the preset line:
yaml
preset: small # Change to: nano, small, medium, or large
Or use sed:
bash
sed -i.bak 's/^preset:.*/preset: medium/' llm_gc/config/models.yaml
Check disk usage
bash
# Total Ollama storage du -sh ~/.ollama/models # Per-model breakdown ls -lh ~/.ollama/models/blobs/ | head -20
Recommended models
| Task | Model | Why |
|---|---|---|
| Quick patches | qwen2.5-coder:1.5b | Fast, good enough |
| Quality patches | qwen2.5-coder:7b | Better reasoning |
| Code review | qwen2.5-coder:7b+ | Needs context |
| Simple questions | qwen2.5-coder:0.5b | Speed matters |
Alternative models
bash
# DeepSeek (alternative to Qwen) ollama pull deepseek-coder:1.3b ollama pull deepseek-coder:6.7b # CodeLlama (Meta) ollama pull codellama:7b # StarCoder ollama pull starcoder2:3b
Troubleshooting
Model not found:
bash
ollama pull <model-name>
Slow responses:
- •Try smaller model
- •Check
htopfor RAM pressure - •Reduce
--workersin swarm
Out of disk space:
bash
# Remove unused models ollama rm <model-name> # Check what's installed ollama list
Model quality issues:
- •Upgrade preset: nano → small → medium
- •Add more context with
--read - •Simplify the task