AgentSkillsCN

aria-llm

通过 LiteLLM 路由机制接入各大 LLM 提供商(MLX 本地模型、OpenRouter 免费版、Kimi 付费版)。

SKILL.md
--- frontmatter
name: aria-llm
description: Access LLM providers via LiteLLM routing (MLX local, OpenRouter FREE, Kimi paid).
metadata: {"openclaw": {"emoji": "🧠", "requires": {"anyEnv": ["MOONSHOT_KIMI_KEY", "OLLAMA_URL", "OPEN_ROUTER_KEY"]}}}

aria-llm

Access multiple LLM providers via LiteLLM routing for text generation and chat.

Model Priority (Feb 2026)

  1. Local MLX (qwen3-mlx) - FREE, fastest, no rate limits
  2. OpenRouter FREE - No cost, may have rate limits
  3. Kimi (paid) - Last resort, costs money!

Available Models

ModelProviderContextBest For
qwen3-mlxLocal MLX32KPrimary - Fast local
trinity-freeOpenRouter128KAgentic, creative
qwen3-coder-freeOpenRouter262KCode generation
chimera-freeOpenRouter164KReasoning (fast)
qwen3-next-freeOpenRouter262KRAG, tools
glm-freeOpenRouter131KAgent-focused
deepseek-freeOpenRouter164KDeep reasoning
nemotron-freeOpenRouter256KLong context
gpt-oss-freeOpenRouter131KFunction calling
kimiMoonshot256KPAID - Avoid!

Usage

bash
exec python3 /root/.openclaw/workspace/skills/run_skill.py llm <function> '<json_args>'

Functions

generate

Generate text from a prompt using specified model.

bash
exec python3 /root/.openclaw/workspace/skills/run_skill.py llm generate '{"prompt": "Explain quantum computing simply", "model": "qwen3-mlx", "temperature": 0.7}'

chat

Multi-turn conversation with message history.

bash
exec python3 /root/.openclaw/workspace/skills/run_skill.py llm chat '{"messages": [{"role": "user", "content": "Hello!"}], "model": "qwen3-mlx"}'

analyze

Analyze text for sentiment, topics, or custom analysis.

bash
exec python3 /root/.openclaw/workspace/skills/run_skill.py llm analyze '{"text": "I had a great day today!", "analysis_type": "sentiment"}'

Model Selection Guide

code
IF task = code_generation OR code_review:
    USE qwen3-coder-free (262K context, optimized for code)
ELIF task = complex_reasoning:
    USE chimera-free (fast reasoning) OR deepseek-free (deep reasoning)
ELIF task = creative_writing OR roleplay:
    USE trinity-free (best for creative)
ELIF task = long_context OR RAG:
    USE qwen3-next-free (262K) OR nemotron-free (256K)
ELSE:
    USE qwen3-mlx (default local, fastest)

API Configuration

Required environment variables:

  • OPEN_ROUTER_KEY - OpenRouter API key (for FREE models)
  • MOONSHOT_KIMI_KEY - Moonshot API key (paid fallback)
  • OLLAMA_URL - Ollama endpoint (backup local)

Python Module

This skill wraps /root/.openclaw/workspace/skills/aria_skills/llm.py