AgentSkillsCN

Cerebras Inference

利用此工具,通过 LiteLLM 和 OpenRouter,结合 Cerebras 推理提供商,编写代码以调用大语言模型。

SKILL.md
--- frontmatter
name: Cerebras Inference
description: Use this to write code to call an LLM using LiteLLM and OpenRouter with the Cerebras inference provider

Calling an LLM via Cerebras

These instructions allow you write code to call an LLM with Cerebras specified as the inference provider.
This method uses LiteLLM and OpenRouter.

Setup

The OPENROUTER_API_KEY must be set in the .env file and loaded in as an environment variable.

The uv project must include litellm and pydantic. uv add litellm pydantic

Code snippets

Use code like these examples in order to use Cerebras.

Imports and constants

python
from litellm import completion
MODEL = "openrouter/openai/gpt-oss-120b"
EXTRA_BODY = {"provider": {"order": ["cerebras"]}}

Code to call via Cerebras for a text response

python
response = completion(model=MODEL, messages=messages, reasoning_effort="low", extra_body=EXTRA_BODY)
result = response.choices[0].message.content

Code to call via Cerebras for a Structured Outputs response

python
response = completion(model=MODEL, messages=messages, response_format=MyBaseModelSubclass, reasoning_effort="low", extra_body=EXTRA_BODY)
result = response.choices[0].message.content
result_as_object = MyBaseModelSubclass.model_validate_json(result)