groq-whisper

Fast cloud speech-to-text via the Groq Whisper API (replaces local openai-whisper).

SKILL.md
--- frontmatter
name: groq-whisper
description: Fast cloud speech-to-text via Groq Whisper API (replaces local openai-whisper).
homepage: https://console.groq.com/docs/speech-text
metadata: {"clawdbot":{"emoji":"🎙️","requires":{"env":["GROQ_API_KEY"]}}}

Groq Whisper — Audio Transcription

Transcribe audio files using the Groq API (whisper-large-v3-turbo).

Quick Start

```bash
node skills/groq-whisper/transcribe.js /path/to/audio.mp3
node skills/groq-whisper/transcribe.js /path/to/audio.m4a --language de
node skills/groq-whisper/transcribe.js /path/to/audio.mp3 --translate --format txt
```

Programmatic

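```js
const { transcribe } = require("./skills/groq-whisper/transcribe");
// Top-level await is not available in CommonJS, so wrap the call in an async function.
(async () => {
  const result = await transcribe("/path/to/file.mp3", { language: "de" });
  console.log(result.text);
})();
```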
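
The source of transcribe.js is not shown here. As a rough sketch of what such a call might do under the hood, the following posts a file to Groq's OpenAI-compatible transcription endpoint using the `fetch`, `FormData`, and `Blob` globals available in Node 18+. The function name `transcribeSketch` and its option names are hypothetical, and the real module may differ.

```javascript
const fs = require("node:fs/promises");
const path = require("node:path");

const ENDPOINT = "https://api.groq.com/openai/v1/audio/transcriptions";

// Hypothetical stand-in for the skill's transcribe(); not the actual implementation.
async function transcribeSketch(filePath, { language, apiKey = process.env.GROQ_API_KEY } = {}) {
  const form = new FormData();
  // Read the audio into memory and attach it as a multipart file field.
  form.append("file", new Blob([await fs.readFile(filePath)]), path.basename(filePath));
  form.append("model", "whisper-large-v3-turbo");
  if (language) form.append("language", language); // ISO-639-1 code, e.g. "de"

  const res = await fetch(ENDPOINT, {
    method: "POST",
    headers: { Authorization: `Bearer ${apiKey}` },
    body: form, // fetch sets the multipart Content-Type header itself
  });
  if (!res.ok) throw new Error(`Groq API error ${res.status}: ${await res.text()}`);
  return res.json(); // the default JSON response carries a `text` field
}
```

Reading the whole file into memory keeps the sketch short, which seems acceptable given the 25 MB upload limit.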

Models

| Model | Use Case |
| --- | --- |
| whisper-large-v3-turbo | Default transcription (fast) |
| whisper-large-v3 | Translation / higher accuracy |
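
The model choice above can be expressed as a small lookup. This is only a sketch: `selectModel` is a hypothetical helper, and routing `--translate` to Groq's `/audio/translations` endpoint is an assumption about how the CLI flag is handled.

```javascript
const BASE_URL = "https://api.groq.com/openai/v1/audio";

// Hypothetical helper: picks the model and endpoint following the Models table.
function selectModel({ translate = false } = {}) {
  return translate
    ? { model: "whisper-large-v3", url: `${BASE_URL}/translations` }
    : { model: "whisper-large-v3-turbo", url: `${BASE_URL}/transcriptions` };
}
```

Calling `selectModel()` with no arguments yields the turbo model, matching the default described for plain transcription.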

Env

GROQ_API_KEY: Groq API key (required).
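
A minimal sketch of a fail-fast check for the `GROQ_API_KEY` variable that the frontmatter metadata marks as required. The helper name `getApiKey` is hypothetical and not part of the skill.

```javascript
// Hypothetical helper: read the key from an env-like object, failing early if it is absent.
function getApiKey(env = process.env) {
  const key = (env.GROQ_API_KEY || "").trim();
  if (!key) {
    throw new Error("GROQ_API_KEY is not set; export it before running the skill.");
  }
  return key;
}
```

Passing the environment as a parameter makes the check easy to exercise without touching the real `process.env`.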

Supported Formats

mp3, mp4, mpeg, mpga, m4a, wav, webm — max 25 MB.
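
These limits can be checked locally before uploading. The sketch below is illustrative only: `checkAudioFile` is a hypothetical helper, and it assumes "25 MB" means 25 × 1024 × 1024 bytes.

```javascript
const SUPPORTED = new Set(["mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"]);
const MAX_BYTES = 25 * 1024 * 1024; // assumption: 25 MB interpreted as 25 MiB

// Hypothetical pre-flight check: returns an error message, or null if the file looks acceptable.
function checkAudioFile(fileName, sizeBytes) {
  const dot = fileName.lastIndexOf(".");
  const ext = dot === -1 ? "" : fileName.slice(dot + 1).toLowerCase();
  if (!SUPPORTED.has(ext)) return `Unsupported format: ${ext || "(no extension)"}`;
  if (sizeBytes > MAX_BYTES) return `File too large: ${sizeBytes} bytes (limit ${MAX_BYTES})`;
  return null;
}
```

In practice the size would come from something like `fs.statSync(file).size`; it is passed in here so the check stays a pure function.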