AgentSkillsCN

audio-transcriber

使用OpenAI Whisper将音频/视频文件转录为文本。

SKILL.md
--- frontmatter
name: audio-transcriber
description: Transcribe audio/video files to text using OpenAI Whisper.
status: implemented

Audio Transcriber

Transcribe audio/video files to text using OpenAI Whisper.

Usage

node audio-transcriber/scripts/transcribe.cjs [options]

Troubleshooting

ErrorCauseFix
ffmpeg: command not foundFFmpeg not installedInstall: brew install ffmpeg (macOS) or apt install ffmpeg (Linux)
Cannot find module 'openai'OpenAI SDK not installedRun npm install openai
OPENAI_API_KEY is not setAPI key missingSet export OPENAI_API_KEY=sk-... in your shell
Error: 413 Request Entity Too LargeAudio file exceeds 25MB limitSplit file: ffmpeg -i input.mp3 -ss 0 -t 600 part1.mp3
Unsupported audio formatFile format not recognizedConvert to mp3/wav: ffmpeg -i input.ogg output.mp3

Knowledge Protocol

  • This skill adheres to the knowledge/orchestration/knowledge-protocol.md. It automatically integrates Public, Confidential (Company/Client), and Personal knowledge tiers, prioritizing the most specific secrets while ensuring no leaks to public outputs.