voiceover

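Generate high-quality voiceover audio with ElevenLabs, with word-level timestamps that make video sync straightforward. Suited to product demo narration, video voiceover, podcast intros, or any other text-to-speech use case. Keywords: voiceover, TTS, text to speech, ElevenLabs, narration, audio, timestamps.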

SKILL.md
--- frontmatter
name: voiceover
description: |
  Generate high-quality voiceover audio with ElevenLabs. Includes word-level
  timestamps for video sync. Use when: creating demo narration, video voiceover,
  podcast intros, or any TTS need. Keywords: voiceover, TTS, text to speech,
  ElevenLabs, narration, audio, timestamps.
argument-hint: "[script text or file path]"

Voiceover (ElevenLabs)

What This Does

  • Accept script text or file path.
  • Preprocess for TTS: expand acronyms, normalize numbers.
  • Generate via ElevenLabs API.
  • Return audio + optional word timestamps.
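
The preprocessing step listed above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: the `ACRONYMS` table, the `preprocess` and `number_to_words` names, and the choice to spell out only standalone integers below 1000 are all assumptions made for the example.

```python
import re

# Hypothetical acronym table; the skill's real list is not given in the source.
ACRONYMS = {"API": "A P I", "TTS": "text to speech", "UI": "U I"}

ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
        "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
        "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
        "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer in the range 0..999."""
    if n < 20:
        return ONES[n]
    if n < 100:
        tens, ones = divmod(n, 10)
        return TENS[tens] + ("-" + ONES[ones] if ones else "")
    hundreds, rest = divmod(n, 100)
    return ONES[hundreds] + " hundred" + (" " + number_to_words(rest) if rest else "")


def preprocess(text: str) -> str:
    """Expand known acronyms and spell out small standalone integers."""
    # Replace all-caps tokens that appear in the table; leave unknown ones alone.
    text = re.sub(r"\b[A-Z]{2,}\b",
                  lambda m: ACRONYMS.get(m.group(0), m.group(0)), text)

    # Match integers that are not part of a decimal, currency amount,
    # percentage, or alphanumeric token; those are left untouched.
    def spell(m: re.Match) -> str:
        n = int(m.group(0))
        return number_to_words(n) if n < 1000 else m.group(0)

    return re.sub(r"(?<![\w.,$])\d+(?![.,]?\w|%)", spell, text)


print(preprocess("The API handles 42 requests."))
```

Spelling things out before synthesis gives the TTS engine less to guess at. A production version would likely also need rules for dates, currencies, and units.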

Prerequisites

  • ELEVENLABS_API_KEY env var set.
  • ElevenLabs Creator plan ~ $5/mo for ~100k chars.
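
With the API key from the prerequisites in place, the generation request might be assembled as in the sketch below. It assumes ElevenLabs' public REST endpoint `POST /v1/text-to-speech/{voice_id}` (with a `/with-timestamps` variant) and the `xi-api-key` header. The `build_tts_request` helper, the default `model_id`, and the placeholder voice ID are illustrative, not taken from the skill. The endpoint expects a voice ID, so a name such as `adam` would first need to be mapped to its ID.

```python
import json
import os
import urllib.request

API_BASE = "https://api.elevenlabs.io/v1/text-to-speech"


def build_tts_request(text, voice_id, api_key=None, with_timestamps=False,
                      model_id="eleven_multilingual_v2"):
    """Build (but do not send) a text-to-speech request.

    voice_id is the ElevenLabs voice ID, not a display name; model_id is an
    assumed default for this sketch.
    """
    key = api_key or os.environ.get("ELEVENLABS_API_KEY")
    if not key:
        raise RuntimeError("ELEVENLABS_API_KEY is not set")

    url = f"{API_BASE}/{voice_id}"
    if with_timestamps:
        # Assumed variant that returns JSON with base64 audio plus alignment.
        url += "/with-timestamps"

    body = json.dumps({"text": text, "model_id": model_id}).encode("utf-8")
    headers = {"xi-api-key": key, "Content-Type": "application/json"}
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# Placeholder voice ID and key, for illustration only.
req = build_tts_request("Welcome to Heartbeat...", "VOICE_ID", api_key="test-key")
print(req.get_method(), req.full_url)
```

Passing the result to `urllib.request.urlopen(req)` would send it. Keeping construction separate from sending makes the request easy to inspect without spending character quota.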

Usage

  • /voiceover "Welcome to Heartbeat..."
  • /voiceover demo-script.md --timestamps --voice adam
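
One way to handle the two invocation forms above is to parse the flags and then decide whether the positional argument names a file or is the script itself. The sketch below is an assumption about how that could work, not the skill's real entry point: `parse_args` and `resolve_script` are hypothetical helpers, and only the `--timestamps` and `--voice` flags shown in the usage lines are modeled.

```python
import argparse
import os


def parse_args(argv):
    """Parse the arguments shown in the usage examples."""
    parser = argparse.ArgumentParser(prog="voiceover")
    parser.add_argument("script", help="script text or path to a script file")
    parser.add_argument("--timestamps", action="store_true",
                        help="also write word-level timestamps")
    parser.add_argument("--voice", default="adam", help="voice name")
    return parser.parse_args(argv)


def resolve_script(arg: str) -> str:
    """Return file contents if arg is an existing file, else arg itself."""
    # os.path.isfile returns False (rather than raising) for strings that
    # are not valid paths, so long literal scripts are handled safely.
    if os.path.isfile(arg):
        with open(arg, encoding="utf-8") as fh:
            return fh.read()
    return arg


args = parse_args(["Welcome to Heartbeat...", "--timestamps"])
print(args.voice, args.timestamps, resolve_script(args.script))
```

Treating any existing file as a path and everything else as literal text keeps the interface to a single argument. The cost is that a mistyped file name is silently narrated as text.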

Voices

  • See skills/voiceover/references/elevenlabs-voices.md.
  • Default: adam (clear, professional).

Output

  • voiceover.mp3
  • timestamps.json (word-level timing when requested)
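
The schema of `timestamps.json` is not specified here. As far as I know, ElevenLabs' timestamped endpoint returns character-level alignment (`characters`, `character_start_times_seconds`, `character_end_times_seconds`), so word-level timing has to be derived by grouping characters between whitespace. The sketch below shows one way to do that. The output shape (`word`, `start`, `end`) and the `words_from_alignment` name are assumptions for illustration.

```python
def words_from_alignment(alignment: dict) -> list:
    """Group character-level timings into word-level entries."""
    chars = alignment["characters"]
    starts = alignment["character_start_times_seconds"]
    ends = alignment["character_end_times_seconds"]

    words = []
    current, word_start, word_end = "", None, None
    for ch, start, end in zip(chars, starts, ends):
        if ch.isspace():
            # Whitespace closes the word in progress, if any.
            if current:
                words.append({"word": current, "start": word_start, "end": word_end})
                current = ""
            continue
        if not current:
            word_start = start  # first character of a new word
        current += ch
        word_end = end  # extend the word to this character's end time

    if current:  # flush the final word
        words.append({"word": current, "start": word_start, "end": word_end})
    return words


sample = {
    "characters": ["H", "i", " ", "y", "o", "u"],
    "character_start_times_seconds": [0.0, 0.1, 0.2, 0.3, 0.4, 0.5],
    "character_end_times_seconds": [0.1, 0.2, 0.3, 0.4, 0.5, 0.6],
}
print(words_from_alignment(sample))
```

Punctuation stays attached to the preceding word in this version, which is usually what a caption or sync consumer wants.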

Integration

  • Used by /demo-video for narration sync.