Voice Activity Detection (VAD)

SKILL.md
--- frontmatter
name: Voice Activity Detection (VAD)
description: Detect speech segments in audio using VAD tools such as Silero VAD, SpeechBrain VAD, or WebRTC VAD. Use when preprocessing audio for speaker diarization, filtering silence, or segmenting audio into speech chunks. Choose Silero VAD for short segments, SpeechBrain VAD for general-purpose use, or WebRTC VAD for lightweight applications.

Voice Activity Detection (VAD)

When to Use

  • Preprocessing audio before speaker diarization
  • Filtering out silence and noise
  • Segmenting audio into speech chunks
  • Improving diarization accuracy by focusing on speech regions
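
To make the use cases above concrete, here is a minimal sketch of what "segmenting audio into speech chunks" produces. It is not Silero, SpeechBrain, or WebRTC VAD: it swaps in a plain RMS energy threshold as a stand-in so the example runs with the standard library alone. The function name `detect_speech` and the parameters `frame_ms`, `threshold`, and `min_speech_ms` are hypothetical, chosen only for illustration, and samples are assumed to be floats in [-1.0, 1.0].

```python
import math


def detect_speech(samples, sample_rate, frame_ms=30, threshold=0.02, min_speech_ms=90):
    """Return (start_sec, end_sec) tuples for runs of high-energy frames.

    Illustrative energy-threshold VAD, not a model-based detector.
    All parameter names and defaults are assumptions for this sketch.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    n_frames = len(samples) // frame_len
    runs = []      # (first_frame, end_frame_exclusive) for each active run
    start = None   # index of the first frame in the current run, if any

    for i in range(n_frames):
        frame = samples[i * frame_len:(i + 1) * frame_len]
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        is_speech = rms > threshold
        if is_speech and start is None:
            start = i                      # a run begins
        elif not is_speech and start is not None:
            runs.append((start, i))        # the run ended at the previous frame
            start = None
    if start is not None:
        runs.append((start, n_frames))     # audio ended during a run

    # Drop runs shorter than min_speech_ms, then convert frames to seconds.
    min_frames = math.ceil(min_speech_ms / frame_ms)
    frame_sec = frame_len / sample_rate
    return [(s * frame_sec, e * frame_sec) for s, e in runs if e - s >= min_frames]
```

For example, passing 0.3 s of silence, 0.6 s of a 220 Hz tone, and another 0.3 s of silence at 16 kHz returns a single segment spanning roughly 0.3 s to 0.9 s. A diarization pipeline would then process only those spans. An energy threshold also fires on loud non-speech noise, which is one reason to prefer the dedicated VAD tools named in this skill.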