AgentSkillsCN

AudioEditor

AI驱动的音视频编辑——转录、智能剪辑检测、自动交叉淡入淡出编辑,以及可选的云端润色。适用于清理音频、编辑音频、去除填充词、清理播客、去除“嗯”声、修复音频、剪掉无效片段、润色音频、清理录音、转录与编辑。

SKILL.md
--- frontmatter
name: AudioEditor
description: AI-powered audio/video editing — transcription, intelligent cut detection, automated editing with crossfades, and optional cloud polish. USE WHEN clean audio, edit audio, remove filler words, clean podcast, remove ums, fix audio, cut dead air, polish audio, clean recording, transcribe and edit.

AudioEditor

AI-powered audio/video editing — transcription, intelligent cut detection, automated editing with crossfades, and optional cloud polish.

Customization

Before executing, check for user customizations at: ~/.claude/PAI/USER/SKILLCUSTOMIZATIONS/AudioEditor/

If this directory exists, load and apply any PREFERENCES.md, configurations, or resources found there. These override default behavior. If the directory does not exist, proceed with skill defaults.

Voice Notification

You MUST send this notification BEFORE doing anything else when this skill is invoked.

  1. Send voice notification:

    bash
    curl -s -X POST http://localhost:8888/notify \
      -H "Content-Type: application/json" \
      -d '{"message": "Running the WORKFLOWNAME workflow in the AudioEditor skill to ACTION"}' \
      > /dev/null 2>&1 &
    
  2. Output text notification:

    code
    Running the **WorkflowName** workflow in the **AudioEditor** skill to ACTION...
    

This is not optional. Execute this curl command immediately upon skill invocation.

Workflow Routing

WorkflowTriggerFile
Clean"clean audio", "edit audio", "remove filler words", "clean podcast", "remove ums", "cut dead air", "polish audio"Workflows/Clean.md

Pipeline Architecture

code
Audio Input
    |
[Transcribe] Whisper word-level timestamps (insanely-fast-whisper on MPS)
    |
[Analyze] Claude classifies each segment:
    |   KEEP / CUT_FILLER / CUT_FALSE_START / CUT_EDIT_MARKER / CUT_STUTTER / CUT_DEAD_AIR
    |   Distinguishes rhetorical emphasis from accidental repetition
    |
[Edit] ffmpeg executes cuts:
    |   - 40ms qsin crossfades at every edit point
    |   - Room tone extraction and gap filling
    |   - Breath attenuation (50% volume, not removal)
    |
[Polish] (optional) Cleanvoice API final pass:
        - Mouth sound removal
        - Remaining filler detection
        - Loudness normalization

Output: cleaned MP3/WAV

Tools

ToolCommandPurpose
Transcribebun ~/.claude/skills/Utilities/AudioEditor/Tools/Transcribe.ts <file>Word-level transcription via Whisper
Analyzebun ~/.claude/skills/Utilities/AudioEditor/Tools/Analyze.ts <transcript.json>LLM-powered edit classification
Editbun ~/.claude/skills/Utilities/AudioEditor/Tools/Edit.ts <file> <edits.json>Execute cuts with crossfades + room tone
Polishbun ~/.claude/skills/Utilities/AudioEditor/Tools/Polish.ts <file>Cleanvoice API cloud polish
Pipelinebun ~/.claude/skills/Utilities/AudioEditor/Tools/Pipeline.ts <file> [--polish]Full end-to-end pipeline

API Keys Required

ServiceEnv VarWhere to Get
Anthropic (for analyze step)ANTHROPIC_API_KEYAlready set via Claude Code
Cleanvoice (for polish step, optional)CLEANVOICE_API_KEYcleanvoice.ai Dashboard Settings API Key

Examples

Example 1: Clean a podcast recording

code
User: "clean up the audio on this podcast file"
-> Invokes Clean workflow
-> Runs full pipeline: transcribe -> analyze -> edit
-> Outputs cleaned MP3 with filler words, stutters, and dead air removed

Example 2: Preview edits before applying

code
User: "show me what edits you'd make to this recording"
-> Invokes Clean workflow with --preview flag
-> Transcribes and analyzes, shows proposed edits without modifying audio
-> User reviews edit list, then runs again to apply

Example 3: Aggressive clean with cloud polish

code
User: "aggressively clean this audio and polish it"
-> Invokes Clean workflow with --aggressive --polish flags
-> Tighter thresholds for filler detection
-> Cleanvoice API pass for mouth sounds and normalization