Voice Clone Skill

Name: voice-clone
Rating: 76
Author: Nuva-Lab

Use this skill to clone a speaker's voice and generate text-to-speech audio.

Two-Step Process

bash

python skills/voice-clone/clone.py <audio_sample.wav> [--transcript "text"]

Creates a speaker embedding file that can be reused.

bash

python skills/voice-clone/speak.py <embedding.safetensors> "Text to speak"

Generates audio using the cloned voice.

•assets/outputs/voice_embeddings/<name>_embedding.safetensors - Reusable voice model
•assets/outputs/audio/<name>_speech.wav - Generated audio

•qwen3-tts works best with Chinese speech samples
•Cross-lingual cloning (Chinese voice → English speech) may have quality variations
•Provide reference transcript for best quality