Qwen Voice (ASR + TTS)

Name: qwen-voice
Rating: 92
Author: ada20204

Use the bundled scripts. Configure DASHSCOPE_API_KEY in one of:

ASR (speech → text)

bash

python3 skills/qwen-voice/scripts/qwen_asr.py --in /path/to/audio.ogg

bash

python3 skills/qwen-voice/scripts/qwen_asr.py --in /path/to/audio.ogg --timestamps --chunk-sec 3

Notes:

bash

python3 skills/qwen-voice/scripts/qwen_tts.py --text '你好，我是 Pi。' --voice Cherry --out /tmp/out.ogg

bash

python3 skills/qwen-voice/scripts/qwen_voice_clone.py --in ./voice_sample.ogg --name george --out work/qwen-voice/george.voice.json

bash

python3 skills/qwen-voice/scripts/qwen_tts.py --text '你好，我是 George。' --voice-profile work/qwen-voice/george.voice.json --out /tmp/out.ogg

Notes:

•When user sends voice message/audio: run ASR and reply with the transcribed text.
•When user explicitly asks for voice reply: run TTS and send the generated .ogg as a voice note.