End-to-end voice workflow with Deepgram STT and TTS. Use when transcribing voice messages, generating spoken replies, or building a shell-based audio pipelin...
Use this skill for a complete speech workflow:
This skill is the right choice when the task is broader than plain transcription and needs an input-audio to output-audio pipeline.
{baseDir}/scripts/deepgram-transcribe.sh /path/to/audio.ogg
{baseDir}/scripts/deepgram-tts.sh "你好,我是 Neko。"
{baseDir}/scripts/neko-voice-pipeline.sh /path/to/audio.ogg --reply "收到啦,这是语音回复测试。"
Set DEEPGRAM_API_KEY before use.
The bundled scripts also fall back to reading it from:
/root/.openclaw/.env
deepgram-transcribe.sh when
deepgram-tts.sh when
neko-voice-pipeline.sh when
deepgram-transcribe.sh writes:
deepgram-tts.sh writes:
neko-voice-pipeline.sh prints JSON with:
out_dir
transcript_path
transcript
reply_audio_path
This makes it easy to wire into scripts or adapters.
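As a rough illustration of wiring the pipeline's JSON into another script, the sketch below pulls two of the fields listed above out of a sample payload. It assumes the output is a flat JSON object whose values are plain strings without escaped quotes; the `json_field` helper, the sample paths, and the `.mp3` extension are hypothetical and not part of the bundled scripts. Where `jq` is installed, it would be a more robust choice than this `sed` pattern.

```shell
# Hypothetical helper: print the value of a top-level string field read from stdin.
# Assumes flat JSON with simple string values (no escaped quotes, no nesting).
json_field() {
  sed -n "s/.*\"$1\"[[:space:]]*:[[:space:]]*\"\([^\"]*\)\".*/\1/p"
}

# Sample payload for illustration only; real values come from neko-voice-pipeline.sh.
sample='{"out_dir":"/tmp/neko-out","transcript_path":"/tmp/neko-out/transcript.txt","transcript":"hello","reply_audio_path":"/tmp/neko-out/reply.mp3"}'

transcript=$(printf '%s' "$sample" | json_field transcript)
reply_audio=$(printf '%s' "$sample" | json_field reply_audio_path)

echo "transcript: $transcript"
echo "reply audio: $reply_audio"
```

In real use, the `sample` string would be replaced by the captured output of `neko-voice-pipeline.sh`, and the extracted `reply_audio_path` could then be handed to whatever adapter sends the audio back.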
Prefer this skill for:
deepgram-transcribe.sh defaults to model=nova-2 and language=zh.
deepgram-tts.sh defaults to model=aura-2-luna-en; override the model when a different voice is preferred.
Read these files when needed:
references/stt-notes.md for transcription details
references/tts-notes.md for speech synthesis details
references/pipeline-notes.md for end-to-end pipeline behavior