Fast speech-to-text for voice notes and audio files through Groq's OpenAI-compatible transcription endpoint. Use it when you want cloud transcription via Groq instead of running Whisper locally.
Best for:
You need a Groq API key. Groq often offers a free developer tier or trial credits for new users. Get one from:
If OpenClaw is already running and configured, you can simply ask your assistant:
The assistant can place the key into ~/.openclaw/openclaw.json for you.
Set the GROQ_API_KEY environment variable, or configure the key in ~/.openclaw/openclaw.json under:
{
  "skills": {
    "entries": {
      "groq-voice-transcribe": {
        "apiKey": "GROQ_KEY_HERE"
      }
    }
  }
}
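The two places a key can live can be illustrated with a small lookup helper. This is only a sketch: how transcribe.sh or OpenClaw actually resolves the key is not shown here, so the precedence (environment variable first, then the config file) is an assumption. The function name resolve_groq_key and the OPENCLAW_CONFIG override are hypothetical, and the config lookup assumes jq is installed.

```shell
#!/bin/sh
# Hypothetical helper: print the Groq API key, or fail if none is found.
# Assumption: the environment variable takes precedence over the config file.
# OPENCLAW_CONFIG is an illustrative override, not a documented OpenClaw setting.
resolve_groq_key() {
  config="${OPENCLAW_CONFIG:-$HOME/.openclaw/openclaw.json}"

  # 1. Prefer the environment variable when it is set and non-empty.
  if [ -n "${GROQ_API_KEY:-}" ]; then
    printf '%s\n' "$GROQ_API_KEY"
    return 0
  fi

  # 2. Otherwise read skills.entries["groq-voice-transcribe"].apiKey via jq.
  if [ -f "$config" ] && command -v jq >/dev/null 2>&1; then
    key=$(jq -r '.skills.entries["groq-voice-transcribe"].apiKey // empty' "$config")
    if [ -n "$key" ]; then
      printf '%s\n' "$key"
      return 0
    fi
  fi

  echo "No Groq API key found" >&2
  return 1
}
```

Calling `resolve_groq_key >/dev/null || echo "configure a key first"` is one way to check the setup before transcribing.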
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg
Defaults:
Model: whisper-large-v3-turbo
Output: <input>.txt
# Basic transcript
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg
# Chinese voice message
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --language zh --prompt "中文普通话,日常聊天"
# Save to a custom file
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --out /tmp/transcript.txt
# Verbose JSON output
{baseDir}/scripts/transcribe.sh /path/to/audio.ogg --json --out /tmp/transcript.json
--model <name>: transcription model (default whisper-large-v3-turbo)
--out <path>: output file path
--language <code>: hint the spoken language, for example zh, en, ja
--prompt <text>: optional context or spelling hint
--json: write verbose JSON instead of plain text
Passing --language zh often helps for Chinese voice notes.
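For orientation, the options above line up with form fields of Groq's OpenAI-compatible transcription endpoint. The sketch below is not the contents of transcribe.sh, which is not shown here; it is an assumption about what an equivalent request might look like. The function name groq_transcribe and the DRY_RUN variable are hypothetical, and the mapping of --json to response_format=verbose_json is a guess based on the option's description.

```shell
#!/bin/sh
# Hypothetical stand-in for the request transcribe.sh presumably sends.
# Usage: groq_transcribe <audio> [model] [language] [prompt] [response_format]
groq_transcribe() {
  audio=$1
  model=${2:-whisper-large-v3-turbo}   # default model named in this document
  language=${3:-}
  prompt=${4:-}
  format=${5:-text}                    # "verbose_json" would correspond to --json

  # Build the curl argument list in the positional parameters.
  set -- -sS --fail https://api.groq.com/openai/v1/audio/transcriptions \
    -H "Authorization: Bearer ${GROQ_API_KEY:-}" \
    -F "file=@${audio}" \
    -F "model=${model}" \
    -F "response_format=${format}"

  # Optional fields are only sent when provided.
  if [ -n "$language" ]; then
    set -- "$@" --form-string "language=${language}"
  fi
  if [ -n "$prompt" ]; then
    # --form-string sends the text literally (no special handling of @, <, ;type=).
    set -- "$@" --form-string "prompt=${prompt}"
  fi

  if [ -n "${DRY_RUN:-}" ]; then
    printf '%s\n' "$@"   # print one argument per line instead of calling the API
  else
    curl "$@"
  fi
}
```

Running `DRY_RUN=1 groq_transcribe /path/to/audio.ogg "" zh` prints the arguments without sending anything, which is a cheap way to inspect the request before spending API quota.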