Drop an audio file and get an accurate transcript in seconds. Or type text and hear it in 5 Grok voices. 2 free uses/day — no account, no credit card.
MP3, WAV, FLAC, M4A, OGG, OPUS, WEBM, AAC · up to 25MB · transcript ready in seconds
No account. License key delivered instantly at checkout — paste it in the box below and start immediately.
$0.18 per use · cheaper than Otter, Rev, Descript
~$0.10/day for unlimited · less than a coffee
Upload any audio format up to 25MB. No account, no email.
xAI's Grok STT model processes the audio with high accuracy.
Get your transcript with speaker segments and timestamps.
Buy a credit pack or subscribe monthly if you need more uses.
MP3, WAV, FLAC, M4A, OGG, OPUS, WEBM, AAC, AIFF and MP4 video (audio extracted). Maximum 25MB per file.
No. Audio is processed in memory and the temporary file is deleted immediately after transcription. We never persist your audio files.
Your license key is delivered instantly at checkout and stored only in your browser's local storage. We don't ask for an account or email (email is optional at checkout to receive a backup copy of your key).
No. The 50-credit pack never expires. Monthly subscriptions renew each month and you can cancel anytime.
Speech-to-text is powered by xAI's Grok STT model. Text-to-speech uses xAI's Grok TTS model with 5 natural voices.
Grok STT is competitive with the best models on the market, supporting 50+ languages. Accuracy varies by audio quality and accent — clear recordings typically yield >95% accuracy.