Use xAI Grok STT without an API key: drop one audio file, get a transcript you can search, jump by timestamp, and export right in the browser. No account, 2 free previews/day, up to 100 MB per file.
Yes. xAI launched the Grok Speech-to-Text API on April 18, 2026. ScribeForge lets you use Grok STT in the browser without touching an API key: drop an audio file, search the transcript, jump to any timestamp, and export only if it is worth keeping. Speaker labels appear when the recording allows clear separation.
Best fit for meetings, podcasts, interviews, voice notes and quick research clips. Search the transcript, tap a segment to jump to that point in the audio, then unlock the full export only if it is worth keeping.
Paste the license key from your purchase email.
No signup flow, no account layer. If you pay, you get a key immediately.
Any supported audio format, up to 100 MB. No signup wall.
xAI Grok STT transcribes the audio with high accuracy across 25+ languages.
Review the transcript in place, jump by timestamp, then copy or export when you are ready. Speaker labels appear when the recording allows clear separation.
Buy a credit pack or subscribe. Key delivered instantly by email.
MP3, WAV, FLAC, M4A, OGG, OPUS, WEBM, AAC, AIFF and MP4. Maximum 100 MB per file. See the complete Grok audio formats guide for quality tips and conversion instructions.
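If you want to check a file before dropping it in, the format list and size limit above can be sketched as a small script. This is a minimal illustration, not ScribeForge's own validation: the function name `check_upload` is hypothetical, the check looks only at the file extension, and it assumes "100 MB" means 100 × 1024 × 1024 bytes.

```python
from pathlib import Path

# Extensions copied from the format list above.
SUPPORTED_EXTENSIONS = {
    "mp3", "wav", "flac", "m4a", "ogg",
    "opus", "webm", "aac", "aiff", "mp4",
}

# Assumption: the 100 MB cap is counted in binary megabytes.
MAX_BYTES = 100 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> tuple[bool, str]:
    """Return (ok, reason) for a hypothetical pre-upload check."""
    # Compare the extension case-insensitively, without the leading dot.
    ext = Path(filename).suffix.lower().lstrip(".")
    if ext not in SUPPORTED_EXTENSIONS:
        return False, f"unsupported format: {ext or 'no extension'}"
    if size_bytes > MAX_BYTES:
        return False, "file exceeds 100 MB"
    return True, "ok"
```

For a file on disk, `check_upload(path.name, path.stat().st_size)` with a `pathlib.Path` gives the same answer. An extension check cannot confirm what is actually inside the file, so a renamed or damaged file may still fail at transcription time.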
No. Audio is processed in memory and discarded immediately after transcription, and transcript text is not retained. For paid usage we may keep minimal metadata such as duration, file size, and license-linked usage counts.
Stored only in your browser's local storage. There is no account layer, and checkout email is optional.
No. The 50-credit pack never expires. Monthly plans renew each month and can be cancelled anytime.
xAI's Grok Speech-to-Text API — the same audio stack powering Grok Voice, Tesla in-car voice and Starlink customer support. Read the independent Grok STT vs Whisper vs Deepgram benchmark.
Sometimes. ScribeForge shows speaker labels when the recording has clear speaker separation, but they are not guaranteed on every file. See how timestamped transcripts work.
25+ languages, including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Arabic and Hindi. The language is detected automatically.
The tool comes first. If you want benchmarks, API walkthroughs or format guidance after trying it, start here.
Complete library — from "can Grok transcribe audio?" to step-by-step podcast, YouTube and meeting workflows.