Grok Transcription Online: Audio to Text with xAI STT

Upload any audio file and get an accurate transcript in seconds, powered by xAI's Grok speech-to-text model. Free to try, no account required.

Try Grok transcription free →

50+ languages

Grok STT automatically detects the language and delivers accurate results across major world languages.

Timestamps & speakers

Get segment-level timestamps, plus speaker labels where speakers can be told apart in the audio.

All common formats

MP3, WAV, FLAC, M4A, OGG, OPUS, WEBM, AAC, AIFF. Up to 25MB per file.

No audio stored

Your file is deleted immediately after processing. We never keep a copy of your audio.

What is Grok STT?

Grok STT (speech-to-text) is a large-scale automatic speech recognition model developed by xAI, the AI lab founded by Elon Musk. Released in April 2026, it is one of the most accurate open-endpoint transcription APIs available, with particular strengths in noisy audio, accented speech, and long-form recordings.

ScribeForge is the first dedicated web service to offer Grok transcription online — no need to build your own API integration, sign up for xAI access, or manage API keys.

How to transcribe audio with Grok online

  1. Go to the ScribeForge homepage.
  2. Drag your audio file onto the upload area or click to browse.
  3. Click Transcribe and wait a few seconds.
  4. Copy your transcript or download it as a .txt file.

You get 2 free transcriptions per day without entering any payment details.

Grok STT vs Whisper vs Deepgram

Model             Speed      Accuracy   Languages  Timestamps     Price/min
Grok STT (xAI)    Fast       Very high  50+        Yes (segment)  ~$0.013
Whisper large-v3  Medium     High       99         Yes            ~$0.006
Deepgram Nova-2   Very fast  High       35         Yes (word)     ~$0.0043

Grok STT is particularly strong on English conversational audio and performs well at higher noise levels — making it a compelling choice for meeting recordings, interviews, and podcast content.

Frequently asked questions

Is Grok transcription free?

Yes — ScribeForge offers 2 free Grok transcriptions per day, no credit card required. For higher volumes, buy a 50-credit pack ($9) or subscribe monthly ($19/mo).

What languages does Grok STT support?

Grok STT supports over 50 languages including English, Spanish, French, German, Italian, Portuguese, Japanese, Korean, Chinese, Arabic, Hindi, and many more. Language is detected automatically.

How long can my audio file be?

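Files up to 25MB are supported. For MP3 at 128kbps, that's roughly 25 minutes of audio. Uncompressed WAV fills the limit much faster: at CD quality (16-bit, 44.1kHz, stereo), 25MB holds only a little over 2 minutes. If your file is larger, compress it to MP3 first.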
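
To check whether a recording will fit before you upload it, the estimate above comes down to simple arithmetic. The sketch below is illustrative only: the function name max_minutes is ours, and it assumes a constant bitrate and 1MB = 1,000,000 bytes (the exact byte count behind the 25MB limit is not stated on this page).

```python
def max_minutes(size_limit_mb: float, bitrate_kbps: float) -> float:
    """Approximate minutes of audio that fit in size_limit_mb at a constant bitrate."""
    size_bits = size_limit_mb * 1_000_000 * 8      # assumption: 1MB = 1,000,000 bytes
    seconds = size_bits / (bitrate_kbps * 1000)    # kbps -> bits per second
    return seconds / 60


# Uncompressed PCM WAV bitrate = sample rate x bit depth x channels.
CD_WAV_KBPS = 44_100 * 16 * 2 / 1000               # 1411.2 kbps for CD-quality stereo

print(round(max_minutes(25, 128), 1))              # MP3 at 128kbps -> 26.0
print(round(max_minutes(25, CD_WAV_KBPS), 1))      # CD-quality WAV -> 2.4
```

Variable-bitrate files and container overhead will shift these numbers slightly, so treat them as a rough guide.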

Is my audio private?

Yes. Your audio file is sent to xAI's API for processing and deleted from our servers immediately after transcription. We do not store or log the content of your audio. See our privacy policy.

Can I use this via API?

Yes — with a paid license key you can call POST /api/transcribe directly from your own application. See the API docs on the homepage.
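
As a rough idea of what such a call might look like, here is a minimal sketch using only the Python standard library. Only the POST /api/transcribe path comes from this page. The host name, the "file" form field, the Bearer-style Authorization header, and the JSON response are all assumptions made for illustration, so check the API docs for the real values before relying on any of them.

```python
import json
import mimetypes
import urllib.request
import uuid
from pathlib import Path

# Hypothetical values: replace with those given in the API docs.
BASE_URL = "https://scribeforge.example"   # placeholder host, not a real endpoint
MAX_BYTES = 25 * 1000 * 1000               # 25MB limit; exact byte count is an assumption


def build_request(filename: str, audio: bytes, license_key: str) -> urllib.request.Request:
    """Build a multipart POST to /api/transcribe without sending it."""
    if len(audio) > MAX_BYTES:
        raise ValueError("file exceeds the 25MB limit; compress it to MP3 first")
    boundary = uuid.uuid4().hex
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    body = b"".join([
        f"--{boundary}\r\n".encode(),
        # The form field name "file" is an assumption.
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'.encode(),
        f"Content-Type: {content_type}\r\n\r\n".encode(),
        audio,
        f"\r\n--{boundary}--\r\n".encode(),
    ])
    return urllib.request.Request(
        BASE_URL + "/api/transcribe",
        data=body,
        method="POST",
        headers={
            "Content-Type": f"multipart/form-data; boundary={boundary}",
            # Passing the license key as a Bearer token is an assumption.
            "Authorization": f"Bearer {license_key}",
        },
    )


def transcribe(path: str, license_key: str) -> dict:
    """Upload one audio file and return the parsed response (assumed to be JSON)."""
    p = Path(path)
    req = build_request(p.name, p.read_bytes(), license_key)
    with urllib.request.urlopen(req, timeout=120) as resp:
        return json.load(resp)
```

With real values filled in, a call would look something like transcribe("interview.mp3", "YOUR_LICENSE_KEY"). Checking the file size before building the request avoids uploading a file the service would reject anyway.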