Preview first · Pay only if it works

Use Grok STT, then work the transcript.

Use xAI Grok STT without an API key: drop one audio file and get a transcript you can search, navigate by timestamp, and export right in the browser. No account, 2 free previews/day, up to 100 MB per file.

No API key needed · 2 free previews/day · 100 MB per file · Audio deleted immediately
Try with your file
Search the transcript · Jump to timestamps · TXT + PDF export built in · No account or API key
Quick answer · Updated April 28, 2026

Can Grok (xAI) transcribe audio?

Yes. xAI launched the Grok Speech-to-Text API on April 18, 2026. ScribeForge lets you use Grok STT in the browser without touching an API key: drop an audio file, search the transcript, jump to any timestamp, and export only if it is worth keeping. Speaker labels appear when the recording allows clear separation.

Transcribe now

Upload one file. Review the transcript right below.

Best fit for meetings, podcasts, interviews, voice notes and quick research clips. Search the transcript, tap a segment to jump to that point in the audio, then unlock the full export only if it is worth keeping.

Meetings · Podcasts · Voice notes · Interviews

Drop audio here or tap to browse
Transcript, search, timestamp jump and export · MP3 · WAV · FLAC · M4A · OGG · OPUS · WEBM · AAC · AIFF · MP4 · max 100 MB
Already purchased a plan?

Simple pricing

No signup flow, no account layer. If you pay, you get a key immediately.

Free
$0
Check the workflow before you pay anything.
  • 2 free previews per day
  • Up to 100 MB audio
  • 25+ languages, auto-detected
  • Timestamps, with speaker labels when available
  • No account or API key
Monthly
$19/mo
200 transcriptions/day. Cancel anytime.
  • Up to 200 transcriptions per day
  • Up to 100 MB per file
  • Timestamps, with speaker labels when available
  • No per-use credit tracking
  • Cancel anytime
~$0.63/day

How it works

1. Drop your file

Any supported audio format up to 100 MB. No signup wall.

2. Grok transcribes

xAI Grok STT transcribes the audio in 25+ languages, detected automatically.

3. Search, jump, export

Review the transcript in place, jump by timestamp, then copy or export when you are ready. Speaker labels appear when the recording allows clear separation.

4. Upgrade when ready

Buy a credit pack or subscribe. Your key is delivered instantly by email.


FAQ

What audio formats are supported?

MP3, WAV, FLAC, M4A, OGG, OPUS, WEBM, AAC, AIFF and MP4. Maximum 100 MB per file. See the complete Grok audio formats guide for quality tips and conversion instructions.
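
If you want to check a file against these limits before uploading, a minimal sketch might look like the one below. The helper name `check_upload` is hypothetical and not part of ScribeForge, and reading "100 MB" as 100 × 1024 × 1024 bytes is an assumption, since the page does not say which definition applies.

```python
from pathlib import Path

# Formats listed in the FAQ answer above.
SUPPORTED_EXTENSIONS = {
    "mp3", "wav", "flac", "m4a", "ogg",
    "opus", "webm", "aac", "aiff", "mp4",
}

# Assumption: "100 MB" means 100 * 1024 * 1024 bytes.
MAX_BYTES = 100 * 1024 * 1024


def check_upload(filename: str, size_bytes: int) -> list[str]:
    """Hypothetical pre-upload check.

    Returns a list of problems; an empty list means the file
    matches the listed formats and the size limit.
    """
    problems = []
    # Compare the extension case-insensitively, without the leading dot.
    ext = Path(filename).suffix.lower().lstrip(".")
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {ext or 'no extension'}")
    if size_bytes > MAX_BYTES:
        problems.append("file is larger than 100 MB")
    return problems
```

This only inspects the file name and size. It does not verify that the file contents are valid audio, so a passing check is a hint rather than a guarantee that the upload will be accepted.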

Do you store my audio?

No. Audio is processed in memory and discarded immediately after transcription, and transcript text is not retained. For paid usage we may keep minimal metadata such as duration, file size, and license-linked usage counts.

What happens to my license key?

Stored only in your browser's local storage. There is no account layer, and checkout email is optional.

Do credits expire?

No. The 50-credit pack never expires. Monthly plans renew each month and can be cancelled anytime.

What AI model powers this?

xAI's Grok Speech-to-Text API — the same audio stack powering Grok Voice, Tesla in-car voice and Starlink customer support. Read the independent Grok STT vs Whisper vs Deepgram benchmark.

Does it detect multiple speakers?

Sometimes. ScribeForge shows speaker labels when the recording has clear speaker separation, but they are not guaranteed on every file. See how timestamped transcripts work.

What languages are supported?

25+ languages, including English, Spanish, French, German, Italian, Portuguese, Chinese, Japanese, Arabic and Hindi. The language is detected automatically.

Developer Guides

Go deeper when you need it

The tool comes first. If you want benchmarks, API walkthroughs or format guidance after trying it, start here.

  • STT Comparison: Grok STT vs Whisper vs Deepgram in 2026
  • Developer Guide: xAI Grok STT & TTS API: Developer Guide
  • xAI API: xAI Grok TTS Voices: Eve, Ara, Rex, Sal, Leo
  • Audio Formats: Grok STT: Supported Audio Formats