Guides on AI transcription and xAI Grok audio APIs.
Direct answer with supported formats, limits, and a free in-browser tool. No code required.
MP3, WAV, FLAC, M4A, OGG, MP4, WebM, AAC — full list with quality tips and current upload-limit notes.
Inside the xAI Grok STT API: 25+ languages, word-level timestamps, speaker diarization, and accuracy benchmarks.
xAI API pricing, the 500 MB vs 100 MB limit split, and when the no-API-key browser workflow is the better option.
We tested all three on 6 audio conditions. Full WER benchmarks, latency, and pricing breakdown.
Character profiles, use cases, and best speed settings for all 5 Grok TTS voices (xAI API).
Working Python code for transcription, timestamps, error handling, and cost estimation.
Export, compress, upload, get text. Plus what to do with the transcript once you have it.
Step-by-step from audio export to SEO-ready transcript page. With tool comparison table.
Drag your MP3, get a transcript with timestamps in seconds. Supports 8 audio formats. Free preview, no account.
Download audio with yt-dlp, transcribe with ScribeForge. Full workflow including splitting long videos.
Export from iPhone Voice Memos, Android recorder, or WhatsApp. Transcribe in your mobile browser.
Every transcript includes phrase-level start and end times. Useful for subtitles, show notes, and citations.
How to generate AI speech with xAI Grok TTS voices (Eve, Ara, Rex, Sal, Leo) and download as MP3 via API.
What speaker diarization actually does, how timestamps work, and how to get labeled output for interviews, meetings, and podcasts.