xAI API · April 2026

xAI Grok TTS Voices: Eve, Ara, Rex, Sal, Leo — A Complete Guide

By ScribeForge · April 18, 2026 · 5 min read
Note: ScribeForge offers audio transcription (STT) only. This article covers the xAI Grok TTS API — you can use it directly in your own code. Try ScribeForge STT →

xAI's Grok TTS API ships with five named voices, each with a distinct character. Whether you're building a podcast intro, an audiobook, a product demo, or a customer-facing IVR system, the voice you pick changes how your message lands. Here's a complete breakdown of all five, with samples you can generate yourself.

At a glance

EveWarm, natural
AraCalm, clear
RexBold, assured
SalCrisp, neutral
LeoDeep, measured

Voice profiles

Eve Female · Warm

Eve is the default voice and for good reason — she sounds the most natural for general-purpose content. Her pacing is conversational, with subtle warmth that makes long-form listening comfortable. She doesn't sound robotic even at higher speeds.

Ara Female · Calm

Ara is softer and more measured than Eve. She works especially well for meditation apps, wellness content, or any context where you want the voice to feel non-intrusive. Think "NPR afternoon show."

Rex Male · Bold

Rex has presence. He sounds like a confident presenter — authoritative without being aggressive. Great for product launches, YouTube intros, and B2B explainers where you want the audience to feel they're learning from someone who knows what they're talking about.

Sal Neutral · Crisp

Sal is the most neutral voice in the lineup — least distinguishable gender, clinical diction, very little emotional colouring. This makes Sal ideal for professional contexts where you need the voice to "disappear" and let the content take centre stage.

Leo Male · Deep

Leo has the lowest pitch of the five voices, with a measured, almost cinematic quality. He sounds like a documentary narrator. At slower speeds he becomes very gravitas; at normal speed he's natural and trustworthy.

Comparison table

VoiceGenderToneBest speedPrimary use case
EveFemaleWarm, natural0.9–1.3×General purpose, audiobooks
AraFemaleCalm, soft0.9–1.1×Wellness, education
RexMaleBold, assured1.0–1.5×Sales, presentations
SalNeutralCrisp, clinicalAnyProfessional, IVR
LeoMaleDeep, cinematic0.9–1.0×Documentary, premium brand

Try Grok STT without an API key

ScribeForge gives you browser access to xAI's Grok speech recognition with no API key, no account, and no setup. Upload any audio file and get an instant 200-character transcript preview, with 2 free previews per day.

  1. Go to ScribeForge.
  2. Upload your audio file (MP3, WAV, M4A, OGG, and more).
  3. Click Transcribe and get your text with timestamps.
  4. Copy or download the transcript.

Grok TTS vs ElevenLabs vs OpenAI TTS

Grok TTS is newer and has fewer voices than ElevenLabs (which offers 1000+ cloned voices) but it's significantly cheaper and requires no subscription to try. OpenAI's TTS model (Alloy, Echo, Fable, Onyx, Nova, Shimmer) is the closest competitor — similar quality and similar pricing, but without xAI's ecosystem integration.

If you need voice cloning or a massive library of accents, ElevenLabs is still the market leader. If you need a simple, high-quality API voice for a project and you're already using xAI's LLM — Grok TTS is the natural choice.

Related reading

Transcribe audio with xAI Grok STT — free, no account, no API key.

Try free transcription →

No account  ·  No credit card  ·  Try free