04/30/2026 | News release | Distributed by Public on 04/30/2026 21:33
Today, we're introducing Custom Voices. Clone your voice from a few seconds of audio and use it instantly across Grok Text to Speech and Voice Agent APIs.
Tyler
SpaceX Broadcast Host
Alongside Custom Voices, the new Voice Library gives your team a single place to browse, preview, and manage all your voices from the xAI console.
Custom Voices unlock a new class of applications.
I need help with my recent order.
Of course! Let me pull up your order details.
Brand Voice Agents
Give your customer support agent a consistent, recognizable voice that matches your brand identity, not a generic preset.
In today's episode we dive deep into the future of AI and what it means for creators everywhere
Content Creators
Narrate videos, podcasts, and social posts in your own voice at scale, without re-recording every time.
Accessibility
Create personalized voices for individuals who have lost the ability to speak, preserving their vocal identity.
English
Spanish
French
German
Chinese
Japanese
Multilingual Teams
Deliver your CEO's keynote in every major language - naturally in English, Spanish, French, German, Chinese, Japanese, and more.
Narrator The ancient door creaked open...
Kira We need to move. Now.
Thane I have a bad feeling about this.
Gaming & Entertainment
Bring characters to life with unique voices without scheduling studio time for every line of dialogue.
Chapter 3
The Discovery
She opened the notebook and found the handwriting unmistakably her own though she had no memory of writing it
Podcasts & Audiobook Narration
Make your narrative engaging. Turn scripts into full audiobooks narrated in your own voice, chapter by chapter, without stepping into a studio.
I need help with my recent order.
Of course! Let me pull up your order details.
In today's episode we dive deep into the future of AI and what it means for creators everywhere
English
Spanish
French
German
Chinese
Japanese
Narrator The ancient door creaked open...
Kira We need to move. Now.
Thane I have a bad feeling about this.
Chapter 3
The Discovery
She opened the notebook and found the handwriting unmistakably her own though she had no memory of writing it
Brand Voice Agents
Give your customer support agent a consistent, recognizable voice that matches your brand identity, not a generic preset.
Clone your voice in under two minutes. Use it everywhere.
Record about a minute of natural speech in the xAI console. Our pipeline verifies you're the voice owner, processes your recording, and delivers a production-ready voice model, all in under two minutes. Your custom voice inherits every TTS capability: speech tags, multilingual output, and both REST and WebSocket streaming.
My voice is my key
Step 1 Read a passphrase aloud to confirm your identity
Custom voices work everywhere our built-in voices do. Pass the to any TTS endpoint or use it with the Voice Agent API for real-time conversational agents.
Every custom voice goes through a two-stage verification process before it can be created. First, the speaker reads a verification phrase that our STT engine transcribes and matches in real time, confirming intent and presence. Then we compute speaker embeddings from the verification clip and the full recording to confirm they belong to the same person.
You can't clone a voice from a pre-existing recording, and you can't clone someone else's voice.
My voice is my key
Passphrase Check
Read a verification phrase aloud. Our STT engine transcribes and matches it in real time, verifying your consent and presence.
Speaker Similarity
Speaker embeddings from the passphrase and the full recording are compared to confirm they belong to the same person.
The Voice Library is a new section in the xAI console that organizes every voice available to your team, with your custom creations alongside our built-in voices. Browse, preview, and manage voices from a single page.
We've expanded our built-in voice catalog to over 80 voices across 28 languages. Listen to any voice across different scenarios before choosing one for your application.
There is no extra charge to use Text to Speech or Voice Agent APIs with custom voices.