Voice Settings
Voice Settings give you full control over how your Delphi sounds.
What it is
Start with the sliders: Stability smooths or animates delivery, Similarity matches or polishes your sample, and Speed tweaks the pace.
Next, pick a Voice Model: Default for balanced clarity, For Accents 1 for the most authentic accent, or For Accents 2 for the fastest real-time response.
Finally, lock in tricky names with Custom Pronunciations so Delphi always says them right.
✅ Tip — Test in a live call: The Playground is great for quick spot-checks, but the truest preview is to start an actual voice call with your Delphi. Live calls mirror exactly what your audience will hear.
Adjust your Delphi's voice settings here →
Go to the voice settings page if you're wondering...
How do I change how my Delphi sounds on calls?
How can I get my Delphi to better represent my accent?
How do I slow down or speed up the voice?
How do I make my Delphi pronounce a name correctly?
Explained in Detail...
Voice Sliders
To adjust voice settings for every call, click the gear (⚙️) in the top-right corner of the Voice page. These global settings apply to all live voice and video sessions.
Want to experiment first? Open Voice Playground. The same sliders appear under the Generate tab, so you can test scripts (ad reads, podcasts, book excerpts, or things your Delphi would actually say on a live call) without touching your global settings. Changes become permanent only if you choose Apply settings to my Delphi.
Stability (0–100%)
Low: Highly animated voice with big swings in pitch and volume, from intense to whispered.
High: Locked-in broadcaster tone that is more monotone, with the same volume throughout.
Raise it for: Long reads that need consistency.
Lower it for: Role-play or emotional storytelling.
Similarity (0–100%)
Low: Studio-polished synthesis with background hiss removed and quirks smoothed out.
High: Carbon copy of your raw sample, with every breath, accent edge, and mic artifact preserved.
Raise it for: Brand voice that must sound exactly like you.
Lower it for: Noisy or low-quality original sample.
Speed (0.7×–1.2×)
Low: 0.7× = 30% slower for clarity.
High: 1.2× = 20% faster for snappy updates.
Raise it for: Quick Q&A sessions.
Lower it for: Dense explanations, language learners.
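To get a feel for what the Speed multiplier means in practice, here is a rough sketch that estimates how long a clip runs at a given setting. It assumes speed scales the playback rate linearly, so the new length is the normal length divided by the multiplier. `adjusted_duration` is a hypothetical helper for illustration, not part of Delphi.

```python
def adjusted_duration(base_seconds: float, speed: float) -> float:
    """Estimate clip length at a speed multiplier (assumes linear scaling)."""
    # The slider range described above is 0.7x to 1.2x.
    if not 0.7 <= speed <= 1.2:
        raise ValueError("speed must be between 0.7 and 1.2")
    return base_seconds / speed


# A read that takes 60 seconds at 1x:
print(round(adjusted_duration(60, 0.7), 1))  # 85.7
print(round(adjusted_duration(60, 1.2), 1))  # 50.0
```

Under that assumption, a one-minute read stretches to roughly 86 seconds at 0.7× and shrinks to about 50 seconds at 1.2×.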
Quick rules of thumb
Stability: Drop by 10% if calls feel flat; raise it if the voice sounds chaotic.
Similarity: Keep below 60% unless your sample was recorded in studio-grade quiet.
Speed: Adjust in 0.05× steps to avoid sounding rushed or sluggish.
Click Reset Settings anytime for a clean slate: 50% stability, 75% similarity, 1× speed.
A green check in the bottom right corner confirms all changes.
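If it helps to think of the three sliders as plain data, the sketch below models them with the ranges, reset defaults, and step sizes described above. `VoiceSettings`, `clamp`, and the `nudge_*` methods are hypothetical names for illustration only; this is not Delphi's API, just a way to reason about the numbers.

```python
from dataclasses import dataclass


def clamp(value, low, high):
    """Keep a value inside the slider's allowed range."""
    return max(low, min(high, value))


@dataclass
class VoiceSettings:
    # Defaults mirror the Reset Settings values from this guide.
    stability: int = 50    # percent, 0-100
    similarity: int = 75   # percent, 0-100
    speed: float = 1.0     # multiplier, 0.7-1.2

    def __post_init__(self):
        self.stability = clamp(self.stability, 0, 100)
        self.similarity = clamp(self.similarity, 0, 100)
        self.speed = round(clamp(self.speed, 0.7, 1.2), 2)

    def nudge_stability(self, delta: int = -10) -> "VoiceSettings":
        """Rule of thumb: drop by 10 if calls feel flat, raise if chaotic."""
        return VoiceSettings(self.stability + delta, self.similarity, self.speed)

    def nudge_speed(self, steps: int) -> "VoiceSettings":
        """Rule of thumb: move speed in 0.05 increments."""
        return VoiceSettings(self.stability, self.similarity,
                             self.speed + 0.05 * steps)


defaults = VoiceSettings()                 # same as a reset: 50 / 75 / 1.0
livelier = defaults.nudge_stability(-10)   # stability becomes 40
quicker = defaults.nudge_speed(2)          # speed becomes 1.1
```

Each nudge returns a new object rather than changing the original, which loosely mirrors how Playground experiments leave your global settings alone until you apply them.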
Voice Models
Choose the engine whose accent and tone you like best:
Default: Balanced clarity; smooths minor accent edges.
For Accents 1: Most accurate accent reproduction, but a bit slower than For Accents 2.
For Accents 2: The faster of the two accent models; keeps the overall accent flavor but may miss fine nuances.
Try this: Switch the model first, then nudge Stability or Similarity in 10-point steps to zero in on your perfect sound.
Custom Pronunciations
Make your Delphi say names and jargon exactly the way you do.
Click Open.
Click Add a word.
Enter the word (e.g., “Delphi”).
Spell it phonetically (e.g., “DEL-f-eye”).
Hit Add. Look for the green check in the bottom right corner.
Test it in Voice Playground; tweak spelling if it sounds off.
To delete an entry, click the trash can, then look for the green check in the bottom right corner.
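Conceptually, a custom pronunciation list works like a small lookup table: each word you add is paired with the phonetic spelling you want spoken. The sketch below illustrates that idea with a simple whole-word, case-insensitive substitution. How Delphi actually applies your entries is not described in this guide, so treat `apply_pronunciations` as a hypothetical illustration only.

```python
import re


def apply_pronunciations(text: str, entries: dict[str, str]) -> str:
    """Swap each listed word for its phonetic spelling (whole words only)."""
    for word, phonetic in entries.items():
        # \b keeps "Delphi" from matching inside longer words like "Delphic".
        pattern = re.compile(rf"\b{re.escape(word)}\b", re.IGNORECASE)
        # A function replacement avoids special handling of backslashes.
        text = pattern.sub(lambda match: phonetic, text)
    return text


entries = {"Delphi": "DEL-f-eye"}
print(apply_pronunciations("Welcome to Delphi.", entries))
# Welcome to DEL-f-eye.
```

This is also why step 6 matters: the phonetic spelling is only a hint, so listening to the result and adjusting the spelling is the reliable way to get it right.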