"Humanlike & realtime voice agents, powered by a re-engineered Sesame CSM-1B"
TLDR: Make your voice agents way more realistic and stop getting hangups. See it in action, or talk to it yourself: call their AI Trump voice agent at (510) 315-0014, or visit vogent.ai/sesame
Vogent powers millions of voice AI phone calls for healthcare, customer support, travel, and more.
Today, they are launching the most realistic realtime voice engine ever. It inserts natural pauses and disfluencies (“um”, “uh”) without being explicitly prompted, and is uncannily good at cloning voices. Check it out in the videos below:
If you’ve been online recently, you’ve probably come across Sesame’s super humanlike voice AI, and their open-source launch of CSM-1B, a 1B-parameter text-to-speech model.
The team has spent the past couple of weeks rearchitecting CSM-1B from the ground up to support realtime, low-latency inference, and they have finally cracked it. The results are pretty insane: a step-function increase in realism (customer hangup rates have decreased by 60%+), and they made it available out of the box.
✨ If you want to create your own Sesame agents, sign up at app.vogent.ai, and select one of their Sesame voices. You can also use Sesame to clone new voices, as long as you have a 10-15 second reference clip.
📅 They are also releasing a realtime Sesame TTS soon; email the founders here for beta access.