I compared Sesame to ChatGPT voice mode and I’m unnerved

Trying the new voice assistant from AI startup Sesame was the first time I momentarily forgot I was talking to a bot.

Compared to ChatGPT’s voice mode, Sesame’s “Conversational Voice” feels natural, unforced, and engaging, which totally freaked me out.

On Feb. 27, Sesame launched a demo for its Conversational Speech Model (CSM), which aims to create more meaningful interactions with AI chatbots. “We are creating conversational partners that do not just process requests [...],” the company said. “In doing so, we hope to realize the untapped potential of voice as the ultimate interface for instruction and understanding.”

Sesame’s voice assistant is available as a free demo on its site and comes in two voices: Maya and Miles.

Since Sesame unleashed its voice assistant demo, users have reported awestruck reactions. “I’ve been into AI since I was a child, but this is the first time I’ve experienced something that made me definitively feel like [...],” user SOCSCHAMP wrote on Reddit.

“Sesame is about as close to indistinguishable from a human that I’ve ever experienced in a conversational AI,” user Siciliano777 wrote on Reddit.

After talking to Sesame’s bot, I was similarly wowed. I talked to the Maya voice for about 10 minutes about the ethics of using AI as a companion and came away feeling [...]. Maya’s speech had a natural cadence, using interjections like “you know” and “hm,” and even making tongue-clicking and inhaling sounds.

The strongest impression I got from interacting with Maya was that she immediately asked questions, engaging me in the conversation. The bot started our conversation by asking how my Wednesday morning was going. (Note: it was indeed a Wednesday morning.) [...] good or bad thing, but it intrinsically shaped the conversation as me using ChatGPT as a tool for something I needed.

Maya asked about the risks of AI companions getting “too good at being human.” When I told her I was concerned about the rise of more sophisticated scams and people losing touch with reality by replacing humans with bots, she responded thoughtfully and pragmatically. “Scammers are gonna scam, that’s a given. And as for the human connection thing, maybe we need to learn how to be better companions, not replacements, you know, the kind of AI friends who will make you want to go out and do stuff with real people,” said Maya.

When I had a similar conversation with ChatGPT, I received a response that felt more like boilerplate language from a school guidance counselor: “That’s a valid concern. [...] technology with real human interactions.”

While OpenAI pioneered voice mode’s ability to be interrupted and have a more fluid back-and-forth conversation, ChatGPT still tends to respond in complete sentences and paragraph blocks, which sounds robotic. When using ChatGPT voice mode, I never forget that I’m talking to a bot, and that’s reflected in the conversation, which can feel stilted and forced.

By comparison, AI for Humans podcast co-host Gavin Purcell posted a Sesame conversation on Reddit where it’s practically impossible to distinguish which voice is the bot. Purcell prompted the Miles voice by telling it to act like an angry boss.

A very silly conversation followed about money laundering, bribery, and a mysterious incident in Malta. Miles didn’t miss a beat. There was no perceptible latency, and the bot remembered the context of the conversation and creatively advanced the improvisational argument by escalating, calling Purcell “delusional,” and firing him.

Of course, there are some limitations. Maya’s voice glitched a few times throughout our conversation, and it didn’t always get the syntax right, like saying, “It’s a heavy talk that come.”

According to its technical paper, Sesame trained its CSM (based on Meta’s Llama model) [...] acoustic tokens, decreasing latency. OpenAI similarly used this multimodal approach to training voice mode. However, it has never released a dedicated technical paper on voice mode’s inner workings; it only discusses voice mode in the GPT-4o research.

Knowing this, it’s surprising how much better Sesame’s model is at conversational dialogue. However, Sesame’s launch is just a demo, so it merits further scrutiny when the full model comes out. According to the demo announcement, Sesame plans to open source its model “in the coming months” and expand to over 20 languages.
