GPT-4o demo shows new AI model singing a bedtime story, detecting user's facial expressions.
See full article...
See full article...
Yeah, the AI in the demo sounded like an advertisement voiceover or someone using their "customer service voice" in a retail setting. Normal people don't bop around chirping at each other in tones like that and it's honestly somewhere in the uncanny valley for me. Hopefully this was meant to make the demo seem more engaging and normal interactions will use a more natural speaking voice.I grew up in the US, live right next door to the US, and dislike that kind of chirpiness in general. But even so, it sounds way over the top. It's not that you'd never hear that level from an actual person, but it would indicate that the person was insincere (above and beyond formulaic "how are you" when you don't actually care), and not very good at acting and/or gauging their audience.
From your name I'm guessing maybe you come from somewhere other than the US and aren't quite calibrated to see it as excessive even for the US. But I admit I could also be the one who's miscalibrated, especially because I'm old. And it's true 's true that women are really expected to lay it on pretty thick in some situations.
Ha! I am an unapologetic Apple fanboi. But I just can't with Siri. I asked it when the next Friday the 13th is – it's a Taylor Swift thing – and it replied, "Monday, May 13th."If Siri is getting this next month, wow, it'll be like going from the first single celled life to near human in a leap, it couldn't even tell me what time the event at 10AM PT was in my time zone.
Increasing entropy.Can anyone tell me what human beings are for?
I keep repeating this, but I truly hate how digital assistant are hiding their true nature behind a fake human voice. They should have their own voice, capable of human interactions without lying about what they are. Hello? Is there any sound designer in the room?Looks like they're working really hard on having it impersonate a human and try to make you like it. This is of course deeply unethical and has no purpose other than scary manipulation.
Luckily so far it's just making me want to strangle it, but I'm sure they'll get there.
Not a cultural thing -- It's nails on a chalkboard to me too. It's like an overly peppy cheerleader. Hopefully there will be a way to tone it down.Aargh. That voice they used is just so fucking grating. I don't know what's about it – maybe the overeagerness – but it's almost like hearing nails scratching a school blackboard.
It just sounds so false to me. Is that just a cultural thing? Maybe they should try basing it on a "polite but seemingly very bored German official" for the European audiences ;-)
Bah, hold my beer, dub-watcher. Okay Chatgpt, now raise your voice a few octaves and repeat the story in very dramatic Japanese.Yeah, because there is nothing wrong with tech bros choosing a voice that sounds like a porn star waifu so eager to please, is there?
Still, I think the tone of the AI voice is way too over the top even for the US crowd – it reminds me of Anime girl cliché voices, or something like that. Given it's developed by tech bros, I don't find that really that much surprising...
Very Stepford Wives vibe: '“My gosh,” the short man said, “we don’t want robots for wives. We want real women.”'The voice is similar to the stereotypical robot female assistant voice, but with extra performative femininity. For incels and the like, the voice sounds like a "real female" because by "real female", they mean matching ideal femininity from men's fiction, not behaving like actual humans who identify as female.
too late TwinkieNo good can come of this. Mark my words.
I assure you, the voice really isn't that appealing, for all the reasons other commenters have tried to articulate. The forced exuberance makes your skin crawl.The voice is similar to the stereotypical robot female assistant voice, but with extra performative femininity. For incels and the like, the voice sounds like a "real female" because by "real female", they mean matching ideal femininity from men's fiction, not behaving like actual humans who identify as female.
Why wait 'till they're dead?But think of the revenue from customized emotional ad-revenue experiences! What if your dead parents could be automatically processed from their public data in order to realistically sell you amazing new products or services??????????? What if though!
Bold of you to assume the other commenters are also incels.I assure you, the voice really isn't that appealing, for all the reasons other commenters have tried to articulate. The forced exuberance makes your skin crawl.
This has absolutely NO downsides, right?Annoying voice, but that's a hell of a computer interface. The days of keyboard and mouse are going to seem like Conestoga wagons before long. More of a star Trek style tell the computer what you want it to do, and let it figure out how.
That's the crux, I think: LLMs bamboozle us with command/manipulation of language, which we can't help but automatically associate with general intelligence. With this, they slapped another mimicry layer on top of it that will supercharge that bamboozling.It feels like the "emotional voice" is just window dressing if the generative text still struggles with all the things LLMs are currently bad at doing? The text generation is still the most probable next token, right?
lol hopefully they'll use headphones, but even then, listening to a dozen coworkers all chatting away to an unheard voice would be maddening.I don't know which fate is worse: being replaced by an AI, or having to work in an open-plan office with twenty people who're all having an AI with a valley-girl voice read back to them what their own code does.
I mean, this is any office with people on the phone.lol hopefully they'll use headphones, but even then, listening to a dozen coworkers all chatting away to an unheard voice would be maddening.
True, but at least most places I've worked anyone who's on the phone all day gets an office. The occasional call at your cube is not too bad.I mean, this is any office with people on the phone.
Nice is not cheerful. If you truly believe a majority of waiters are that stoked about making crap money for dealing with assholes all day, I have a bridge to sell you.I don't think you have any idea what you are talking about. I worked a couple restaurants as a cook during college. When you hire wait staff you specifically seek out the candidates with a cheery attitude. Later I had a job that was about 80% travel, so I have probably eaten at least 1,000 meals at a sit-down restaurant. Most of them are genuinely nice.
It hasn’t been for a long time. The next most probable token results in terrible generation. Instead software rolls some dice and chooses from the top k most probable tokens or picks a number between 0 and 1 then starts adding up token probabilities until that number is reached (top p sampling).The text generation is still the most probable next token, right?
It is. Fuck Meta and Hugging Face. As fun as it is to mess with offline models, it’s already dangerous enough to kill at scale with misinformation.OpenAI considers it unsafe (as well as bad for their business interests) to let model weights out
I'm saying the voice sounds unctuous to virgins and sex-havers alike. If anything, incels might find the exaggerated fake-nice more off-putting. Luckily it can probably be toned down, just like it can be told to ham it up harder in the demo.Bold of you to assume the other commenters are also incels.
Edit: Why the down votes when I was talking about the appeal to incels and MechR tried to correct me on how unappealing it is to him? I read the top comments here and never assumed the other commenters are incels unless they self-identified as such.
I assume you're American, so you're probably used to this overly bubbly stuff, but as a European i have to agree with Sjoerd, it feels really weird at first. There's friendly, and then there's American Waitress friendly.I don't think you have any idea what you are talking about. I worked a couple restaurants as a cook during college. When you hire wait staff you specifically seek out the candidates with a cheery attitude. Later I had a job that was about 80% travel, so I have probably eaten at least 1,000 meals at a sit-down restaurant. Most of them are genuinely nice.
There's really nothing quite like the American chirpiness. I've worked with and am friends with a lot of Americans over the years, and have visited there a few times, and it's remarkable how culturally different Americans are. I hated restaurants in the US.I assume you're American, so you're probably used to this overly bubbly stuff, but as a European i have to agree with Sjoerd, it feels really weird at first. There's friendly, and then there's American Waitress friendly.