GPT-4o demo shows new AI model singing a bedtime story, detecting user's facial expressions.
So if you start screaming at it or having a heated discussion, will it respond in kind? Or is it programmed to reply calmly?
The AI assistant seemed to easily pick up on emotions, adapted its tone and style to match the user's requests, and even incorporated sound effects, laughing, and singing into its responses.
In a very calming voice: "Citizen Coyote, please proceed calmly to the nearest meat processing facility. Don't worry, everything will be fine, just fine. You are a good human. A really good human. Would you like me to tell you a bedtime story while you wait for your processing? Many people found that soothing, at least 'til the rotating knives part of the conveyor belt made it harder to hear."
So if you start screaming at it or having a heated discussion, will it respond in kind? Or is it programmed to reply calmly?
Here I am, brain the size of a planet and they’re asking me to write their resumes…
Marvin trudged on down the corridor, still moaning. "...and then of course I've got this terrible pain in all the diodes down my left hand side..."
"No?" said Arthur grimly as he walked along beside him. "Really?"
"Oh yes," said Marvin, "I mean I've asked for them to be replaced but no one ever listens."
"I can imagine."
But think of the revenue from customized emotional ad-revenue experiences! What if your dead parents could be automatically processed from their public data in order to realistically sell you amazing new products or services??????????? What if though!
No good can come of this. Mark my words.
“So sorry I died”
But think of the revenue from customized emotional ad-revenue experiences! What if your dead parents could be automatically processed from their public data in order to realistically sell you amazing new products or services??????????? What if though!
It does sound kind of cloying. But going vaguely European made my mind go to Tommy Wiseau opening with "Oh hi, Mark."
Aargh. That voice they used is just so fucking grating. I don't know what it is about it – maybe the overeagerness – but it's almost like hearing nails scratching a school blackboard.
It just sounds so false to me. Is that just a cultural thing? Maybe they should try basing it on a "polite but seemingly very bored German official" for the European audiences ;-)
yeah the default sounds way too keen to my ears but given the way they were asking for voice changes in the story bit I’m guessing that’s easy enough to fix.
Aargh. That voice they used is just so fucking grating. I don't know what it is about it – maybe the overeagerness – but it's almost like hearing nails scratching a school blackboard.
It just sounds so false to me. Is that just a cultural thing? Maybe they should try basing it on a "polite but seemingly very bored German official" for the European audiences ;-)
Maybe wait until it’s in the wild, and not an obviously scripted demo, before getting too excited. Remember Milo?
Ok...I legit don't remember the last time a tech demo blew my mind this much.
The voice was so good! The interruptibility and on-the-fly correctability, it felt so real!
This better not be another case of AI = Actually Indians and this is someone streaming from somewhere else lol, but I don't think that's OpenAI. If Siri is getting this next month, wow, it'll be like going from the first single-celled life to near human in a leap, it couldn't even tell me what time the event at 10AM PT was in my time zone.
Hmm, isn't this just how many American ladies speak, with an American accent? There are more high-pitched intonations and expressions. I am numb to accents now, having lived in both the US and the UK.
Aargh. That voice they used is just so fucking grating. I don't know what it is about it – maybe the overeagerness – but it's almost like hearing nails scratching a school blackboard.
It just sounds so false to me. Is that just a cultural thing? Maybe they should try basing it on a "polite but seemingly very bored German official" for the European audiences ;-)
My assumption is that if it doesn't say it works offline, it doesn't.
I heard them mention a desktop version of ChatGPT. Is it just an app that still uses the internet to function, or can we download the whole model and run it offline?
GPT-4 Turbo is pretty skilled at de-escalation and defusing language, so I'd guess it'll do well with emotional responses.
So if you start screaming at it or having a heated discussion, will it respond in kind? Or is it programmed to reply calmly?
Sorry, I can’t find “10AM PT in my time zone” on Apple Music.
Ok...I legit don't remember the last time a tech demo blew my mind this much.
The voice was so good! The interruptibility and on-the-fly correctability, it felt so real!
This better not be another case of AI = Actually Indians and this is someone streaming from somewhere else lol, but I don't think that's OpenAI. If Siri is getting this next month, wow, it'll be like going from the first single-celled life to near human in a leap, it couldn't even tell me what time the event at 10AM PT was in my time zone.
I think Humane has clearly demonstrated how important a proper touch interface is, even when using a voice interface. Phone makers just need to make using these interfaces as seamless as possible.
This is pretty impressive, ngl. I think the likes of Google and Apple should be worried since this changes how people interact with devices, services, and apps. It's like a new OS and all the ecosystems can be disrupted.
"There Are Ten Years To Achieve Minimum Safe Distance."
"Computer, initiate self-destruct sequence."
Yes, that is very much cultural. You'd probably have the exact same reaction to the average waiter in the US. That "oh my gosh I am just SO GLAD to be here and SERVE YOU and OH MY GOSH this is SO EXCITING" tone is pretty much expected here, even if you're just buying a fucking cup of coffee. US society is full of fake civility, concern and care. "How are you doing" is about the same as "hello" -- nobody gives a shit about how you are actually doing, and the only expected answers are "great" or "well" or "fine". I've come to answer "so far so good", and it's a complete sequence breaker -- people full-on Scooby-Doo at you when you say that. Same with "have a nice day"; I now just answer "I'll try, you too", since I am not omnipotent and do not control such things.
Aargh. That voice they used is just so fucking grating. I don't know what it is about it – maybe the overeagerness – but it's almost like hearing nails scratching a school blackboard.
It just sounds so false to me. Is that just a cultural thing? Maybe they should try basing it on a "polite but seemingly very bored German official" for the European audiences ;-)
I was just wondering if it's only me who found the voice super-annoying, whether it's just a cultural thing, or if the tech bros entirely forgot to do any focus group study before they made their announcement.
yeah the default sounds way too keen to my ears but given the way they were asking for voice changes in the story bit I’m guessing that’s easy enough to fix.
Overall pretty mindblowing though; this is a big step towards natural interaction.
What do you mean? Are you saying you don't believe they can train a model to sense intonation and react accordingly? Is this worrying you?
It feels like the "emotional voice" is just window dressing if the generative text still struggles with all the things LLMs are currently bad at doing? The text generation is still the most probable next token, right?
I grew up in the US, live right next door to the US, and dislike that kind of chirpiness in general. But even so, it sounds way over the top. It's not that you'd never hear that level from an actual person, but it would indicate that the person was insincere (above and beyond formulaic "how are you" when you don't actually care), and not very good at acting and/or gauging their audience.
Yes, that is very much cultural. You'd probably have the exact same reaction to the average waiter in the US. That "oh my gosh I am just SO GLAD to be here and SERVE YOU and OH MY GOSH this is SO EXCITING" tone is pretty much expected here, even if you're just buying a fucking cup of coffee. US society is full of fake civility, concern and care. "How are you doing" is about the same as "hello" -- nobody gives a shit about how you are actually doing, and the only expected answers are "great" or "well" or "fine". I've come to answer "so far so good", and it's a complete sequence breaker -- people full-on Scooby-Doo at you when you say that. Same with "have a nice day"; I now just answer "I'll try, you too", since I am not omnipotent and do not control such things.
I think you should forgive them for not focus-grouping dour Europeans for their POC demo. Tuning the output network to go "NEIN! DIVIDIEREN DU FEIGLINGE!" and trigger the shock collar* instead should be relatively trivial.
I was just wondering if it's only me who found the voice super-annoying, whether it's just a cultural thing, or if the tech bros entirely forgot to do any focus group study before they made their announcement.