I consulted on, and was interviewed about smart speakers and some of the considerations of having an ubiquitous voice assistant in our lives.
Hosted by JOSHUA MCNICHOLS and CAROLINE ADOLPH
MAR 25, 2019 at 12:04 AM
Anthonio Pettit is a veteran at writing for smart speakers now, and he’s worked on several of them such as Alexa, Cortana, and Samsung’s Bixby. He has even worked on the digital assistants living in car dashboards made by Toyota, Audi and Lexus.
“I can’t tell you how many times I’ve heard – “‘Lexa, bring me a beer!”
Pettit says they all have what are called “false accepts.” Like if you hit the button on your steering wheel, or you said something that sounded kind of like the wake word. That triggered the device to start recording.
“You know, when people aren’t aware that they’re being recorded – or sometimes even when they are — you hear weird stuff.”
In his job, Pettit heard stuff from all over the country.
Sounds of yelling. People using intimate nicknames for each other. A near car accident. (That was from one of the dashboard mounted assistants.)
“You’re never quite alone, when you’re speaking to these speakers. Potentially. There’s always the chance that your data is being used for quality assurance and that there’s somebody on the other side – a human – who is overhearing it.”
Tech companies like Amazon say they aren’t interested in surveilling us, they need the info they are overhearing to help the AI inside the smart speaker craft a better response. That’s because – some of Alexa’s answers don’t make a lot of sense. One recording featured a guy who just could not believe what Alexa told him when he asked what hamsters eat.
“So they called in their friend, and they’re like – ask it what hamsters eat.”
Alexa told them: Hamsters eat chicken, cucumber and nut.”
Joshua looked it up and found that hamsters are omnivores, so it’s not technically wrong – though they’re more likely to eat bugs than chicken.
“It was as weird for me to hear it as it was for them.”
At the time, Pettit’s job was to mark that recording as “not actually related to Whole Foods shopping.” That’s true to the company’s character. Ultimately, Amazon wants to sell you stuff.
Today, Pettit actually writes answers for smart speakers. There are much bigger, more important things to write answers for — answers that will help these companies win or lose territory in the Wake Word War.
“Like – if you tell Alexa you’ve been hungry, it’ll ask:
‘Which of these 3,000 restaurants do you want to order food from?’
But if you tell Cortana you’re hungry, it’ll say:
‘Why not try eating something?’”
With the really hard questions, Pettit says the writers will put several smart speaker brands on a table and listen to all of them respond just to see what the industry standard is.
“Like if somebody said ‘I’m lonely.’ So it’s not really a task, it’s sort of an emphatic statement. And yet, you’d want to be able to respond in a way that signals to the person doing the inputting that they’re being heard. But also not oversell the capabilities of the speaker. You’re not really a therapist, you’re not really someone who’s going to be able to provide an answer.”
This brings us to the edge of the precipice, because what the person telling the smart speaker that they’re lonely is doing is reaching out, as one would reach out to a friend.
And a smart speaker cannot be that friend.