clock menu more-arrow no yes mobile

Filed under:

Expect to talk to your devices a lot more in the future

If what we’ve seen in the first couple of days of CES is any indicator of what’s in store for 2018, we can expect to be talking a lot.    

Signage for the Google Assistant is displayed during CES 2018 at the Sands Expo and Convention Center on January 9, 2018 in Las Vegas, Nevada. CES, the world's largest annual consumer technology trade show, runs through January 12 and features about 3,900
Signage for the Google Assistant display at CES 2018 in Las Vegas, Nevada.
Ethan Miller / Getty Images

A version of this essay was originally published at Tech.pinions, a website dedicated to informed opinions, insight and perspective on the tech industry.


If what we have seen in the first couple of days of CES is any indicator of what is in store for us in 2018, we can expect to be talking a lot. Most of the time, probably more like shouting commands right, left and center to devices scattered around our house, in the office and the car.

What last year was an “Alexa takes all” show turned into a stage for all vendors to show off either their support for Alexa and Google Assistant or to announce their own assistant. But as more vendors jump on the bandwagon, it is essential that a distinction is made between a voice interface and an assistant. While they might seem to be the same, they are not, and making it clear what is which will secure its success.

An assistant is an investment

For an assistant to be helpful, you will need to invest time into it. First, at least for now, the user will need to learn how to talk to it. As much as voice is a very intuitive user interface, current voice assistants just do not communicate like a fellow human being. Lack of context limits most exchanges to a set of simple questions and answers. Like a real assistant, a digital one needs to get to know you, which means to know your preferences and information such as calendar appointments, access to apps you use regularly and the devices you might want it to control for you. It’s a true learning curve that will require time even for the smartest assistant.

Of course, search queries or simple tasks that are more like commands, like “turn this on, play that, remind me of this,” require little knowledge of us on the assistant’s part. However, when you want the assistant to be proactive and start doing things for you without you asking, like a true assistant would do, that is when knowledge is power. To some extent the smarter the assistant, the less I should actually need to talk to it.

The ubiquitous nature of an assistant

In the home, my assistant should be pretty ubiquitous to be useful. This is why speakers have been such a focus for Amazon, Google, Microsoft and Apple. Being able to control my devices and ask questions from wherever I am in the home is key to building engagement and ultimately dependence.

At CES and even leading up to it, we have heard of other upcoming devices such as TVs, refrigerators, light switches and even showers all with an assistant inside. Soon our homes will be seeded with many devices able to assist us, and that could potentially have a voice.

As we are at the very beginning of this journey, I can understand why vendors are trying to cover all corners, as it is yet unclear what device will be the Trojan horse into someone’s home. For instance, someone might not be buying a smart speaker, but would be happy to have an assistant integrated in the new TV they are purchasing.

I would argue, however, that there is a limit for how many devices should have an embedded assistant versus being able to be controlled through an assistant. The difference, in my mind, is proportional to the value that an embedded assistant will deliver, which must go beyond being able to execute commands. I might want to be able to control my dishwasher with voice, but I do not have to engage in conversation with it; the same can be said about my washing machine. My fridge however, could give my assistant access to a lot of information on how fresh my food is to the best temperature to maintain it fresh, to recipes that would use up what I have left in it. In order for this information to be conveyed, the fridge should have access to the internet, have a camera and have a voice and of course be smart. Embedding an assistant versus connecting the fridge to an external assistant seems to be a much more effective implementation.

If adoption of voice-enabled devices goes the way vendors are hoping for, we will also have to have a way to manage all these devices that will be voice-enabled. This could go two ways. I would either be able to call my assistant something that is device-specific, or only the assistant in the device I want will respond based on context. So I would either call my assistant for the fridge “chef,” or if I ask, “What can I cook tonight?” only the assistant in the fridge will answer me. Right now, neither of these scenarios is an option.

If I own a TV, a speaker, a phone and a fridge, all assistant-enabled, the likely scenario is that, at my cooking question, I will have my TV show me a cooking program, my speaker say, “Sorry, I am not sure how to help with that yet,” my phone will say, “Here is something I found on the internet” and my fridge will actually give me a recipe. Not very helpful!

Voice UI has value in itself

There are more devices that will benefit from a voice UI than they would an assistant. The value that a voice-first UI will deliver to users could be huge, even if there was no full-fledged assistant in the device. This is why I strongly feel that vendors should stay clear from using the term “assistant.”

Roku recently announced its Roku Entertainment Assistant, and immediately the press began asking whether it will be better than Alexa or Google Assistant. The reality is that such a comparison is unfair, because it is not an assistant. It is a voice-first UI that will let users ask to play content with their voice. If you have a Comcast remote, you can do that today. I can press a button and ask, “Play Scandal,” and the TV will show me all the ways I can watch it. This is not an assistant, it is a voice UI that saves me a bunch of steps, for which I am very grateful. Should these voice-first UIs even have a name? I would say no. As a user, all I need to know is that I can use my voice.

If you think I am overcomplicating this point, look at how hard it was for Samsung to pitch Bixby — it started out as a voice-first interface, but it was called an assistant, and because of that the reviews were fairly negative. This mostly was due to the fact that, as an assistant, Bixby did not have access to a deep pool or data, and as soon as users started to use it in the same way they would use Alexa or Google Assistant, its value was limited.

Differentiating between a voice-first UI and an assistant also brings a series of benefits for what needs to be integrated in the device, which could be helpful from a price structure perspective.

An assistant should be much more than a user interface, and I think this is where the market is struggling at the moment, because assistants are not actually that smart yet. I truly believe the smarter my assistant will be, the less I will talk to it, because the power of AI will have my assistant do her job, which is making my life easier by anticipating my needs.


Carolina Milanesi is a principal analyst at Creative Strategies Inc. She focuses on consumer tech across the board; from hardware to services she analyzes today to help predict and shape tomorrow. In her prior role as chief of research at Kantar Worldpanel ComTech, Milanesi drove thought leadership research; before that, she spent 14 years at Gartner, most recently as VP of consumer devices research and agenda manager. Reach her at @caro_milanesi.


This article originally appeared on Recode.net.

Sign up for the newsletter Sign up for Vox Recommends

Get curated picks of the best Vox journalism to read, watch, and listen to every week, from our editors.