AI That Listens, Sees, and Understands — On the Edge
Sensory Perspectives on Medium

How Many Voice Assistants Do We Really Need?

19th Aug, 2024
3 min read
How Many Voice Assistants Do We Really Need?
Originally Published in
Sensory Perspectives on AI on Medium

That’s easy…Just one. One that knows all the current knowledge of the internet, doesn’t hallucinate, knows everything about me, can carry out actions on my behalf, answers to my voice with my wakeword, and runs on device so there’s no risk in my private data leaking out. I think it’s kind of what Apple Siri is striving for with on device LLM’s and Apple Intelligence, but they aren’t there yet.

Nobody is there yet. In the next 5 or so years we will likely need multiple assistants. The assistants can have different
domains of expertise and by staying focused each one can be smaller and more intelligent within its domain. By being smaller it can run on device, so the most private voice assistants will have these more targeted capabilities. For example, a medical assistant can know all my personal health information and history and is a health expert, so it can intelligently keep up on all the latest studies and provide great targeted advice. I could have a separate car assistant that runs without internet and can route to “Grandpa’s house” and knows that when I look for “nearby restaurants” I’m looking for good-tasting vegan food. It knows me!

Each assistant can have its own name, in fact, Sensory can run
multiple wake words in parallel. We can listen for Google, Alexa, Cortana, Siri or others all at the same time. We can personalize these by making them only respond to the right user through the power of speaker verification. Sensory calls this an Enrolled Fixed Trigger (wake word).

Most importantly, the technology to name your own assistant is here today! Sensory calls it
User-Defined Triggers”. You say your wakeup phrase a few times and the Sensory tech stack learns how you say it and enables a tiny low-power recognition of your voice on many dozens of different chips and platforms and in any language, all in an ultra-tiny low heat, low MIPs package!

In the short run, we will probably be calling out to a few different Voice Assistants, maybe we’ll soon have a master assistant that you get to name and that performs cognitive arbitration to figure out which domain specific models it needs engage to answer questions as accurately as possible with the privacy levels and connectivity you want!

Related Articles

Sensory Perspectives on Medium
23rd Aug, 2024
Small Language Models (SLM), Large Language Models(LLM), or Micro LLM (MLM)?
Todd MozerTodd Mozer
4 min read

LLMs & Small LLMs. Since the arrival of generative AI and Large Language Models the world has learned...

Sensory Perspectives on Medium
31st Jul, 2024
LLMs and Voice Assistants
Todd MozerTodd Mozer
5 min read

Voice Assistants & Dependency…it’s Gonna get Bad I’m not particularly concerned about the dystopian...

Sensory Perspectives on Medium
15th Jul, 2024
Your Voice Assistant is MMETYT
Todd MozerTodd Mozer
3 min read

I love Pi, the voice Assistant from Inflection. It’s extremely helpful to me and has aided in everything...