SANTA CLARA, CA – April 15, 2026 – Sensory Inc., a pioneer in on-device AI for over 30 years, today announced a breakthrough in embedded speech recognition with the launch of its latest Speech-to-Text (STT) engine. Optimized for TensorFlow Lite Micro (TFLM) and advanced Neural Processing Units (NPUs), including the Arm® Ethos™-U55, this new engine delivers unparalleled accuracy and performance in an ultra-compact footprint.
Breaking down the barriers of localized AI, Sensory’s STT engine supports 37 languages, enabling manufacturers to deploy truly global products with a single, ultra-efficient architecture. Supported languages include:
Afrikaans, Arabic, Belarusian, Bengali, Bulgarian, Catalan, Croatian, Czech, Danish, Dutch, English, Farsi, Finnish, French, German, Greek, Hebrew, Hindi, Hungarian, Indonesian, Italian, Japanese, Korean, Malay, Mandarin, Norwegian, Polish, Portuguese, Romanian, Russian, Spanish, Swahili, Swedish, Thai, Turkish, Ukrainian, and Vietnamese.
Maximum Power, Minimum Footprint
By leveraging specialized Neural Processing Units (NPUs), Sensory’s STT engine is built to eliminate the performance-draining data transfers between a CPU and an NPU. This architecture offloads the entire tensor computation graph to the hardware accelerator, which can significantly reduce power consumption and latency. By keeping the CPU idle during inference, device manufacturers can extend battery life in portables or reserve processing cycles for complex system tasks and UI management.
The engine is available in two optimized configurations:
Technical Specifications & Performance
| Feature | Domain-Specific Model | General-Purpose Model |
|---|---|---|
| Ideal Use Case | Targeted Commands
Large vocabulary |
Natural Language
Unlimited vocabulary |
| Model Size | 2.7 MB | 13 MB |
| Peak SRAM Usage | 787.11 KiB | 1.68 MB |
| Compute Requirements | 892.9 Million MACs/inference | 4.37 Billion MACs/inference |
| Acceleration | 100% NPU Mapping | 100% NPU Mapping |
Universal Compatibility: LiteRT Micro & High-Performance Silicon
Sensory’s STT engine is engineered for rapid portability across a broad ecosystem. By using LiteRT Micro (formerly known as Tensorflow Lite Micro) as the essential runtime layer, Sensory provides seamless integration for:
Privacy and Performance
“Our STT engine demonstrates that natural language interfaces can be powerful without relying on the cloud,” said Todd Mozer, Chairman and CEO of Sensory. By processing 100% of voice data on-device, Sensory helps developers ensure user privacy, lower latency, and consistent reliability in environments with limited or no connectivity.
Why Embedded STT Matters
“Our latest STT engine proves that you don’t need a cloud connection or even a big embedded model for powerful, natural language interfaces,” said Todd Mozer, Chairman and CEO of Sensory. Sensory’s on-device approach offers several critical advantages over cloud solutions:
About Sensory Inc. Sensory Inc. creates a safer and superior on device user experience through vision and voice technologies. Sensory’s technologies are widely deployed in consumer electronics applications including mobile phones, automotive, wearables, and smart home devices.
Media Contact: Amanda Defelice, Head of Marketing, Sensory Inc. [email protected]