Our Text-to-Speech software solution converts text into synthetic speech to enable AI assistant voice output, automate responses, and improve user interaction across digital platforms and services.

| Model | Model size [parameters] | Platform / Core | Real Time Factor | DNS-MOS | Quantization format | Code size [MB] | Library dependency |
|---|---|---|---|---|---|---|---|
| MPU | ~20M | i.MX 95 / 6x Cortex-A55 | 0.26 | 4.35 | 8-bit int (linear layers) | ~23 | ONNX |
| - | - | i.MX 8MP / 4x Cortex-A53 | 0.52 | - | - | - | - |
| - | - | i.MX 8MM / 4x Cortex-A53 | 0.59 | - | - | - | - |
| - | - | i.MX 8MN / 4x Cortex-A53 | 0.62 | - | - | - | - |
| - | - | i.MX 91 / 1x Cortex-A55 | 0.97 | - | - | - | - |
| MCU | ~2.2M | Cortex-M33/NPU | 0.24 | 4.17 | 8-bit int + 16a_8w int | 150 kB + 2.6 MB model size (and 1.7 MB + 2.6 MB SRAM) | TFLite |
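
The table does not state how Real Time Factor was measured. A minimal sketch, assuming the common definition (synthesis time divided by the duration of the audio produced, so values below 1.0 mean speech is generated faster than it plays back), might look like the following. The `synthesize` callable and the sample rate are hypothetical placeholders, not part of the product API.

```python
import time
from typing import Callable, Sequence


def real_time_factor(processing_s: float, audio_s: float) -> float:
    """Assumed definition: RTF = synthesis time / duration of generated audio."""
    if audio_s <= 0:
        raise ValueError("audio duration must be positive")
    return processing_s / audio_s


def measure_rtf(
    synthesize: Callable[[str], Sequence[float]],  # hypothetical TTS entry point
    text: str,
    sample_rate_hz: int,  # assumed output sample rate of the TTS engine
) -> float:
    """Time one synthesis call and relate it to the length of the audio returned."""
    start = time.perf_counter()
    samples = synthesize(text)
    elapsed_s = time.perf_counter() - start
    audio_s = len(samples) / sample_rate_hz
    return real_time_factor(elapsed_s, audio_s)
```

Under this assumed definition, an RTF of 0.26 would correspond to roughly 0.26 s of compute per second of generated speech, and 0.97 would be just under real time.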