Speechly features
Feature availability varies depending on if you are using pre-recorded audio or live streaming audio, your application language and your project payment plan.
Category | Feature | Pre-recorded | Live streaming | Availability |
---|---|---|---|---|
Models | Conformer RNN‑T models | 3 | 3 | |
Whisper models | 1 | |||
Model adaptation / training | ✓ | ✓ | RNN‑T models only | |
Data annotation service | ✓ | ✓ | Enterprise plans only | |
Transcription | Pre-recorded audio | ✓ | ✓ | |
Live streaming audio | ✓ | |||
Language support | 99 | English | ||
Language detection | ✓ | |||
Model selection | ✓ | ✓ | ||
Word level timestamps | ✓ | ✓ | RNN‑T models only | |
Punctuation | * | * | ||
Number & date formatting | * | * | ||
Silence segmentation | * | ✓ | RNN‑T models only | |
Interim results | ✓ | RNN‑T models only | ||
Voice activity detection | ✓ | ✓ | ||
Evaluate ASR accuracy | ✓ | |||
Speech understanding | Intent detection | ✓ | RNN‑T models only | |
Entity detection | ✓ | RNN‑T models only | ||
Lookups | ✓ | RNN‑T models only | ||
Text labeling | * | * | English language only | |
Language translation | * | Whisper models only | ||
Evaluate NLU accuracy | ✓ | |||
Audio analysis | Language detection | ✓ | * | |
Audio event labeling | * | * | ||
Tone of voice | * | * | ||
Lyrics transcription | * | * | English language only | |
Supported audio formats | WAV | ✓ | ✓ | |
FLAC | ✓ | * | ||
OGG | ✓ | * | ||
MP3 | * | * | ||
AAC | ? | ? | ||
Up and down sampling | ✓ | ✓ | ||
Deployment options | On-device | ✓ | Enterprise plans only | |
On-premise | ✓ | ✓ | Enterprise plans only | |
Cloud | ✓ | ✓ | ||
Integration | Browser client | ✓ | ||
React client | ✓ | |||
Android client | ✓ | |||
iOS client | ✓ | |||
Unity client | ✓ | |||
Speechly decoder | ✓ | iOS, Android and C | ||
gRCP API | ✓ | ✓ | ||
REST API | ✓ | |||
Developer tools | Dashboard | |||
CLI |
✓ available * planned ? under consideration