Supported audio formats
Speechly supports multiple audio formats and codecs. Note that pre-recorded audio supports more formats and codecs than live streaming audio.
Pre-recorded audio
- FLAC (16-bit)
- OGG (Opus or Vorbis)
- WAV (16-bit, PCM)
Live streaming audio
- WAV (1 channel, 16‑bit, 16 kHz, PCM, max 5 minutes)
Other formats/codecs
For enterprise customers we have the capability to add support for additional formats with a few months lead time.
Tip: Easily convert audio files with Audacity or with SoX by running:
sox in.wav -c 1 -b 16 -r 16000 out.wav