Supported audio formats

Speechly supports multiple audio formats and codecs. Note that pre-recorded audio supports more formats and codecs than live streaming audio.

Pre-recorded audio

  • FLAC (16-bit)
  • OGG (Opus or Vorbis)
  • WAV (16-bit, PCM)
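
Before uploading a pre-recorded file, it can help to confirm that it is in one of the containers listed above. The following is a minimal sketch, not part of any Speechly library: the helper name `sniff_container` is hypothetical, and it relies only on the well-known signature bytes of each container ("fLaC", "OggS", and "RIFF" followed by "WAVE"). It identifies the container only and does not verify the bit depth or the codec inside an OGG file.

```python
from typing import Optional


def sniff_container(header: bytes) -> Optional[str]:
    """Guess the audio container from the first bytes of a file.

    Hypothetical helper for illustration. Pass at least the first
    12 bytes of the file. Returns "FLAC", "OGG", "WAV", or None.
    """
    if header.startswith(b"fLaC"):
        return "FLAC"
    if header.startswith(b"OggS"):
        return "OGG"
    # A WAV file is a RIFF container whose form type (bytes 8-11) is "WAVE".
    if header.startswith(b"RIFF") and header[8:12] == b"WAVE":
        return "WAV"
    return None
```

For example, read the first bytes with `open(path, "rb").read(12)` and pass them to `sniff_container`; a result of `None` suggests the file needs converting first.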

Live streaming audio

  • WAV (1 channel, 16-bit, 16 kHz, PCM, max 5 minutes)
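
The live streaming requirements above can be checked locally before sending audio. This is a minimal sketch using Python's standard `wave` module; the function name `check_streaming_wav` and the constants are hypothetical and only restate the values from the list above. The `wave` module reads PCM data only, so `wave.open` raises `wave.Error` for other encodings.

```python
import wave

# Values restated from the live streaming requirements above.
REQUIRED_CHANNELS = 1
REQUIRED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
REQUIRED_SAMPLE_RATE_HZ = 16000
MAX_DURATION_SECONDS = 5 * 60


def check_streaming_wav(source):
    """Return a list of problems; an empty list means the file matches.

    `source` is a file path (str) or a binary file-like object.
    Hypothetical helper for illustration only.
    """
    problems = []
    with wave.open(source, "rb") as wav:
        if wav.getnchannels() != REQUIRED_CHANNELS:
            problems.append(f"expected 1 channel, got {wav.getnchannels()}")
        if wav.getsampwidth() != REQUIRED_SAMPLE_WIDTH_BYTES:
            problems.append(f"expected 16-bit, got {wav.getsampwidth() * 8}-bit")
        if wav.getframerate() != REQUIRED_SAMPLE_RATE_HZ:
            problems.append(f"expected 16000 Hz, got {wav.getframerate()} Hz")
        # Duration in seconds is the frame count divided by the frame rate.
        duration = wav.getnframes() / wav.getframerate()
        if duration > MAX_DURATION_SECONDS:
            problems.append(f"expected at most 300 s, got {duration:.1f} s")
    return problems
```

Calling `check_streaming_wav("in.wav")` returns an empty list when the file already meets the requirements; otherwise each entry names one mismatch to fix during conversion.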

Other formats/codecs

For enterprise customers, we can add support for additional formats with a lead time of a few months.

Tip: You can convert audio files with Audacity or SoX. For example, this SoX command converts in.wav to 1 channel, 16-bit, 16 kHz audio:

sox in.wav -c 1 -b 16 -r 16000 out.wav