Getting started with Speechly

Learn how to create a Speechly application and transcribe both live streaming and pre-recorded audio. This guide is tailored for those starting out with Speechly.

Quick introduction

Speechly is a voice technology that offers Automatic Speech Recognition (ASR) and Natural Language Understanding (NLU) tools and APIs. We've built it from the ground up, and here are some features we think you'll like:

⚡️ Transcribe pre-recorded or live streaming audio
🏁 Support for 99 languages
🚀 Deploy on-device, on-premise or in the cloud
📈 Obtain extreme accuracy with domain-specific model training
💻 Easily integrate Speechly into your product using our SDKs and APIs
🔮 Powerful NLU features like intent and entity detection
😎 Manage your projects, applications and users with Speechly Dashboard and Speechly CLI

Create a Speechly account

Before you can use Speechly products, you need to create a Speechly account.

Signing up is free and includes:

50 hours of API quota for personal and non-commercial use
All the latest models
Cloud deployment
Batch and Streaming transcription
NLU features

Create a Speechly application

Once you have your account set up, you need to create a Speechly application.

Each application in Speechly hosts its own training data and settings. Your project can contain as many applications as you like. You can create an application using Speechly Dashboard or Speechly CLI.

Speechly Dashboard

Open Create a new application
Give your application a Name
Press Create application

Speechly CLI

To access your project from Speechly CLI, create a Speechly API token by going to Project settings → API tokens. Make sure to copy and store the token; you'll need it soon.

Install Speechly CLI:

# Using Homebrew
brew tap speechly/tap
brew install speechly

# Using Scoop
scoop bucket add speechly https://github.com/speechly/scoop-bucket
scoop install speechly

Add your project:

speechly projects add --apikey YOUR_API_TOKEN

Create a new application:

mkdir my-app
cd my-app
speechly create "My first app"

Copy the App ID; you'll need it shortly.

Transcribe live streaming audio

Streaming transcription works in real time and is well suited to live streaming audio, for example audio captured from the device microphone. To demonstrate this, we'll be using Speechly Dashboard.

Open Preview

Open Speechly Dashboard
Open your Application
Go to the Preview tab

Start talking

Once there, press the microphone button, give the site access to your microphone, and start talking. Notice how the transcript appears in real time!

Analyze the response

Open the browser developer console to see the JSON response that's emitted for each speech segment:

{
  "id": 0,
  "contextId": "9af98c09-c974-4393-9368-0c64a6c3e583",
  "isFinal": true,
  "words": [
    {
      "value": "welcome",
      "index": 2
    },
    {
      "value": "to",
      "index": 3
    },
    {
      "value": "my",
      "index": 4
    },
    {
      "value": "first",
      "index": 5
    },
    {
      "value": "application",
      "index": 6
    }
  ],
  "entities": [],
  "intent": {
    "intent": "",
    "isFinal": true
  }
}
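
To turn a segment like the one above into plain text, the word values can be joined in index order. The following Python sketch assumes every segment has the same shape as the example response; the helper name segment_text is hypothetical and is not part of any Speechly SDK.

```python
import json

# Example segment, copied from the response shown above (whitespace condensed).
SEGMENT_JSON = """
{
  "id": 0,
  "contextId": "9af98c09-c974-4393-9368-0c64a6c3e583",
  "isFinal": true,
  "words": [
    {"value": "welcome", "index": 2},
    {"value": "to", "index": 3},
    {"value": "my", "index": 4},
    {"value": "first", "index": 5},
    {"value": "application", "index": 6}
  ],
  "entities": [],
  "intent": {"intent": "", "isFinal": true}
}
"""


def segment_text(segment: dict) -> str:
    """Join the word values of one segment in ascending index order.

    Hypothetical helper: it relies only on the "words", "value" and "index"
    fields visible in the example response.
    """
    words = sorted(segment.get("words", []), key=lambda w: w["index"])
    return " ".join(w["value"] for w in words)


segment = json.loads(SEGMENT_JSON)
if segment["isFinal"]:
    print(segment_text(segment))  # prints: welcome to my first application
```

Sorting by index is a defensive choice here; this guide does not say whether words can arrive out of order.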

What’s next?

Now that you have received your first streaming transcript using Speechly, check out Speechly On-device. It's an excellent way of transcribing live streaming audio in real time, accurately and cost-effectively, right on the user's device.

Transcribe pre-recorded audio

Batch transcription works asynchronously and is well suited to pre-recorded audio. To demonstrate this, we'll be using the Speechly CLI transcribe command. If you used Speechly Dashboard in the previous steps, now is a good time to install and set up Speechly CLI.

Choose an audio file

Use an existing audio file, record your own or use our sample audio file. See Supported audio formats for more information.

Info

The transcribe command also supports transcribing multiple files. Create a JSON Lines file with each audio file on its own line using the format: {"audio": "path/to/file.wav"}. Then pass the JSON Lines file as the input file.
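
A JSON Lines file in that format can be generated with a few lines of Python. This is a minimal sketch: the helper name write_batch_file, the audio paths and the output file name files.jsonl are all placeholders chosen for illustration, and the file is written to a temporary directory so the example has no side effects.

```python
import json
import tempfile
from pathlib import Path


def write_batch_file(audio_paths, output_path):
    """Write one {"audio": ...} JSON object per line; return the line count.

    Hypothetical helper. json.dumps takes care of escaping characters such as
    quotes or backslashes that may appear in a path.
    """
    lines = [json.dumps({"audio": str(path)}) for path in audio_paths]
    Path(output_path).write_text(
        "".join(line + "\n" for line in lines), encoding="utf-8"
    )
    return len(lines)


# Placeholder paths, used only for illustration.
output = Path(tempfile.mkdtemp()) / "files.jsonl"
count = write_batch_file(["recordings/a.wav", "recordings/b.wav"], output)
print(f"wrote {count} entries to {output}")
print(output.read_text(encoding="utf-8"), end="")
```

The resulting file is then given to the transcribe command in place of a single audio file.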

Upload audio

Open your terminal, navigate to the location where your audio file is and run:

speechly transcribe path/to/file.wav -a YOUR-APP-ID

You can use the same App ID as in the previous example.
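
If you want to call the same command from a script, a thin wrapper around the CLI is one option. The Python sketch below assumes that Speechly CLI is installed and on your PATH and that the transcript is written to standard output; the function names are hypothetical, and only the command and the -a flag shown above are used.

```python
import subprocess
from typing import List


def build_transcribe_command(audio_path: str, app_id: str) -> List[str]:
    """Return the argument list for the transcribe command shown above."""
    return ["speechly", "transcribe", audio_path, "-a", app_id]


def transcribe(audio_path: str, app_id: str) -> str:
    """Run the CLI and return whatever it printed to standard output.

    Assumption: the transcript is written to standard output.
    """
    result = subprocess.run(
        build_transcribe_command(audio_path, app_id),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout.strip()


# Only builds and prints the command; call transcribe() to actually run it.
print(" ".join(build_transcribe_command("path/to/file.wav", "YOUR-APP-ID")))
# prints: speechly transcribe path/to/file.wav -a YOUR-APP-ID
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.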

See results

Your transcript will appear in the terminal once it's ready.

Example output
hi i'm neil degrasse tyson astrophysicist in addition to probing the
secrets of the universe also a movie buff today i introduce you to a
film that everyone thought was lost forever until a print was recently
discovered in a hollywood vault future thirty eight forgotten treasure
from nineteen thirty eight it's one of the first color pictures preceding
gone with the wind and the wizard of oz by a year but what interest me
most is the science finally a movie that gets time travel right

What’s next?

Now that you have received your first batch transcript using Speechly, check out Speechly On-premise. It's an excellent way of transcribing large quantities of pre-recorded audio accurately and asynchronously in a customized installation.
