Deepgram JavaScript SDK

Official JavaScript SDK for Deepgram. Power your apps with world-class speech and Language AI models.

Documentation
Migrating from earlier versions
- V2 to V3
- V3.* to V3.4
- V3.* to V4
Installation
- UMD
- ESM
Initialization
- Getting an API Key
Scoped Configuration
- 1. Global Defaults
- 2. Namespace-specific Configurations
- 3. Transport Options
- 4. Examples
Pre-Recorded (Synchronous)
- Remote Files
- Local Files
- Browser
Pre-Recorded (Asynchronous / Callbacks)
- Remote Files
- Local Files
- Browser
Streaming Audio
Transcribing to captions
Voice Agent
Text to Speech Rest
Text to Speech Streaming
Text Intelligence
Authentication
- Get Token Details
Projects
- Get Projects
- Get Project
- Update Project
- Delete Project
Keys
- List Keys
- Get Key
- Create Key
- Delete Key
Members
- Get Members
- Remove Member
Scopes
- Get Member Scopes
- Update Scope
Invitations
- List Invites
- Send Invite
- Delete Invite
- Leave Project
Usage
- Get All Requests
- Get Request
- Summarize Usage
- Get Fields
Billing
- Get All Balances
- Get Balance
Models
- Get All Project Models
- Get Model
On-Prem APIs
- List On-Prem credentials
- Get On-Prem credentials
- Create On-Prem credentials
- Delete On-Prem credentials
Backwards Compatibility
Development and Contributing
- Debugging and making changes locally
Getting Help

Documentation

You can learn more about the Deepgram API at developers.deepgram.com.

Migrating from earlier versions

V2 to V3

We have published a migration guide on our docs, showing how to move from v2 to v3.

V3.* to V3.4

We recommend using only documented interfaces, as we strictly follow semantic versioning (semver) and breaking changes may occur for undocumented interfaces. To ensure compatibility, consider pinning your versions if you need to use undocumented interfaces.

V3.* to V4

The Voice Agent interfaces have been updated to use the new Voice Agent V1 API. Please refer to our Documentation on Migration to new V1 Agent API.

Installation

You can install this SDK directly from npm.

npm install @deepgram/sdk
# - or -
# yarn add @deepgram/sdk

UMD

You can now use plain <script>s to import deepgram from CDNs, like:

<script src="https://cdn.jsdelivr.net/npm/@deepgram/sdk"></script>

or even:

<script src="https://unpkg.com/@deepgram/sdk"></script>

Then you can use it from a global deepgram variable:

<script>
  const { createClient } = deepgram;
  const _deepgram = createClient("deepgram-api-key");

  console.log("Deepgram Instance: ", _deepgram);
  // ...
</script>

ESM

You can now use type="module" <script>s to import deepgram from CDNs, like:

<script type="module">
  import { createClient } from "https://cdn.jsdelivr.net/npm/@deepgram/sdk/+esm";
  const deepgram = createClient("deepgram-api-key");

  console.log("Deepgram Instance: ", deepgram);
  // ...
</script>

Initialization

All of the examples below will require createClient.

import { createClient } from "@deepgram/sdk";
// - or -
// const { createClient } = require("@deepgram/sdk");

const deepgram = createClient(DEEPGRAM_API_KEY);

Getting an API Key

🔑 To access the Deepgram API you will need a free Deepgram API Key.

Scoped Configuration

The SDK supports scoped configuration. You'll be able to configure various aspects of each namespace of the SDK from the initialization. Below outlines a flexible and customizable configuration system for the Deepgram SDK. Here's how the namespace configuration works:

1. Global Defaults

The global namespace serves as the foundational configuration applicable across all other namespaces unless overridden.
Includes general settings like URL and headers applicable for all API calls.
If no specific configurations are provided for other namespaces, the global defaults are used.

2. Namespace-specific Configurations

Each namespace (listen, manage, onprem, read, speak) can have its specific configurations which override the global settings within their respective scopes.
Allows for detailed control over different parts of the application interacting with various Deepgram API endpoints.

3. Transport Options

Configurations for both fetch and websocket can be specified under each namespace, allowing different transport mechanisms for different operations.
For example, the fetch configuration can have its own URL and proxy settings distinct from the websocket.
The generic interfaces define a structure for transport options which include a client (like a fetch or WebSocket instance) and associated options (like headers, URL, proxy settings).

This configuration system enables robust customization where defaults provide a foundation, but every aspect of the client's interaction with the API can be finely controlled and tailored to specific needs through namespace-specific settings. This enhances the maintainability and scalability of the application by localizing configurations to their relevant contexts.

4. Examples

Change the API url used for all SDK methods

Useful for using different API environments (for e.g. beta).

import { createClient } from "@deepgram/sdk";
// - or -
// const { createClient } = require("@deepgram/sdk");

const deepgram = createClient(DEEPGRAM_API_KEY, {
  global: { fetch: { options: { url: "https://api.beta.deepgram.com" } } },
});

Change the API url used for the Voice Agent websocket

Useful for using a voice agent proxy (for e.g. 3rd party provider auth).

import { createClient } from "@deepgram/sdk";
// - or -
// const { createClient } = require("@deepgram/sdk");

const deepgram = createClient(DEEPGRAM_API_KEY, {
  global: { websocket: { options: { url: "ws://localhost:8080" } } },
});

Change the API url used for transcription only

Useful for on-prem installations. Only affects requests to /listen endpoints.

import { createClient } from "@deepgram/sdk";
// - or -
// const { createClient } = require("@deepgram/sdk");

const deepgram = createClient(DEEPGRAM_API_KEY, {
  listen: { fetch: { options: { url: "http://localhost:8080" } } },
});

Override fetch transmitter

Useful for providing a custom http client.

import { createClient } from "@deepgram/sdk";
// - or -
// const { createClient } = require("@deepgram/sdk");

const yourFetch = async () => {
  return Response("...etc");
};

const deepgram = createClient(DEEPGRAM_API_KEY, {
  global: { fetch: { client: yourFetch } },
});

Proxy requests in the browser

This SDK now works in the browser. If you'd like to make REST-based requests (pre-recorded transcription, on-premise, and management requests), then you'll need to use a proxy as we do not support custom CORS origins on our API. To set up your proxy, you configure the SDK like so:

import { createClient } from "@deepgram/sdk";

const deepgram = createClient("proxy", {
  global: { fetch: { options: { proxy: { url: "http://localhost:8080" } } } },
});

Important: You must pass "proxy" as your API key, and use the proxy to set the Authorization header to your Deepgram API key.

Your proxy service should replace the Authorization header with Authorization: token <DEEPGRAM_API_KEY> and return results verbatim to the SDK.

Check out our example Node-based proxy here: Deepgram Node Proxy.

Set custom headers for fetch

Useful for many things.

import { createClient } from "@deepgram/sdk";

const deepgram = createClient("proxy", {
  global: { fetch: { options: { headers: { "x-custom-header": "foo" } } } },
});

Pre-Recorded (Synchronous)

Remote Files

Transcribe audio from a URL.

const { result, error } = await deepgram.listen.prerecorded.transcribeUrl(
  {
    url: "https://dpgr.am/spacewalk.wav",
  },
  {
    model: "nova",
  }
);

@deepgram/sdk

Deepgram JavaScript SDK

Documentation

Migrating from earlier versions

V2 to V3

V3.* to V3.4

V3.* to V4

Installation

UMD

ESM

Initialization

Getting an API Key

Scoped Configuration

1. Global Defaults

2. Namespace-specific Configurations

3. Transport Options

4. Examples

Change the API url used for all SDK methods

Change the API url used for the Voice Agent websocket

Change the API url used for transcription only

Override fetch transmitter

Proxy requests in the browser

Set custom headers for fetch

Pre-Recorded (Synchronous)

Remote Files

Local Files

Browser

Pre-Recorded (Asynchronous / Callbacks)

Remote Files

Local Files

Browser

Streaming Audio

Browser

Transcribing to captions

Voice Agent

Text to Speech Rest

Text to Speech Streaming

Text Intelligence

Authentication

Get Token Details

Grant Token

Projects

Get Projects

Get Project

Update Project

Delete Project

Keys

List Keys

Get Key

Create Key

Delete Key

Members

Get Members

Remove Member

Scopes

Get Member Scopes

Update Scope

Invitations

List Invites

Send Invite

Delete Invite

Leave Project

Usage

Get All Requests

Get Request

Summarize Usage

Get Fields

Summarize Usage

Billing

Get All Balances

Get Balance

Models

Get All Project Models

Get Model

On-Prem APIs

List On-Prem credentials

Get On-Prem credentials

Create On-Prem credentials

Delete On-Prem credentials

Backwards Compatibility

Weekly Downloads