Search results
99 packages found
The AssemblyAI JavaScript SDK provides an easy-to-use interface for interacting with the AssemblyAI API, which supports async and real-time transcription, as well as the latest LeMUR models.
Picovoice Leopard Node.js binding
React component and hook to initiate a SpeechRecognition session
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
- audio
- javascript
- youtube
- typescript
- sdk
- ffmpeg
- speech
- subtitles
- srt
- webvtt
- speech-to-text
- transcription
- stt
- asr
Picovoice Cheetah Node.js binding
- ai
- asr
- automatic speech recognition
- nlu
- offline
- private
- speech recognition
- speech-to-text
- voice assistant
- voice commands
- voice
Node.js bindings for OpenAI's Whisper. Runs locally on CPU.
- OpenAI
- Whisper
- CPP
- C++
- Bindings
- Transcribe
- Transcriber
- Transcript
- Transcription
- Audio
- Speech
- Speech-to-Text
- STT
- TTS
A library for using Web Speech API with Angular
- angular
- ng
- speech-recognition
- speech-to-text
- speech
- speech-synthesis
- speech-api
- speechrecognition
- text-to-speech
Local audio transcription on CPU. Node.js bindings for OpenAI's Whisper.
- OpenAI
- Whisper
- CPP
- C++
- Bindings
- Transcribe
- Transcriber
- Transcript
- Transcription
- Audio
- Speech
- Speech-to-Text
- STT
- TTS
A Node.js library for audio processing and transcription using the Whisper tool. It supports converting audio files to text using various pre-trained models.
- audio
- processing
- transcription
- speech-to-text
- Whisper
- sst
- voice recognition
- voice-to-text
- ASR
- automatic speech recognition
- speech processing
- natural language processing
- NLP
- deep learning
Node bindings for OpenAI's Whisper. Optimized for CPU.
- OpenAI
- Whisper
- CPP
- C++
- Bindings
- Transcript
- Transcriber
- Audio
- Speech
- Speech-to-Text
- Timestamps
- nodejs whisper
- whisper nodejs
- generate timestamps
Cheetah Speech-to-Text engine for web browsers (via WebAssembly)
Leopard Speech-to-Text engine for web browsers (via WebAssembly)
An easy-to-use speech toolset. Includes tools for synthesis, recognition, alignment, speech translation, language detection, source separation and more.
- speech
- text-to-speech
- speech synthesis
- speech-to-text
- speech recognition
- speech alignment
- forced alignment
- speech translation
- language identification
- language detection
- source separation
React hook for Cheetah Web SDK
A Node.js client library for the Aristech Speech-to-Text API
React hook for Leopard Web SDK
A package for handling voice commands and speech recognition.
Node.js bindings for OpenAI's Whisper. Optimized for CPU.
🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.
- whisper
- openai
- groq
- transcription
- speech-to-text
- audio
- voice
- ai
- machine-learning
- nlp
- natural-language-processing
- audio-processing
- voice-recognition
- speech-recognition
This package summarizes transcriptions of YouTube videos, with support for multiple languages.