Search results
75 packages found
Sort by: Default
- Default
- Most downloaded this week
- Most downloaded this month
- Most dependents
- Recently published
Node implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid string to store as WebVTT or SRT caption files.
- audio
- javascript
- youtube
- typescript
- sdk
- ffmpeg
- speech
- subtitles
- srt
- webvtt
- speech-to-text
- transcription
- stt
- asr
- View more
Polyfill Web Speech API with Cognitive Services Speech-to-Text service
- cognitive services
- dictation
- microphone
- polyfill
- react
- speak
- speech recognition
- speech synthesis
- speech to text
- speechsynthesis
- stt
- text to speech
- tts
- unified speech
- View more
Add real-time speech to text functionality into your website with no effort
Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
Kaldi in-browser speech recognition based on a WASM build of the Vosk library
Mr.🆖 AI - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system.
Speech-to-text and text-to-speech using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- embedded systems
- open source
- zipformer
- asr
- tts
- stt
- c++
- onnxruntime
- onnx
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Voice assistant (Recognize and recorder)
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Node.js bindings for OpenAI's Whisper. Optimized for CPU.
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Promise based implementation of Yandex Speech Kit API
实时获取斗鱼弹幕
MPEG2 transport stream parser for Node.js with support for television broadcast PSIP tables
Hanzo Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodal, and extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.