Search results
105 packages found
🎙️ WhisperMix is a versatile module for transcribing audio using OpenAI’s Whisper or Groq’s Whisper v3 model.
- whisper
- openai
- groq
- transcription
- speech-to-text
- audio
- voice
- ai
- machine-learning
- nlp
- natural-language-processing
- audio-processing
- voice-recognition
- speech-recognition
- View more
This package summarizes transcriptions of YouTube videos, with support for multiple languages.
Make your app understand language. Summarize conversations, categorize articles, and more.
- nlp
- language
- natural language processing
- oneai
- one ai
- ai
- one
- natural language understanding
- natural language
- text
- text processing
- text classification
- text analysis
- language ai
- View more
Speech-to-text and text-to-speech using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- embedded systems
- open source
- zipformer
- asr
- tts
- stt
- c++
- onnxruntime
- onnx
- View more
A simple tool to merge multiple WebVTT (.vtt) files into a single file.
A package for converting DNA sequences into RNA
Video Summary SDK is a powerful Node.js module for advanced video processing, offering features like speech-to-text transcription, content summarization, automatic chapter extraction, and comprehensive video summarization. It's perfect for developers look
- video processing
- speech-to-text
- video summarization
- audio transcription
- whisper
- audio summarization
- transcription
- chapter extraction
- video analysis
- Node.js
Voicely - A CLI tool to transcribe audio files and summarize the transcriptions using OpenAI's APIs.
Common functions for Sentira AI
Mirador 3 plugin which renders a separate window, with OCR text
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
Speech-to-text, text-to-speech, and speaker diarization using Next-gen Kaldi without internet connection
- speech to text
- text to speech
- transcription
- real-time speech recognition
- without internet connection
- locally
- local
- embedded systems
- open source
- diarization
- speaker diarization
- speaker recognition
- speaker
- speaker segmentation
- View more
This is a node.js module used to transcribe wav files using Olaris v2 realtime transcription service
a JavaScript library that provides a robust and flexible solution for real-time audio transcription. It is designed to transcribe audio streams and can be easily integrated into web applications.
👂 RxJS operator for realtime speech-to-text (STT/S2T) using Deepgram speeh-to-text
- speech-to-text
- s2t
- stt
- transcription
- transcribe
- voice control
- deepgram
- speech recognition
- rxjs
- reactive
- observables
- stream
- streaming
- realtime
👂 RxJS operator for realtime speech-to-text (STT/S2T) using AWS Transcribe
- speech-to-text
- s2t
- stt
- transcription
- transcribe
- voice control
- aws transcribe
- speech recognition
- rxjs
- reactive
- observables
- stream
- streaming
- realtime