High-level wrappers for AI conversation tools: text-to-speech, speech-to-text, LLMs...
- There be dragons here! This is a work in progress and not ready for production use.
- Deepgram (browser connection via WebSockets; use short-lived API keys)
- OpenAI Speech API (server-side, to protect API keys)
- Play audio returned from server (in browser)
- OpenAI GPT-4 (server-side + client-side)
- web-llm (client-side, WASM)
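
To illustrate why the Speech API call is kept server-side, here is a minimal sketch of a server helper that builds and sends the text-to-speech request so the API key never reaches the browser. The names `buildSpeechRequest`, `synthesize`, and `FetchLike` are hypothetical and not part of this project; the `tts-1` model and `alloy` voice are assumed defaults. The endpoint and JSON fields follow OpenAI's public Speech API.

```typescript
// Hypothetical helper names; this is a sketch, not this project's actual API.
interface SpeechRequest {
  url: string;
  method: "POST";
  headers: Record<string, string>;
  body: string;
}

// Minimal shape of fetch that the sketch relies on, so a fake can be injected in tests.
type FetchLike = (
  url: string,
  init: { method: string; headers: Record<string, string>; body: string }
) => Promise<{ ok: boolean; status: number; arrayBuffer(): Promise<ArrayBuffer> }>;

// Build the request for OpenAI's speech endpoint. Model and voice are assumed defaults.
function buildSpeechRequest(text: string, apiKey: string, voice = "alloy"): SpeechRequest {
  if (text.trim() === "") {
    throw new Error("text must not be empty");
  }
  return {
    url: "https://api.openai.com/v1/audio/speech",
    method: "POST",
    headers: {
      Authorization: `Bearer ${apiKey}`, // the key stays on the server
      "Content-Type": "application/json",
    },
    body: JSON.stringify({ model: "tts-1", input: text, voice }),
  };
}

// Send the request and return the raw audio bytes for the server to relay to the browser.
async function synthesize(text: string, apiKey: string, fetchImpl: FetchLike): Promise<ArrayBuffer> {
  const { url, method, headers, body } = buildSpeechRequest(text, apiKey);
  const res = await fetchImpl(url, { method, headers, body });
  if (!res.ok) {
    throw new Error(`Speech request failed with status ${res.status}`);
  }
  return res.arrayBuffer();
}
```

On a server with a global `fetch` (for example Node 18+), `synthesize(text, process.env.OPENAI_API_KEY, fetch)` would be one way to call it. The browser would then receive only the audio bytes from your own endpoint and play them, never seeing the key.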