pinned
Runtime error
Agents
Bark with Voice Cloning
π
Proxy multimodal chat requests with text, images, and audio
Send text, images, and audio to get AI chat responses
Transcribe audio into text using different models
Transcribe audio files into text
Transcribe audio to text using Whisper and Distil-Whisper