Zero Shot voice cloning with llasa 3b (Unofficial Demo)
Transcribe audio files or YouTube videos into text
Generate videos from text prompts and optional images
Generate MIDI music from prompts