openai/whisper-large-v3 Automatic Speech Recognition ⢠2B ⢠Updated Aug 12, 2024 ⢠6.59M ⢠⢠5.32k
google/pix2struct-widget-captioning-large Visual Question Answering ⢠1B ⢠Updated Apr 10, 2024 ⢠47 ⢠20
Salesforce/blip-image-captioning-large Image-to-Text ⢠0.5B ⢠Updated Feb 3, 2025 ⢠1.33M ⢠1.44k
mistralai/Mistral-7B-Instruct-v0.1 Text Generation ⢠7B ⢠Updated Jul 24, 2025 ⢠283k ⢠1.82k
Runtime error Featured 272 Edit Video By Editing Text ā 272 Audio-based video editing using AI-generated transcription
Runtime error Featured 307 AudioLDM2 Text2Audio Text2Music Generation š 307 Generate audio and waveform video from text