Miscellaneous - a GayatriValley Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

GayatriValley 's Collections

Miscellaneous

updated Dec 13, 2024

Build error

Featured

792

Unique3D

⚡

792

Create a 1M faces 3D colored model from an image!
Runtime error

53

Paligemma Doc

📚

53

Try PaliGemma on document understanding tasks
wangfuyun/PCM_Weights

Text-to-Image • Updated Oct 30, 2024 • 21 • 99
Running on Zero

457

Stable Audio Open Zero

🔥

457

Generate custom audio clips from text prompts
Paused

Featured

314

PaliGemma Demo

🤲

314

Annotate and describe images with text prompts
atcsecure/dolphin-2.9.2-qwen72b-8.0bpw-h8-exl2

Text Generation • Updated Jun 9, 2024 • 2 • 2
stabilityai/stable-video-diffusion-img2vid-xt

Image-to-Video • Updated Jul 10, 2024 • 132k • 3.23k
DAMO-NLP-SG/VideoLLaMA2-7B

Visual Question Answering • 8B • Updated Aug 13, 2024 • 901 • 42
SakanaAI/DiscoPOP-zephyr-7b-gemma

Text Generation • 9B • Updated Jun 13, 2024 • 16 • 36
madebyollin/taesd3

Updated Jun 14, 2024 • 730 • 38
hpcai-tech/OpenSora-VAE-v1.2

0.4B • Updated Jun 17, 2024 • 6.15k • 57
Running

Featured

84

NaRCan

💊

84

Edit your video style using a text prompt and control maps
MaziyarPanahi/calme-2.1-qwen2-72b-GGUF

Text Generation • 73B • Updated Aug 2, 2024 • 150 • 13
Build error

Featured

93

DiffIR2VR

👌

93

Video upscaler/restorer
CAMB-AI/MARS5-TTS

Text-to-Speech • Updated Jul 5, 2024 • 64 • 482
dphn/dolphin-vision-72b

Text Generation • 73B • Updated Jul 16, 2024 • 363 • 133
Running on Zero

Featured

72

Florence-2 for Videos

🎬

72

Annotate video with object boxes and captions
Running on Zero

132

FLUX.1-dev + Captioner

🐨

132

Generate images from prompts or images
Runtime error

Featured

367

Video Transcription Smart Summary

⚡

367

Generate summaries from YouTube videos or uploaded videos
qnguyen3/nanoLLaVA-1.5

Image-Text-to-Text • 1B • Updated Sep 21, 2024 • 64 • 112
Runtime error

Featured

124

nanoLLaVA-1.5

🚀

124

Chat about images by uploading them
zai-org/codegeex4-all-9b

Text Generation • 9B • Updated Jul 18, 2024 • 2.78k • 265
Sleeping

10

Langflow Crewai

💻

10

Build and run language models visually
Running on Zero

Featured

969

Tile Upscaler

🚀

969

Upscale and enhance images using tile ControlNet
Running

Featured

218

Whisper Timestamped

🕒

218

In-browser speech recognition w/ word-level timestamps
Running on Zero

Featured

2.06k

IDM VTON

👕

2.06k

High-fidelity Virtual Try-on
deepseek-ai/DeepSeek-V2-Chat-0628

Text Generation • 236B • Updated Jul 18, 2024 • 3.13k • 177
TheDrummer/Big-Tiger-Gemma-27B-v1-GGUF

27B • Updated Jul 14, 2024 • 1.53k • 73
fal/AuraFlow

Text-to-Image • Updated Jul 18, 2024 • 342 • • 654
xinsir/controlnet-union-sdxl-1.0

Text-to-Image • Updated Jul 30, 2024 • 150k • 1.68k
TheBloke/MythoMax-L2-13B-GPTQ

Text Generation • 13B • Updated Sep 27, 2023 • 579 • 218
Gryphe/MythoMax-L2-13b

Text Generation • Updated Apr 21, 2024 • 1.37k • • 372
Gryphe/Pantheon-RP-1.0-8b-Llama-3

Text Generation • 8B • Updated May 13, 2024 • 26 • • 51
Gryphe/Tiamat-8b-1.2-Llama-3-DPO

Text Generation • 8B • Updated May 3, 2024 • 3 • 6
BeaverLegacy/Smegmma-9B-v1

Text Generation • 10B • Updated Jul 13, 2024 • 5 • 50
mradermacher/Nymph_8B-i1-GGUF

8B • Updated Aug 2, 2024 • 25 • 2
Runtime error

29

MusiConGen

🪩

29
mlabonne/Meta-Llama-3.1-8B-Instruct-abliterated

Text Generation • 8B • Updated Sep 14, 2024 • 4.74k • • 195
FunAudioLLM/SenseVoiceSmall

Updated Jul 31, 2024 • 2.31k • 357
Running on Zero

MCP

24

Video-to-Audio Ldm

🎧

24

Video-to-Audio Generation with Hidden Alignment
CofeAI/Tele-FLM-1T

Text Generation • Updated Jan 10 • 219 • 82
maxin-cn/Cinemo

Image-to-Video • Updated Aug 14, 2024 • 2 • 32
Running on Zero

Featured

204

Cinemo

🎥

204

Multimodal Image-to-Video
Running

20

Mms Zeroshot

🌍

20

Transcribe audio in any language using text data
Sleeping

Featured

56

AccDiffusion

🏆

56

Generate images from text prompts
Runtime error

Featured

185

Artist

🎨

185

Aesthetically Controllable Text-Driven Stylization w/o Train
Runtime error

95

EchoMimic

🐨

95

Generate lifelike video animations from images and audio
HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • 8B • Updated Dec 2, 2024 • 151k • 302
parler-tts/parler-tts-mini-v1

Text-to-Speech • 0.9B • Updated Nov 25, 2024 • 17k • 152
parler-tts/parler-tts-large-v1

Text-to-Speech • 2B • Updated Nov 22, 2024 • 11.5k • 272
Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • 8B • Updated Nov 20, 2024 • 24.4k • 159
black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Jun 27, 2025 • 779k • • 12.3k
Runtime error

214

CatVTON

🐈

214

Try on clothes virtually with images
wanglab/ecg-fm

Updated May 5, 2025 • 15
XLabs-AI/flux-lora-collection

Text-to-Image • Updated Aug 14, 2024 • 581
Runtime error

58

Vgg Heads

🖼

58
migtissera/Tess-3-Mistral-Nemo-12B

12B • Updated Sep 4, 2024 • 44 • 13
nisten/all-human-diseases

Viewer • Updated Aug 19, 2024 • 2.2k • 74 • 106
DAMO-NLP-SG/VideoLLaMA2-72B

Visual Question Answering • 75B • Updated Aug 14, 2024 • 10 • 10
answerdotai/answerai-colbert-small-v1

33.4M • Updated Nov 18, 2024 • 1.3M • 157
mlabonne/Hermes-3-Llama-3.1-8B-lorablated-GGUF

8B • Updated Aug 16, 2024 • 890 • 31
labotollama3/lobotollama-5.5b

Text Generation • 6B • Updated Apr 22, 2024 • 1 • 4
Mozilla/whisperfile

Updated Oct 2, 2024 • 1.41k • 255
Runtime error

45

FAI Fuzer Medium v0.3

🎨

45

Generate enhanced images by blending foreground with custom backgrounds
ZhengPeng7/BiRefNet

Image Segmentation • 0.2B • Updated 8 days ago • 958k • 524
Runtime error

10k

Kolors Virtual Try-On

👕

10k

Try on clothes on a person image
fal/AuraFace-v1

Updated Aug 26, 2024 • 144
dphn/dolphin-2.9.4-gemma2-2b

3B • Updated Aug 27, 2024 • 67 • 38
pzc163/MiniCPMv2_6-prompt-generator

Updated Aug 24, 2024 • 39 • 49
Running on Zero

1.03k

CogVideoX-5B

🎥

1.03k

Text-to-Video
yifeihu/TB-OCR-preview-0.1

Image-Text-to-Text • 4B • Updated Sep 6, 2024 • 41 • 129
InstantX/FLUX.1-dev-Controlnet-Union

Updated Aug 26, 2024 • 9.06k • 471
Running on Zero

Featured

86

Qwen2-VL-2B

🔥

86

Generate text from images or videos
Qwen/Qwen2-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Jan 12, 2025 • 2.02M • 487
Running

Featured

59

Groq Gradio Voice Assistant

👁

59

Transcribe speech and generate AI response
IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 2 • 29
facebook/sapiens

Updated Sep 20, 2024 • 31 • 243
Running on Zero

28

Tb Ocr

📈

28

Convert image text to markdown format
YuWangX/memoryllm-8b-chat

10B • Updated Nov 17, 2024 • 217 • 20
Running

211

HivisionIDPhotos

🌖

211

Create professional ID photos with automatic background removal
virtuals-protocol/mario-videogamegen

Updated Sep 6, 2024 • 13
Running on Zero

266

Qwen2-VL-7B

🔥

266

Answer questions about any uploaded image
Running on Zero

Featured

281

Latent Navigation

🪐

281

Travel through the model latent space
mattshumer/Reflection-Llama-3.1-70B

Text Generation • 71B • Updated Sep 24, 2024 • 222 • 1.71k
Configuration error

Featured

116

ViewCrafter

🐨

116

Create a video from an image with camera motion
Runtime error

18

Text Image Analyzer

💻

18

Analyse any image with Llama3.2
vidore/colqwen2-v0.1

Visual Document Retrieval • Updated Mar 21, 2025 • 103k • 193
Runtime error

12

Llama 3.2 Vision Free

🐢

12
facebook/Self-taught-evaluator-llama3.1-70B

Updated Sep 30, 2024 • 42
openai/clip-vit-large-patch14-336

Zero-Shot Image Classification • Updated Oct 4, 2022 • 3.88M • 286
jasperai/Flux.1-dev-Controlnet-Upscaler

Image-to-Image • Updated Mar 22, 2025 • 3.72k • 857
Running on Zero

Featured

326

Diffusers Image Fill

🏃

326

Fill and edit images using masks
Sleeping

36

PDF to Page Images Dataset

📂

36

Convert PDFs to individual page images
Running on Zero

Featured

72

ColPali fine-tuning Query Generator

🔍

72

Generate document search queries from a page image
Runtime error

10

Vision Pipeline

🌍

10

Answer questions about uploaded images and documents
nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14, 2025 • 107k • 775
Running on Zero

1k

Whisper Turbo

🤯

1k

Transcribe audio or YouTube videos into text
davanstrien/ufo-ColPali

Viewer • Updated Sep 23, 2024 • 2.24k • 102 • 25
jadechoghari/openmusic

Text-to-Audio • Updated Oct 10, 2024 • 14 • 72
Build error

214

OpenMusic

🎶

214

Generate music from text descriptions
Running

458

PDF2Audio

📚

458

Generate audio‑ready scripts from your documents
Running on Zero

239

Ultrapixel-demo

😻

239

Ultra-high resolution image synthesis
PleIAs/OCRonos-Vintage

Text Generation • 0.1B • Updated Aug 8, 2024 • 217 • 81
Running on Zero

275

EzAudio

🟣

275

Generate and edit realistic audio from text prompts
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4, 2025 • 129k • 1.53k
Running on CPU Upgrade

988

Open VLM Leaderboard

🌎

988

VLMEvalKit Evaluation Results Collection
Build error

64

ArxivCopilot

🏢

64

Generate personalized research profiles and chat with Arxiv Copilot
gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4, 2024 • 437
mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 8 • 382
ICTNLP/Llama-3.1-8B-Omni

9B • Updated Nov 14, 2024 • 160 • 418
fishaudio/fish-speech-1.4

Text-to-Speech • Updated Nov 5, 2024 • 541 • 454
bartowski/Reflection-Llama-3.1-70B-GGUF

Text Generation • 71B • Updated Sep 7, 2024 • 494 • 53
lelapa/InkubaLM-0.4B

Text Generation • Updated Sep 5, 2024 • 124 • 57
Running

144

Qwen 2.5 Code Interpreter

🐍

144

Run code snippets and get instant results
Runtime error

311

Virtual Try On

👕

311

High-fidelity Virtual Try-on
Runtime error

36

Ferret Demo

📚

36

Describe image contents with prompts
Running on L4

62

ColPali 🤝 Vespa - Visual Retrieval

👀

62

Visual Retrieval with ColPali and Vespa
oxyapi/oxy-1-small

Text Generation • 15B • Updated Apr 30, 2025 • 1.2k • • 84
QuantFactory/MN-Chunky-Lotus-12B-GGUF

12B • Updated Dec 4, 2024 • 49 • 4
Running

25

ScholarCopilot

📊

25

Using RAG LLM to assist your academic writing
Running on Zero

609

Leffa

👗

609

Generate new person images with swapped clothes or poses
Lightricks/LTX-Video

Image-to-Video • Updated Jul 16, 2025 • 235k • • 2.11k

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs