VLMs - a blanchefort Collection

VLMs

updated Feb 14, 2025

Qwen/Qwen2-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Feb 6, 2025 • 1.14M • 1.25k
NVEagle/Eagle-X5-13B-Chat

Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 33 • 28
internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 541 • 209
AIRI-Institute/OmniFusion

Updated Apr 10, 2024 • 59
OpenGVLab/InternVideo2_chat_8B_HD

Video-Text-to-Text • 8B • Updated Dec 18, 2024 • 171 • 18
OpenGVLab/InternVideo2-Chat-8B

Video-Text-to-Text • 8B • Updated Oct 10, 2024 • 541 • 23
zai-org/cogvlm2-video-llama3-chat

Text Generation • 13B • Updated Jul 24, 2024 • 173 • 52
nyu-visionx/cambrian-34b

Text Generation • 35B • Updated Jun 28, 2024 • 10 • 27
zai-org/cogvlm-base-490-hf

Text Generation • 18B • Updated Nov 20, 2023 • 65 • 7
zai-org/cogvlm-chat-hf

Text Generation • 18B • Updated Dec 19, 2023 • 1.05k • 198
zai-org/cogvlm-grounding-generalist-hf

Text Generation • 18B • Updated Dec 11, 2023 • 135 • 16
Qwen/Qwen-VL

Text Generation • Updated Jan 25, 2024 • 18.3k • 273
liuhaotian/llava-v1.5-7b

Image-Text-to-Text • Updated May 8, 2024 • 124k • 526
LanguageBind/MoE-LLaVA-Phi2-2.7B-4e-384

Text Generation • 6B • Updated Feb 1, 2024 • 43 • 32
LanguageBind/Video-LLaVA-7B-hf

Image-to-Text • 7B • Updated May 16, 2024 • 7.63k • 46
openvla/openvla-7b-prismatic

Image-Text-to-Text • Updated Jul 9, 2024 • 48 • 6
openvla/openvla-7b-finetuned-libero-object

Image-Text-to-Text • 8B • Updated Oct 9, 2024 • 896 • 1
openvla/openvla-7b-finetuned-libero-10

Image-Text-to-Text • 8B • Updated Oct 9, 2024 • 1.16k • 4
IntelLabs/LlavaOLMoBitnet1B

Updated Aug 30, 2024 • 39 • 29
mistral-community/pixtral-12b-240910

Image-Text-to-Text • Updated Oct 1, 2024 • 22 • 382
LanguageBind/MoE-LLaVA-StableLM-1.6B-4e

Text Generation • 3B • Updated Feb 1, 2024 • 75 • 8
llava-hf/LLaVA-NeXT-Video-7B-hf

Video-Text-to-Text • 7B • Updated Nov 11, 2025 • 54.7k • 121
Qwen/Qwen-VL-Chat

Text Generation • Updated Jan 25, 2024 • 34.1k • 380
LanguageBind/Video-LLaVA-7B

Text Generation • 7B • Updated Apr 9, 2024 • 1.67k • 88
LanguageBind/LanguageBind_Image

Zero-Shot Image Classification • Updated Feb 1, 2024 • 11.2k • 11
LanguageBind/LanguageBind_Video

Zero-Shot Image Classification • Updated Feb 1, 2024 • 337 • 3
llava-hf/llava-1.5-13b-hf

Image-Text-to-Text • 13B • Updated Jan 27, 2025 • 12.1k • 33
llava-hf/llava-1.5-7b-hf

Image-Text-to-Text • 7B • Updated Jun 6, 2025 • 897k • 327
FreedomIntelligence/LongLLaVA-53B-A13B

Image-Text-to-Text • 52B • Updated Nov 28, 2024 • 63 • 20
meta-llama/Llama-3.2-11B-Vision

Image-Text-to-Text • 11B • Updated Sep 27, 2024 • 9.15k • 578
BAAI/Emu3-VisionTokenizer

Feature Extraction • 0.3B • Updated Oct 8, 2024 • 2.61k • 61
openbmb/MiniCPM-V-2_6

Image-Text-to-Text • 8B • Updated Jun 13, 2025 • 73.2k • 1.02k
openbmb/MiniCPM-V

Visual Question Answering • 3B • Updated Jan 15, 2025 • 507 • 192
openbmb/MiniCPM-V-2

Visual Question Answering • 3B • Updated Jan 15, 2025 • 61k • 483
openbmb/MiniCPM-Llama3-V-2_5

Image-Text-to-Text • 9B • Updated Jan 15, 2025 • 45.4k • 1.41k
nvidia/NVLM-D-72B

Image-Text-to-Text • 79B • Updated Jan 14, 2025 • 108k • 775
vikhyatk/moondream2

Image-Text-to-Text • 2B • Updated Sep 23, 2025 • 2.9M • 1.36k
allenai/Molmo-72B-0924

Image-Text-to-Text • 73B • Updated Oct 9, 2025 • 526 • 295
allenai/MolmoE-1B-0924

Image-Text-to-Text • Updated Apr 24, 2025 • 1.39k • 156
allenai/Molmo-7B-D-0924

Image-Text-to-Text • 8B • Updated 21 days ago • 15.3k • 559
allenai/Molmo-7B-O-0924

Image-Text-to-Text • 8B • Updated Oct 9, 2025 • 848 • 162
deepseek-ai/Janus-1.3B

Any-to-Any • 2B • Updated Jan 27, 2025 • 3.54k • 592
neulab/Pangea-7B

8B • Updated Oct 24, 2024 • 12.3k • 131
neulab/Pangea-7B-hf

8B • Updated Oct 28, 2025 • 226 • 13
BAAI/Aquila-VL-2B-llava-qwen

Visual Question Answering • 2B • Updated Nov 25, 2024 • 297 • 61
mistralai/Pixtral-Large-Instruct-2411

Updated Jul 28, 2025 • 59 • 430
google/paligemma2-10b-pt-224

Image-Text-to-Text • 10B • Updated Dec 5, 2024 • 1.31k • 8
google/paligemma2-3b-pt-224

Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 64.5k • 161
vidore/colqwen2-v1.0

Visual Document Retrieval • Updated Jun 5, 2025 • 64.9k • 116
deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 53.1k • 3.55k
deepseek-ai/Janus-Pro-1B

Any-to-Any • Updated Feb 1, 2025 • 6.78k • 466
nvidia/Eagle2-9B

Image-Text-to-Text • 9B • Updated Jan 28, 2025 • 76 • 62
openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Oct 5, 2025 • 79.3k • 1.28k
DAMO-NLP-SG/VideoLLaMA3-7B

Video-Text-to-Text • 8B • Updated Sep 2, 2025 • 84k • 71
DAMO-NLP-SG/VideoLLaMA3-2B

Video-Text-to-Text • 2B • Updated Sep 3, 2025 • 2.64k • 16
AIDC-AI/Ovis2-8B

Image-Text-to-Text • 9B • Updated Aug 15, 2025 • 949 • 75
