blanchefort's Collections
VLMs
Updated Feb 14, 2025
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Feb 6, 2025 • 1.14M • 1.25k
NVEagle/Eagle-X5-13B-Chat • Image-Text-to-Text • 15B • Updated Sep 16, 2024 • 33 • 28
internlm/internlm-xcomposer2d5-7b • Visual Question Answering • Updated Jul 22, 2024 • 541 • 209
AIRI-Institute/OmniFusion • Updated Apr 10, 2024 • 59
OpenGVLab/InternVideo2_chat_8B_HD • Video-Text-to-Text • 8B • Updated Dec 18, 2024 • 171 • 18
OpenGVLab/InternVideo2-Chat-8B • Video-Text-to-Text • 8B • Updated Oct 10, 2024 • 541 • 23
zai-org/cogvlm2-video-llama3-chat • Text Generation • 13B • Updated Jul 24, 2024 • 173 • 52
nyu-visionx/cambrian-34b • Text Generation • 35B • Updated Jun 28, 2024 • 10 • 27
zai-org/cogvlm-base-490-hf • Text Generation • 18B • Updated Nov 20, 2023 • 65 • 7
zai-org/cogvlm-chat-hf • Text Generation • 18B • Updated Dec 19, 2023 • 1.05k • 198
zai-org/cogvlm-grounding-generalist-hf • Text Generation • 18B • Updated Dec 11, 2023 • 135 • 16
Qwen/Qwen-VL • Text Generation • Updated Jan 25, 2024 • 18.3k • 273
liuhaotian/llava-v1.5-7b • Image-Text-to-Text • Updated May 8, 2024 • 124k • 526
LanguageBind/MoE-LLaVA-Phi2-2.7B-4e-384 • Text Generation • 6B • Updated Feb 1, 2024 • 43 • 32
LanguageBind/Video-LLaVA-7B-hf • Image-to-Text • 7B • Updated May 16, 2024 • 7.63k • 46
openvla/openvla-7b-prismatic • Image-Text-to-Text • Updated Jul 9, 2024 • 48 • 6
openvla/openvla-7b-finetuned-libero-object • Image-Text-to-Text • 8B • Updated Oct 9, 2024 • 896 • 1
openvla/openvla-7b-finetuned-libero-10 • Image-Text-to-Text • 8B • Updated Oct 9, 2024 • 1.16k • 4
IntelLabs/LlavaOLMoBitnet1B • Updated Aug 30, 2024 • 39 • 29
mistral-community/pixtral-12b-240910 • Image-Text-to-Text • Updated Oct 1, 2024 • 22 • 382
LanguageBind/MoE-LLaVA-StableLM-1.6B-4e • Text Generation • 3B • Updated Feb 1, 2024 • 75 • 8
llava-hf/LLaVA-NeXT-Video-7B-hf • Video-Text-to-Text • 7B • Updated Nov 11, 2025 • 54.7k • 121
Qwen/Qwen-VL-Chat • Text Generation • Updated Jan 25, 2024 • 34.1k • 380
LanguageBind/Video-LLaVA-7B • Text Generation • 7B • Updated Apr 9, 2024 • 1.67k • 88
LanguageBind/LanguageBind_Image • Zero-Shot Image Classification • Updated Feb 1, 2024 • 11.2k • 11
LanguageBind/LanguageBind_Video • Zero-Shot Image Classification • Updated Feb 1, 2024 • 337 • 3
llava-hf/llava-1.5-13b-hf • Image-Text-to-Text • 13B • Updated Jan 27, 2025 • 12.1k • 33
llava-hf/llava-1.5-7b-hf • Image-Text-to-Text • 7B • Updated Jun 6, 2025 • 897k • 327
FreedomIntelligence/LongLLaVA-53B-A13B • Image-Text-to-Text • 52B • Updated Nov 28, 2024 • 63 • 20
meta-llama/Llama-3.2-11B-Vision • Image-Text-to-Text • 11B • Updated Sep 27, 2024 • 9.15k • 578
BAAI/Emu3-VisionTokenizer • Feature Extraction • 0.3B • Updated Oct 8, 2024 • 2.61k • 61
openbmb/MiniCPM-V-2_6 • Image-Text-to-Text • 8B • Updated Jun 13, 2025 • 73.2k • 1.02k
openbmb/MiniCPM-V • Visual Question Answering • 3B • Updated Jan 15, 2025 • 507 • 192
openbmb/MiniCPM-V-2 • Visual Question Answering • 3B • Updated Jan 15, 2025 • 61k • 483
openbmb/MiniCPM-Llama3-V-2_5 • Image-Text-to-Text • 9B • Updated Jan 15, 2025 • 45.4k • 1.41k
nvidia/NVLM-D-72B • Image-Text-to-Text • 79B • Updated Jan 14, 2025 • 108k • 775
vikhyatk/moondream2 • Image-Text-to-Text • 2B • Updated Sep 23, 2025 • 2.9M • 1.36k
allenai/Molmo-72B-0924 • Image-Text-to-Text • 73B • Updated Oct 9, 2025 • 526 • 295
allenai/MolmoE-1B-0924 • Image-Text-to-Text • Updated Apr 24, 2025 • 1.39k • 156
allenai/Molmo-7B-D-0924 • Image-Text-to-Text • 8B • Updated 21 days ago • 15.3k • 559
allenai/Molmo-7B-O-0924 • Image-Text-to-Text • 8B • Updated Oct 9, 2025 • 848 • 162
deepseek-ai/Janus-1.3B • Any-to-Any • 2B • Updated Jan 27, 2025 • 3.54k • 592
neulab/Pangea-7B • 8B • Updated Oct 24, 2024 • 12.3k • 131
neulab/Pangea-7B-hf • 8B • Updated Oct 28, 2025 • 226 • 13
BAAI/Aquila-VL-2B-llava-qwen • Visual Question Answering • 2B • Updated Nov 25, 2024 • 297 • 61
mistralai/Pixtral-Large-Instruct-2411 • Updated Jul 28, 2025 • 59 • 430
google/paligemma2-10b-pt-224 • Image-Text-to-Text • 10B • Updated Dec 5, 2024 • 1.31k • 8
google/paligemma2-3b-pt-224 • Image-Text-to-Text • 3B • Updated Dec 5, 2024 • 64.5k • 161
vidore/colqwen2-v1.0 • Visual Document Retrieval • Updated Jun 5, 2025 • 64.9k • 116
deepseek-ai/Janus-Pro-7B • Any-to-Any • Updated Feb 1, 2025 • 53.1k • 3.55k
deepseek-ai/Janus-Pro-1B • Any-to-Any • Updated Feb 1, 2025 • 6.78k • 466
nvidia/Eagle2-9B • Image-Text-to-Text • 9B • Updated Jan 28, 2025 • 76 • 62
openbmb/MiniCPM-o-2_6 • Any-to-Any • 9B • Updated Oct 5, 2025 • 79.3k • 1.28k
DAMO-NLP-SG/VideoLLaMA3-7B • Video-Text-to-Text • 8B • Updated Sep 2, 2025 • 84k • 71
DAMO-NLP-SG/VideoLLaMA3-2B • Video-Text-to-Text • 2B • Updated Sep 3, 2025 • 2.64k • 16
AIDC-AI/Ovis2-8B • Image-Text-to-Text • 9B • Updated Aug 15, 2025 • 949 • 75