Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-16-512 Video-Text-to-Text • 4B • Updated Jun 12, 2025 • 18 • 1
Zhang199/TinyLLaVA-Video-Qwen2.5-3B-Group-1fps-512 Video-Text-to-Text • 4B • Updated Apr 24, 2025 • 29
TinyLLaVA-Video: A Simple Framework of Small-scale Large Multimodal Models for Video Understanding Paper • 2501.15513 • Published Jan 26, 2025