VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs Paper • 2511.20272 • Published 29 days ago • 1
VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs Paper • 2511.20272 • Published 29 days ago • 1
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision Paper • 2512.01342 • Published 24 days ago • 14
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper • 2403.15377 • Published Mar 22, 2024 • 27
TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning Paper • 2410.19702 • Published Oct 25, 2024 • 1
Make Your Training Flexible: Towards Deployment-Efficient Video Models Paper • 2503.14237 • Published Mar 18 • 5
ExpVid: A Benchmark for Experiment Video Understanding & Reasoning Paper • 2510.11606 • Published Oct 13 • 4
ExpVid: A Benchmark for Experiment Video Understanding & Reasoning Paper • 2510.11606 • Published Oct 13 • 4