A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation
AI & ML interests
None defined yet.
Recent Activity
datasets
7
JavisVerse/JavisUnd-Eval
Updated
•
16
JavisVerse/MM-PreTrain
Viewer
•
Updated
•
273k
•
20
JavisVerse/JavisInst-Omni
Viewer
•
Updated
•
91.4k
•
12
JavisVerse/AV-FineTune
Viewer
•
Updated
•
1.8M
•
12
JavisVerse/JavisBench
Viewer
•
Updated
•
22.3k
•
63
JavisVerse/JavisData-audios
Viewer
•
Updated
•
788k
•
37
JavisVerse/TAVGBench_clean
Viewer
•
Updated
•
1.58M
•
16