Towards Scalable Pre-training of Visual Tokenizers for Generation
-
MiniMaxAI/VTP-Small-f16d64
Image Feature Extraction • 0.2B • Updated • 2.61k • 8 -
MiniMaxAI/VTP-Base-f16d64
Image Feature Extraction • 0.3B • Updated • 2.84k • 14 -
MiniMaxAI/VTP-Large-f16d64
Image Feature Extraction • 0.7B • Updated • 2.96k • 10 -
Towards Scalable Pre-training of Visual Tokenizers for Generation
Paper • 2512.13687 • Published • 81