ModernVBERT: Towards Smaller Visual Document Retrievers Paper • 2510.01149 • Published Oct 1, 2025 • 30
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1, 2025 • 80
ViDoRe Benchmark V2: Raising the Bar for Visual Retrieval Paper • 2505.17166 • Published May 22, 2025
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published Mar 7, 2025 • 80
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 43
EuroLLM: Multilingual Language Models for Europe Paper • 2409.16235 • Published Sep 24, 2024 • 29
ColPali: Efficient Document Retrieval with Vision Language Models Paper • 2407.01449 • Published Jun 27, 2024 • 50
Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism Paper • 2402.12997 • Published Feb 20, 2024 • 9
CroissantLLM: A Truly Bilingual French-English Language Model Paper • 2402.00786 • Published Feb 1, 2024 • 26
Revisiting Instruction Fine-tuned Model Evaluation to Guide Industrial Applications Paper • 2310.14103 • Published Oct 21, 2023 • 1