MLM vs CLM Collection Research material on research about pre-training encoders, with extensive comparison on masked language modeling paradigm vs causal langage modeling. • 5 items • Updated Dec 1, 2025
MLM versus CLM for NLP tasks Collection Related paper: "Should We Still Pretrain Encoders with Masked Language Modeling?" • 1 item • Updated Sep 11, 2025
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11, 2025 • 1
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11, 2025 • 1
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1, 2025 • 80
Should We Still Pretrain Encoders with Masked Language Modeling? Paper • 2507.00994 • Published Jul 1, 2025 • 80
Abstention Reranking Collection Related paper: "Towards Trustworthy Reranking: A Simple yet Effective Abstention Mechanism" (accepted at TMLR 2024) • 3 items • Updated Apr 10, 2025
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11, 2025 • 1
MMTEB: Massive Multilingual Text Embedding Benchmark Paper • 2502.13595 • Published Feb 19, 2025 • 43
EuroBERT Encoding model Collection Suite of models for improved integration into RAG (for information retrieval), designed for ease-of-use and practicability in industrial context • 5 items • Updated Sep 11, 2025 • 1