Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 21 days ago • 14
Retrofitting (Large) Language Models with Dynamic Tokenization Paper • 2411.18553 • Published Nov 27, 2024 • 2
Cross-Tokenizer Distillation via Approximate Likelihood Matching Paper • 2503.20083 • Published Mar 25, 2025 • 1