Bitnet.cpp: Efficient Edge Inference for Ternary LLMs Paper • 2502.11880 • Published Feb 17, 2025 • 5
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models Paper • 2506.15681 • Published Jun 18, 2025 • 41