Nemotron-Cascade • Collection • Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated 1 day ago • 40
NeMo Gym • Collection • Collection of RL verifiable data for NeMo Gym • 13 items • Updated 10 days ago • 32
Qwen2.5-Coder • Collection • Code-specific model series based on Qwen2.5 • 40 items • Updated 2 days ago • 350
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions • Paper • 2406.15877 • Published Jun 22, 2024 • 48
Awesome SFT datasets • Collection • A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 147
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning • Paper • 2311.11077 • Published Nov 18, 2023 • 29