One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling Paper • 2601.03111 • Published Jan 6 • 10
MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning Paper • 2507.16812 • Published Jul 22, 2025 • 63
OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025
OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_91_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025
OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_general_ins_89_10_1_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025
OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_91_bs4M_seq8k_20B Text Generation • Updated Jul 7, 2025