YuLan-Mini Resources Collection Pre-Training & post-training resources for YuLan-Mini • 29 items • Updated May 7 • 3
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14 • 28
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14 • 28
Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models Paper • 2508.10751 • Published Aug 14 • 28 • 2
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Paper • 2503.21380 • Published Mar 27 • 38
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Paper • 2503.21380 • Published Mar 27 • 38 • 4
Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models Paper • 2503.21380 • Published Mar 27 • 38
JiuZhang 2.0: A Unified Chinese Pre-trained Language Model for Multi-task Mathematical Problem Solving Paper • 2306.11027 • Published Jun 19, 2023
Imitate, Explore, and Self-Improve: A Reproduction Report on Slow-thinking Reasoning Systems Paper • 2412.09413 • Published Dec 12, 2024 • 1
Technical Report: Enhancing LLM Reasoning with Reward-guided Tree Search Paper • 2411.11694 • Published Nov 18, 2024
Towards Effective and Efficient Continual Pre-training of Large Language Models Paper • 2407.18743 • Published Jul 26, 2024
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models Paper • 2405.14365 • Published May 23, 2024
Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint Paper • 2401.06081 • Published Jan 11, 2024 • 1