RyanYr (Yurun Yuan)

Collections 2

models 27

RyanYr/grpo_neg0.001-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

Updated Jun 30

RyanYr/grpo_neg0.01-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

Updated Jun 30

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1350-ab3ac3de_actor

Updated Jun 29 • 1

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_1350-ab3ac3de

Text Generation • 2B • Updated Jun 28 • 5

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

Updated Jun 28 • 1

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_1230-975b46d

Text Generation • 2B • Updated Jun 28 • 8

View 27 models

datasets 1,069

RyanYr/Qwen3-4B-Instruct-2507-has_test-1_42_dev

Updated Oct 15 • 3

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_45

Updated Oct 15 • 2

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_44

Updated Oct 15 • 2

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_43

Updated Oct 15 • 3

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_42

Updated Oct 15 • 2

RyanYr/Qwen2.5-7B-It-MathEval-K16-temp1.0_agg

Viewer • Updated Sep 23 • 1.55k • 6

RyanYr/Llama-3.1-8B-It-MathEval-K16-temp1.0-agg

Viewer • Updated Sep 23 • 1.55k • 6

RyanYr/Qwen2.5-7B-It-mmlupro-K16-temp1.0-agg

Viewer • Updated Sep 23 • 12k • 13

RyanYr/Llama-3.1-8B-It-mmlupro-K16-temp1.0-rm

Viewer • Updated Sep 11 • 12k • 16

RyanYr/Llama-3.1-8B-It-mmlu-K16-temp1.0-rm

Viewer • Updated Sep 10 • 14k • 15

View 1,069 datasets

Yurun Yuan

AI & ML interests

Organizations

Collections 2

peiyi9979/Math-Shepherd

Idavidrein/gpqa

AI-MO/NuminaMath-CoT

Magpie-Align/Magpie-Reasoning-V1-150K

openbmb/UltraInteract_sft

peiyi9979/Math-Shepherd

Idavidrein/gpqa

AI-MO/NuminaMath-CoT

Magpie-Align/Magpie-Reasoning-V1-150K

openbmb/UltraInteract_sft

models 27

RyanYr/grpo_neg0.5-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

RyanYr/grpo_neg0.1-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

RyanYr/brm-dapo-qwen2.5math-7B-base-lr2.5e-6-bs512-mbs8192-beta0.002-n16

RyanYr/brm-dapo-qwen2.5math-7B-base-lr2.5e-6-bs512-beta0.002-n16

RyanYr/grpo_neg0.001-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

RyanYr/grpo_neg0.01-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1350-ab3ac3de_actor

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_1350-ab3ac3de

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4-ref1230-975b46d_actor

RyanYr/grpo-aime24-qwen2.5math-1.5B-base-mbs128-n4_actor_1230-975b46d

datasets 1,069

RyanYr/Qwen3-4B-Instruct-2507-has_test-1_42_dev

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_45

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_44

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_43

RyanYr/Qwen3-4B-Instruct-2507-has_test-256_42

RyanYr/Qwen2.5-7B-It-MathEval-K16-temp1.0_agg

RyanYr/Llama-3.1-8B-It-MathEval-K16-temp1.0-agg

RyanYr/Qwen2.5-7B-It-mmlupro-K16-temp1.0-agg

RyanYr/Llama-3.1-8B-It-mmlupro-K16-temp1.0-rm

RyanYr/Llama-3.1-8B-It-mmlu-K16-temp1.0-rm

Yurun Yuan

AI & ML interests

Organizations

Collections 2

models 27 Sort: Recently updated

datasets 1,069 Sort: Recently updated

models 27

datasets 1,069