TAUR-dev/M-9_2_25__yolo_run-sft
2B
•
Updated
•
6
TAUR-dev/M-0827_rl_reflect_countdown__test-rl
0.6B
•
Updated
•
3
TAUR-dev/M-jack_experiments__all_stages_tacc2-rl
0.6B
•
Updated
•
5
TAUR-dev/M-jack_experiments__all_stages_tacc-rl
0.6B
•
Updated
•
16
TAUR-dev/M-0827_rl_reflect_countdown__0epoch_3and4args__grpo_minibs32_lr1e-6_rolloutn16-rl
2B
•
Updated
•
3
TAUR-dev/M-0827_rl_reflect_countdown__0epoch_4args__grpo_minibs32_lr1e-6_rolloutn16-rl
2B
•
Updated
•
4
TAUR-dev/M-0827_rl_reflect_countdown__2epoch_3args__grpo_minibs32_lr1e-6_rolloutn16-rl
2B
•
Updated
•
4
TAUR-dev/M-0827_rl_reflect_countdown__0epoch_3args__grpo_minibs32_lr1e-6_rolloutn16-rl
2B
•
Updated
•
4
TAUR-dev/M-0827_rl_reflect_countdown__4epoch_4args__grpo_minibs32_lr1e-6_rolloutn16-rl
2B
•
Updated
•
6
TAUR-dev/M-0827_rl_reflect_countdown__2epoch_3and4args__grpo_minibs32_lr1e-6_rolloutn16-rl
2B
•
Updated
•
3
TAUR-dev/M-reflection_countdown_4args_sft_4epochs-sft
2B
•
Updated
•
4
TAUR-dev/M-reflection_countdown_3args_sft_2epochs-sft
2B
•
Updated
•
5
TAUR-dev/M-reflection_countdown_3args_4args_sft_2epochs-sft
2B
•
Updated
•
5
TAUR-dev/M-reflection_countdown_4args_sft_1epoch-sft
2B
•
Updated
•
5
TAUR-dev/M-reflection_countdown_3args_sft_1epoch-sft
2B
•
Updated
•
5