AI & ML interests
None yet
Organizations
None yet
LuyiCui/slow_fast_reason-sft-s1k-1.1_full
Text Generation
•
8B
•
Updated
•
10
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-SAPO
2B
•
Updated
•
5
LuyiCui/sft-amc_aime-R1-Distill-Qwen-1.5B
2B
•
Updated
•
14
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
Updated
LuyiCui/Qwen2.5-1.5B-Instruct-CEPO
Text Generation
•
2B
•
Updated
•
8
LuyiCui/Qwen2.5-Math-1.5B-GRPO
Updated
LuyiCui/Qwen2.5-1.5B-GRPO
Updated
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-123
Text Generation
•
2B
•
Updated
•
9
LuyiCui/Qwen2.5-1.5B-Open-R1-GRPO
Text Generation
•
2B
•
Updated
•
15
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-3
Updated
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-2-2
Text Generation
•
2B
•
Updated
•
11
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-2
Text Generation
•
2B
•
Updated
•
12
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO-1
Text Generation
•
2B
•
Updated
•
10
Feature Extraction
•
3B
•
Updated
•
11
Feature Extraction
•
2B
•
Updated
•
11
Text Generation
•
0.5B
•
Updated
•
13
LuyiCui/Qwen2.5-1.5B-Open-R1-Distill
Updated
Feature Extraction
•
2B
•
Updated
•
8
Feature Extraction
•
0.5B
•
Updated
•
11
LuyiCui/DeepSeek-R1-Distill-Qwen-1.5B-DPO
Text Generation
•
2B
•
Updated
•
10
•
1