Qipeng Chen
lechatelierlenz
ยท
AI & ML interests
multimodal reasoning, dLLM
Recent Activity
upvoted
a
paper
about 11 hours ago
Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies
upvoted
a
paper
15 days ago
On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models