Gongyao Jiang's picture

2

Gongyao Jiang

yaozz

AI & ML interests

None yet

Recent Activity

updated a model 2 months ago

yaozz/Chart-Answer-Selector

published a model 2 months ago

yaozz/Chart-Answer-Selector

upvoted an article 3 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

View all activity

Organizations

None yet

updated a model 2 months ago

yaozz/Chart-Answer-Selector

8B • Updated Oct 31, 2025 • 4

published a model 2 months ago

yaozz/Chart-Answer-Selector

8B • Updated Oct 31, 2025 • 4

upvoted an article 3 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

267

upvoted a collection almost 2 years ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 2 days ago • 211