Hu

Alexhu1999

AI & ML interests

None yet

Recent Activity

upvoted a paper 18 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

upvoted a paper 18 days ago

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

published a dataset 3 months ago

Alexhu1999/qwen3_embedings

View all activity

Organizations

upvoted 2 papers 18 days ago

Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs

Paper • 2601.08763 • Published 22 days ago • 146

Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning

Paper • 2601.09667 • Published 21 days ago • 86

published a dataset 3 months ago

Alexhu1999/qwen3_embedings

Updated Nov 2, 2025

updated a Space 4 months ago

Trackio

🚀

Track and visualize project metrics

published a Space 4 months ago

Trackio

🚀

Track and visualize project metrics

updated a model 5 months ago

Alexhu1999/cerebras_kto_iseuiuc

Updated Sep 21, 2025

published a model 5 months ago

Alexhu1999/cerebras_kto_iseuiuc

Updated Sep 21, 2025

updated a dataset 5 months ago

Alexhu1999/maicrl

Updated Sep 19, 2025

published a dataset 5 months ago

Alexhu1999/maicrl

Updated Sep 19, 2025

liked 5 datasets 5 months ago

updated a model 5 months ago

Alexhu1999/lfm2_vl

1B • Updated Sep 1, 2025 • 1

liked a model 6 months ago

NexaAI/OmniNeural-4B

Any-to-Any • Updated Nov 7, 2025 • 26 • 161

updated 2 models 6 months ago

Alexhu1999/Qwen3-4B-GSPO-email-retriever

4B • Updated Aug 15, 2025

Alexhu1999/Qwen3-4B-GSPO-email-retriever-120steps

4B • Updated Aug 14, 2025

published a model 6 months ago

Alexhu1999/Qwen3-4B-GSPO-email-retriever-120steps

4B • Updated Aug 14, 2025

updated a model 6 months ago

Alexhu1999/Qwen3-4B-DAPO-email-retriever

4B • Updated Aug 13, 2025

Hu

AI & ML interests

Recent Activity

Organizations

Alexhu1999's activity

Trackio

Trackio