-
oceanpty/TOA-Ultrafeedback-SFT-Rand-lla3.1-8b-inst
Viewer • Updated • 59.9k • 25 -
oceanpty/TOA-Ultrafeedback-SFT-Rand-qwen2-7b-inst
Viewer • Updated • 59.9k • 35 -
oceanpty/TOA-Ultrafeedback-SFT-PRS-lla3.1-8b-inst
Viewer • Updated • 59.9k • 27 -
oceanpty/TOA-Ultrafeedback-SFT-PRS-qwen2-7b-inst
Viewer • Updated • 59.9k • 30
Hai Ye
oceanpty
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
2 days ago
Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding
authored
a paper
about 1 month ago
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
Organizations
None yet