Henry Zhang's picture

4

Henry Zhang

Henry0709

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models

upvoted a paper 19 days ago

Reward Inside the Model: A Lightweight Hidden-State Reward Model for LLM's Best-of-N sampling

upvoted a paper 4 months ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

View all activity

Organizations

None yet

models 1

Henry0709/llama-3.1-8b-dpo-checkpoint-200

Text Generation • Updated Sep 17, 2025

datasets 0

None public yet