Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
4
Henry Zhang
Henry0709
Follow
Aster2024's profile picture
1 follower
·
0 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
19 days ago
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
upvoted
a
paper
19 days ago
Reward Inside the Model: A Lightweight Hidden-State Reward Model for LLM's Best-of-N sampling
upvoted
a
paper
4 months ago
EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning
View all activity
Organizations
None yet
models
1
Henry0709/llama-3.1-8b-dpo-checkpoint-200
Text Generation
•
Updated
Sep 17, 2025
datasets
0
None public yet