1 5

Changhao

lichangh20

https://lichangh20.github.io/

lichangh20

AI & ML interests

RL, Agent, Efficient ML

Recent Activity

updated a model about 11 hours ago

lichangh20/Qwen3-4B-Instruct_sft_2epoch_data_analysis

published a model about 11 hours ago

lichangh20/Qwen3-4B-Instruct_sft_2epoch_data_analysis

upvoted an article about 2 months ago

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

View all activity

Organizations

Papers 4

models 3

datasets 15

lichangh20/s1K_initial_filtered_for_llama8b

Viewer • Updated May 2, 2025 • 1k • 3

lichangh20/olympiadbench

Viewer • Updated Apr 22, 2025 • 674 • 102

lichangh20/minervamath

Viewer • Updated Apr 22, 2025 • 272 • 6

lichangh20/s1K_simplified_filtered_for_adapter

Viewer • Updated Mar 24, 2025 • 927 • 6

lichangh20/s1K_initial_filtered_for_qwen7b_simplified_summarized

Viewer • Updated Mar 15, 2025 • 997 • 4

lichangh20/s1K_initial_filtered_for_qwen7b_summarized

Viewer • Updated Mar 12, 2025 • 997 • 5

lichangh20/s1K_filtered_for_qwen7b_sft

Viewer • Updated Mar 11, 2025 • 899 • 6

lichangh20/s1k_eval_sampled_1of12

Viewer • Updated Mar 9, 2025 • 77 • 4

lichangh20/s1k_train_sampled_1of12

Viewer • Updated Mar 9, 2025 • 77 • 6

lichangh20/gpqa_sampled_1of3

Viewer • Updated Mar 9, 2025 • 66 • 2

View 15 datasets

Changhao

AI & ML interests

Recent Activity

Organizations

Papers 4

models 3 Sort: Recently updated

datasets 15 Sort: Recently updated

models 3

datasets 15