Step: 480 Source: outputs/GRPO/qwen25vl_7b_g60_l600_json-format-args_re-args-curr_web_verb_p_acc-rt_tsp100_bs16_rl-v1-1103_adaptive_kl0.005_lr1e-6/global_step_480/actor/huggingface Created: 2025-11-12T22:13:15+00:00 Description: Huggingface checkpoint from global step 480