mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromFirstModel_weighAccMore Updated 9 days ago
mehuldamani/qwen3_8b_ambigQA_rlcr_multiple_newAccReward_standardPrompt_initFromBase_weighAccMore Updated 9 days ago
mehuldamani/sft-base-half-tranches-v1-global-step-394 Text Classification • 8B • Updated 25 days ago • 19