CohenQu/Instruct-POPE-iter1-step280-POPE-hard-first_guide-no_guide-iter2 4B • Updated Nov 10, 2025 • 42
CohenQu/Qwen2.5-3B-Instruct_Continue_vs_Terminate.05.00 Text Generation • 3B • Updated Aug 14, 2025 • 1
CohenQu/sft_Qwen3-1.7B_Continue_vs_Terminate.05.00_orchard Text Generation • 2B • Updated Jul 29, 2025 • 1
CohenQu/sft_Qwen3-1.7B_Continue_vs_Terminate.05.01_orchard Text Generation • 2B • Updated Jul 29, 2025 • 1
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_0713 2B • Updated Jul 14, 2025 • 2
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_0710 2B • Updated Jul 12, 2025 • 2
CohenQu/Joint-Train-deepscalar_RL_hard_500_verl_0.35_0.001_0.001_32_32_20k_4_new 2B • Updated Jun 28, 2025 • 3