GLM-5.2 / .eval_results /swe-bench_pro.yaml
ZHANGYUXUAN-zR's picture
Add community evaluation results for DEEP-SWE, GPQA, HLE, SWE-BENCH_PRO (#12)
e32aaf0
Raw
History Blame Contribute Delete
161 Bytes
- dataset:
id: ScaleAI/SWE-bench_Pro
task_id: SWE_Bench_Pro
value: 62.1
source:
url: https://huggingface.co/zai-org/GLM-5.2
name: Model Card