GLM-5.2 / .eval_results / gpqa.yaml
Add community evaluation results for DEEP-SWE, GPQA, HLE, SWE-BENCH_PRO (#12)
e32aaf0
149 Bytes
- dataset:
    id: Idavidrein/gpqa
    task_id: diamond
  value: 91.2
  source:
    url: https://huggingface.co/zai-org/GLM-5.2
    name: Model Card