YAML Metadata Error:Invalid content in Eval Result file .eval_results/hle.yaml

Check out the documentation for more information.

Show details
Task ID "hle" does not match any task in dataset "cais/hle". Available: none

YAML Metadata Error:Invalid content in Eval Result file .eval_results/hle.yaml

Check out the documentation for more information.

Show details
Task ID "hle" does not match any task in dataset "cais/hle". Available: none
GLM-5.2 / .eval_results /hle.yaml
ZHANGYUXUAN-zR's picture
Add community evaluation results for DEEP-SWE, GPQA, HLE, SWE-BENCH_PRO (#12)
e32aaf0
Raw
History Blame Contribute Delete
298 Bytes
- dataset:
id: cais/hle
task_id: hle
value: 40.5
source:
url: https://huggingface.co/zai-org/GLM-5.2
name: Model Card
- dataset:
id: cais/hle
task_id: hle
value: 54.7
source:
url: https://huggingface.co/zai-org/GLM-5.2
name: Model Card
notes: "With tools"