view article Article Featuring Every Eval Ever Results on Hugging Face Model Pages +5 deepmage121, evijit, SaylorTwift, janbatzner, borgr, irenesolaiman, julien-c • 1 day ago • 31
Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results Paper • 2606.14516 • Published 19 days ago • 3
Evaluation Cards: An Interpretive Layer for AI Evaluation Reporting Paper • 2606.09809 • Published 23 days ago • 4
view article Article Introducing Evaluation Cards: A Live Interpretive Layer for Understanding the AI Evaluations Ecosystem evaleval • 20 days ago • 1
LLM-42: Enabling Determinism in LLM Inference with Verified Speculation Paper • 2601.17768 • Published Jan 30 • 1
The Silent Hyperparameter: Quantifying the Impact of Inference Backends on LLM Reproducibility Paper • 2605.19537 • Published May 20 • 1
view article Article LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning VirgileBatto • May 21 • 63
view article Article Harness, Scaffold, and the AI Agent Terms Worth Getting Right sergiopaniego, ariG23498 • May 25 • 125
view article Article AI and the Future of Cybersecurity: Why Openness Matters +1 meg, yjernite, clem • Apr 21 • 42
view article Article 🏟️ Smol AI WorldCup: A 5-Axis Benchmark That Reveals What Small Language Models Can Really Do FINAL-Bench • Mar 10 • 38
view article Article Did GPT 5.2 make a breakthrough discovery in theoretical physics? dlouapre • Feb 19 • 63
view article Article GGML and llama.cpp join HF to ensure the long-term progress of Local AI +4 ggerganov, ngxson, allozaur, lysandre, victor, julien-c • Feb 20 • 507
view article Article Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek huggingface • Jan 27 • 45