ginipick

ginipick

·

AI & ML interests

None yet

Recent Activity

liked a model about 10 hours ago

FINAL-Bench/metacog-adapter-JGOS-31B-Citizen

liked a Space 1 day ago

reacted to ginigen-ai's post with 🔥 1 day ago

🧠 Does your LLM know when it's about to be wrong? Most leaderboards measure accuracy. We measure metacognition — whether a model catches its own errors. Benchmark + leaderboard + adapters, all open. 🎉 The surprise: even a K-AI #1 model (JGOS-31B-Citizen) is the strongest on multiple-choice traps (trap_rate 0.005 — ~2 misses in 400) yet blind to its own free-form mistakes (self-confidence AUROC = 0.5, pure random). A tiny base-frozen adapter recovers that signal. Two independent axes (never compared across a row): ① trap_rate — does it fall for tempting trap options? (lower = stronger) ② adapter gain Δ — how much a lightweight adapter catches errors the model itself misses. (higher = more adapter value) What's open: 📊 300+100 trap problems (each with a hidden trap + TICOS type) 🏆 24-model leaderboard 🧩 11 per-model adapters — adapters, NOT fine-tunes (base stays frozen; the adapter just reads the hidden state → P(wrong)) Submit any HF model → auto-scored daily at 09:00 KST and added to the board. 🏆 Leaderboard → https://huggingface.co/spaces/ginigen-ai/Metacognition-Leaderboard-Space 📊 Benchmark → https://huggingface.co/datasets/ginigen-ai/Metacognition-Bench 🧩 Adapters → https://huggingface.co/collections/FINAL-Bench/metacognition-adapters-6a42c032e6beb803dd032961 📊 Article → https://huggingface.co/blog/ginigen-ai/metacognition Benchmark by ginigen-ai · Adapters by FINAL-Bench (Darwin/Chimera platform + AETHER metacognition tech).

View all activity

Organizations

ginipick 's models 11

ginipick/Qwen-Image-Edit-Rapid-AIO

Text-to-Image • Updated Nov 2, 2025 • 1

ginipick/GLM-4.6

Text Generation • 357B • Updated Nov 2, 2025 • 4

ginipick/neutts-air

Text-to-Speech • 0.7B • Updated Nov 2, 2025 • 7 • 1

ginipick/MiniMax-M2

Text Generation • 229B • Updated Nov 2, 2025 • 3

ginipick/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated Nov 2, 2025 • 3

ginipick/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated Nov 2, 2025 • 4

ginipick/Gemma-3-R1984-4B

Image-Text-to-Text • 4B • Updated Apr 22, 2025 • 5 • • 52

ginipick/QwQ-32B-NF4

Text Generation • 33B • Updated Mar 21, 2025 • 3 • 30

ginipick/wan-lora-cat

Text-to-Video • Updated Mar 16, 2025 • 3

ginipick/c-bag

Updated Mar 13, 2025

ginipick/flux-lora-eric-cat

Text-to-Image • Updated Dec 2, 2024 • 35 • • 230