Amazing work! Any plans on releasing A3B / A4B MoE?

#12
by user09128495 - opened

Wow, the v2 version is awesome for its size! So far, no bad experiences with Q4, great job!
Any plans for MoE variants (e.g., 26B A4B or 35B A3B)?

Haha yeah, I'll get to a MoE one down the road! Fun fact, I actually already did a small MoE distill a while back: Mellum2 12B-A2.5B (https://huggingface.co/yuxinlu1/Mellum2-12B-A2.5B-Claude-4.6-4.8-Opus-Thinking-GGUF). Never ran any real benchmark on it, but it sure is fast 😄
