Ollama Port

#13
by xentriom - opened

Hiii~
Congrats on v2! I have copied the model over to Ollama to make it easier for Ollama folks like me. It uses the same name and with full credits to you. Let me know if you want it removed ^^

https://ollama.com/xentriom/gemma-4-12B-agentic-fable5-composer2.5-v2

yuxinlu1 pinned discussion

Hiii! 🙏 This is so cool, thank you for porting it over and for the full credit. Please keep it up, making it easy for
the Ollama crowd is exactly what I hoped people would do.

One quick heads up, since v2 is all about tools and agentic work: Gemma 4 has its own native tool-calling format, and
Ollama's default template tends to leak those tool tokens out as plain text instead of clean tool_calls. If you or
anyone runs into that, it's a template fix in the Modelfile (mirroring what llama.cpp does with --jinja), and it also
helps to set sane sampling (temp 1.0, top_p 0.95, top_k 64, repeat 1.1) since Gemma 4 can degenerate at the stock
repeat penalty. Happy to share the exact template I use if it would save you time. Thanks again! 😄

I haven’t encountered that issue yet but I’ll make a note of it in the readme. If you want I’ll be happy to modify it with the exact template and reupload it for everyone else.

Also no problem at all! Thank you for creating this!

xentriom changed discussion title from Ollama to Ollama Port

Hi xentriom,I tried ollama run hf.co/yuxinlu1/gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF:Q8_0 but it lead to download an incomplete 465 MB shard and failed to launch the model.

Hi xentriom,I tried ollama run hf.co/yuxinlu1/gemma-4-12B-agentic-fable5-composer2.5-v2-3.5x-tau2-GGUF:Q8_0 but it lead to download an incomplete 465 MB shard and failed to launch the model.

Try:
ollama run xentriom/gemma-4-12B-agentic-fable5-composer2.5-v2:Q8_0

yesssss! i was running the regular model and it didn't support tools or thinking according to the CLI and it leaked it's thinking

I commented on the wrong model, never mind me 🫣

Sign up or log in to comment