Commit History

Add Speculative decoding (MTP draft) section: verified build llama.cpp b9553 (9e3b928fd), regression note for b9702/b9717
190a313
verified

yuxinlu1 commited on

Announcements: add pinned-discussion pointer (sampler/tool-parsing fixes); trim Q2_K note
e8a1d38
verified

yuxinlu1 commited on

Drop Q2_K from this release; move note to Announcements (failed stress-testing, Q3_K_M is the floor)
f09d355
verified

yuxinlu1 commited on

Upload gemma4-v2-Q8_0.gguf with huggingface_hub
a466c6d
verified

yuxinlu1 commited on

Pin Q2_K takedown notice; remove Q2_K from quant table
6137557
verified

yuxinlu1 commited on

Remove broken Q2_K quant (gibberish output); re-quantizing with imatrix
237c1f4
verified

yuxinlu1 commited on

Upload gemma4-v2-Q6_K.gguf with huggingface_hub
25bf4b8
verified

yuxinlu1 commited on

Upload gemma4-v2-Q4_K_M.gguf with huggingface_hub
8019aae
verified

yuxinlu1 commited on

Upload gemma4-v2-Q3_K_M.gguf with huggingface_hub
584f1be
verified

yuxinlu1 commited on

Upload gemma4-v2-Q2_K.gguf with huggingface_hub
67c6635
verified

yuxinlu1 commited on

Upload MTP/gemma-4-12B-it-MTP-BF16.gguf with huggingface_hub
8d36f56
verified

yuxinlu1 commited on

Upload MTP/gemma-4-12B-it-MTP-F16.gguf with huggingface_hub
1cf3645
verified

yuxinlu1 commited on

Upload MTP/gemma-4-12B-it-MTP-Q8_0.gguf with huggingface_hub
22a26b2
verified

yuxinlu1 commited on

Upload README.md with huggingface_hub
73b7f55
verified

yuxinlu1 commited on

initial commit
79c23fd
verified

yuxinlu1 commited on