John
cmp-nct
AI & ML interests
None yet
Recent Activity
new activity
about 8 hours ago
zai-org/GLM-4.7-Flash:llama.cpp inference - 20 times (!) slower than OSS 20 on a RTX 5090
new activity
about 1 month ago
unsloth/Nemotron-3-Nano-30B-A3B-GGUF:Should UD-Q6_K_XL identical to Q6_K.gguf?