mirror of https://github.com/ollama/ollama.git
With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU.

| File |
|---|
| ggml.go |
| ggml_test.go |
| gguf.go |
| gguf_test.go |
| type.go |