mirror of https://github.com/ollama/ollama.git
With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU. |
||
---|---|---|
.. | ||
ggml | ||
gguf | ||
util/bufioutil | ||
config.go |
With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU. |
||
---|---|---|
.. | ||
ggml | ||
gguf | ||
util/bufioutil | ||
config.go |