mirror of https://github.com/ollama/ollama.git
With the new version of GGML in #12245, KV cache quantization no longer causes a fallback to CPU.

| Name |
|---|
| ggml |
| gguf |
| util/bufioutil |
| config.go |