ollama/ml/nn
Michael Yang ad95d5b30b
use split activations when possible (#12293)
* use ggml_*_split activations when possible

* forward qkv
2025-09-16 09:51:19 -07:00
..
fast ml: add more rope options (#10775) 2025-05-20 15:51:08 -07:00
pooling embed: cleanup (#12299) 2025-09-16 09:48:42 -07:00
rope chore: fix some inconsistent function name in comment 2025-08-13 09:50:27 -07:00
attention.go use split activations when possible (#12293) 2025-09-16 09:51:19 -07:00
convolution.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00
embedding.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00
linear.go update vendored llama.cpp and ggml (#11823) 2025-08-14 14:42:58 -07:00
normalization.go next ollama runner (#7913) 2025-02-13 16:31:21 -08:00