ollama/model/models
Michael Yang c253433d68
embed: cleanup (#12299)
* cleanup

* use pooling.TypeNone

* pooling test
2025-09-16 09:48:42 -07:00
..
bert embed: cleanup (#12299) 2025-09-16 09:48:42 -07:00
gemma2 model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00
gemma3 embed: cleanup (#12299) 2025-09-16 09:48:42 -07:00
gemma3n model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00
gptoss batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
llama batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
llama4 batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
mistral3 batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
mllama batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
qwen2 batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
qwen3 batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
qwen25vl batch: use tensors for outputs (#12185) 2025-09-15 14:33:06 -07:00
models.go model: implement bert in ollama engine (#9080) 2025-09-15 15:35:59 -07:00