Michael Yang
|
c253433d68
|
embed: cleanup (#12299)
* cleanup
* use pooling.TypeNone
* pooling test
|
2025-09-16 09:48:42 -07:00 |
Michael Yang
|
3f6642f6fc
|
model: implement bert in ollama engine (#9080)
* fix truncate
* s/SentencePieceModel/SentencePiece/
* bert
* wordpiece
* refactor pooling
* more tokenizers
* normalize embeddings
|
2025-09-15 15:35:59 -07:00 |
Michael Yang
|
6f7117145f
|
batch: use tensors for outputs (#12185)
this cleans up the model interface slightly without too much impact in
other areas
|
2025-09-15 14:33:06 -07:00 |
Michael Yang
|
5994e8e8fd
|
embedding gemma model (#12181)
* ollama: add embeddings
|
2025-09-04 09:09:07 -07:00 |