Michael Yang
3f6642f6fc
model: implement bert in ollama engine (#9080)
* fix truncate
* s/SentencePieceModel/SentencePiece/
* bert
* wordpiece
* refactor pooling
* more tokenizers
* normalize embeddings
2025-09-15 15:35:59 -07:00
Michael Yang
5994e8e8fd
embedding gemma model (#12181)
* ollama: add embeddings
2025-09-04 09:09:07 -07:00
Michael Yang
fb92b61754
logutil: add Trace and TraceContext helpers (#12110)
2025-09-02 13:09:12 -07:00
Michael Yang
333e360422
model: handle multiple eos tokens (#10577)
* get eos_token_id from generation_config.json
* refactor
* include both ids and strings in trace
* comments
* remove special case for gemma3 special vocab (#10743)
2025-05-16 13:40:23 -07:00