bert
|
embed: cleanup (#12299)
|
2025-09-16 09:48:42 -07:00 |
deepseek2
|
Fixed Deepseek2 adding nil tensor error
|
2025-10-03 14:20:06 -07:00 |
gemma2
|
gemma: fix rope scaling for qat models (#12348)
|
2025-09-19 15:04:40 -07:00 |
gemma3
|
gemma: fix rope scaling for qat models (#12348)
|
2025-09-19 15:04:40 -07:00 |
gemma3n
|
fix(llama): other llama flavours (#12308)
|
2025-09-17 12:12:21 -07:00 |
gptoss
|
multi-regexp pretokenizer (#12325)
|
2025-09-23 13:21:47 -07:00 |
llama
|
multi-regexp pretokenizer (#12325)
|
2025-09-23 13:21:47 -07:00 |
llama4
|
add pre:, suf: to tags (#12274)
|
2025-09-23 16:08:57 -07:00 |
mistral3
|
multi-regexp pretokenizer (#12325)
|
2025-09-23 13:21:47 -07:00 |
mllama
|
multi-regexp pretokenizer (#12325)
|
2025-09-23 13:21:47 -07:00 |
qwen2
|
multi-regexp pretokenizer (#12325)
|
2025-09-23 13:21:47 -07:00 |
qwen3
|
multi-regexp pretokenizer (#12325)
|
2025-09-23 13:21:47 -07:00 |
qwen25vl
|
multi-regexp pretokenizer (#12325)
|
2025-09-23 13:21:47 -07:00 |
models.go
|
Grace/deepseek v3 migration (#12385)
|
2025-09-24 15:19:47 -07:00 |