ollama/server
Devon Rifkin ad3c7c9bda
strip out thinking tags in message history for qwen3 & r1 (#10490)
* strip out thinking tags in message history for qwen3 & r1

This is in advance of "proper" support where we'll make reasoning
configurable and we'll parse out thinking/reasoning tags and provide
them to the caller. These models expect there to be no thinking tags in
the message history, so this should improve quality

* parse model names instead of hacky prefix check
2025-04-30 13:57:45 -07:00
..
internal fix superfluous call to WriteHeader 2025-04-25 16:58:49 -07:00
testdata/tools
auth.go
create.go explicitly decode maxarraysize 1024 2025-04-25 16:59:01 -07:00
create_test.go
download.go server: organize error types (#9465) 2025-03-28 11:50:22 -07:00
fixblobs.go
fixblobs_test.go
images.go explicitly decode maxarraysize 1024 2025-04-25 16:59:01 -07:00
images_test.go api: return model capabilities from the show endpoint (#10066) 2025-04-01 15:21:46 -07:00
layer.go
manifest.go
manifest_test.go
model.go explicitly decode maxarraysize 1024 2025-04-25 16:59:01 -07:00
model_test.go
modelpath.go server: organize error types (#9465) 2025-03-28 11:50:22 -07:00
modelpath_test.go
prompt.go gemma3: Allow multiple image in a single input 2025-03-14 15:38:54 -07:00
prompt_test.go
routes.go strip out thinking tags in message history for qwen3 & r1 (#10490) 2025-04-30 13:57:45 -07:00
routes_create_test.go
routes_delete_test.go
routes_generate_test.go Revert "increase default context length to 4096 (#10364)" 2025-04-28 16:54:11 -07:00
routes_list_test.go
routes_test.go strip out thinking tags in message history for qwen3 & r1 (#10490) 2025-04-30 13:57:45 -07:00
sched.go Fix "Stopping..." scheduler hang (#10487) 2025-04-30 11:26:52 -07:00
sched_test.go Revert "increase default context length to 4096 (#10364)" 2025-04-28 16:54:11 -07:00
sparse_common.go
sparse_windows.go
upload.go