ollama/server
Daniel Hiltgen 15e3611d3d
logs: quiet down context canceled on completion and scheduler noise (#12553)
* logs: quiet down context canceled on completion

If the client closed the connection before Completion finished, we logged
the failure at error level, which misleadingly implied that the runner had crashed.

time=2025-10-08T22:59:20.566-07:00 level=ERROR source=server.go:1490 msg="post predict" error="Post \"http://127.0.0.1:57736/completion\": context canceled"

* quiet down scheduler log error on expected case

Since we don't hold the lock while performing memory load calculations, other
runners can unload in parallel, so finding no runner to unload is a valid scenario
that we shouldn't log at error level.
2025-10-09 10:37:47 -07:00
..
internal refactor: use the built-in max/min to simplify the code (#12280) 2025-09-16 17:14:21 -07:00
auth.go fix nil deref in auth.go 2024-07-26 14:14:48 -07:00
create.go engine: add remote proxy (#12307) 2025-09-17 14:40:53 -07:00
create_test.go engine: add remote proxy (#12307) 2025-09-17 14:40:53 -07:00
download.go server: abort download on empty digest 2025-05-27 11:28:48 -07:00
fixblobs.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
fixblobs_test.go server: replace blob prefix separator from ':' to '-' (#3146) 2024-03-14 20:18:06 -07:00
images.go templates: fix crash in improperly defined templates (#12483) 2025-10-02 17:25:55 -07:00
images_test.go Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) 2025-06-20 11:11:40 -07:00
layer.go One corrupt manifest should not wedge model operations (#7515) 2024-11-05 14:21:45 -08:00
manifest.go One corrupt manifest should not wedge model operations (#7515) 2024-11-05 14:21:45 -08:00
manifest_test.go One corrupt manifest should not wedge model operations (#7515) 2024-11-05 14:21:45 -08:00
model.go tools: refactor tool call parsing and enable streaming (#10415) 2025-05-23 14:19:31 -07:00
modelpath.go server: add hint to the error message when model path access fails (#10843) 2025-05-24 13:17:04 -07:00
modelpath_test.go lint: enable usetesting, disable tenv (#10594) 2025-05-08 11:42:14 -07:00
prompt.go Revert "add truncate and shift parameters (#12519)" (#12545) 2025-10-08 17:57:57 -07:00
prompt_test.go Revert "add truncate and shift parameters (#12519)" (#12545) 2025-10-08 17:57:57 -07:00
quantization.go skip quantizing per_layer_token_embd (#11207) 2025-06-26 21:49:35 -07:00
quantization_test.go Reapply "feat: incremental gguf parser (#10822)" (#11114) (#11119) 2025-06-20 11:11:40 -07:00
routes.go routes: structured outputs for gpt-oss (#12460) 2025-10-08 19:13:38 -07:00
routes_create_test.go engine: add remote proxy (#12307) 2025-09-17 14:40:53 -07:00
routes_debug_test.go Use runners for GPU discovery (#12090) 2025-10-01 15:12:32 -07:00
routes_delete_test.go Update the /api/create endpoint to use JSON (#7935) 2024-12-31 18:02:30 -08:00
routes_generate_test.go routes: structured outputs for gpt-oss (#12460) 2025-10-08 19:13:38 -07:00
routes_harmony_streaming_test.go Use runners for GPU discovery (#12090) 2025-10-01 15:12:32 -07:00
routes_list_test.go Update the /api/create endpoint to use JSON (#7935) 2024-12-31 18:02:30 -08:00
routes_test.go engine: add remote proxy (#12307) 2025-09-17 14:40:53 -07:00
sched.go logs: quiet down context canceled on completion and scheduler noise (#12553) 2025-10-09 10:37:47 -07:00
sched_test.go Use runners for GPU discovery (#12090) 2025-10-01 15:12:32 -07:00
sparse_common.go Don't hard fail on sparse setup error 2024-08-09 12:16:19 -07:00
sparse_windows.go Don't hard fail on sparse setup error 2024-08-09 12:16:19 -07:00
upload.go server: always print upload/download part info (#8832) 2025-02-04 19:30:49 -08:00