ollama/envconfig
Jesse Gross 8111c35be8 llm: New memory management
This changes the memory allocation strategy from upfront estimation to
tracking actual allocations done by the engine and reacting to that. The
goal is to avoid issues caused by both under-estimation (crashing) and
over-estimation (low performance due to under-utilized GPUs).

It is currently opt-in and can be enabled for models running on the
Ollama engine by setting OLLAMA_NEW_ESTIMATES=1. Behavior in other
cases is unchanged and will continue to use the existing estimates.
2025-08-20 16:56:54 +02:00
config.go llm: New memory management 2025-08-20 16:56:54 +02:00
config_test.go feat: add trace log level (#10650) 2025-05-12 11:43:00 -07:00