ollama

Commit Graph

Author	SHA1	Message	Date
Parth Sareen	20b53eaa72	tests: add tool calling integration test (#12232)	2025-09-09 14:01:11 -07:00
Daniel Hiltgen	517807cdf2	perf: build graph for next batch async to keep GPU busy (#11863) * perf: build graph for next batch in parallel to keep GPU busy This refactors the main run loop of the ollama runner to perform the main GPU intensive tasks (Compute+Floats) in a go routine so we can prepare the next batch in parallel to reduce the amount of time the GPU stalls waiting for the next batch of work. * tests: tune integration tests for ollama engine This tunes the integration tests to focus more on models supported by the new engine.	2025-08-29 14:20:28 -07:00
Daniel Hiltgen	ed4e139314	Integration test improvements (#9654) Add some new test coverage for various model architectures, and switch from orca-mini to the small llama model.	2025-04-16 14:25:55 -07:00
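
The idea described in commit 517807cdf2 (run the compute-heavy step in a goroutine so the next batch can be prepared in parallel) can be illustrated with a minimal Go sketch. This is not the ollama runner's code: `batch`, `prepare`, `compute`, and `runPipelined` are hypothetical names chosen for illustration, and `compute` simply sums numbers as a stand-in for the GPU work.

```go
package main

import "fmt"

// batch is a hypothetical stand-in for one prepared unit of work.
type batch struct {
	id     int
	inputs []float32
}

// prepare stands in for the CPU-side work of building the next batch.
func prepare(id int) batch {
	in := make([]float32, 4)
	for i := range in {
		in[i] = float32(id*10 + i)
	}
	return batch{id: id, inputs: in}
}

// compute stands in for the GPU-intensive step; here it only sums the inputs.
func compute(b batch) float32 {
	var sum float32
	for _, v := range b.inputs {
		sum += v
	}
	return sum
}

// runPipelined prepares batch i+1 on the calling goroutine while a second
// goroutine is still computing batch i.
func runPipelined(n int) []float32 {
	ready := make(chan batch) // unbuffered: at most one prepared batch waits
	done := make(chan struct{})
	results := make([]float32, 0, n)

	// Compute goroutine: consumes batches in the order they were prepared.
	go func() {
		defer close(done)
		for b := range ready {
			results = append(results, compute(b))
		}
	}()

	// Main loop: each prepare call overlaps with the compute of the
	// previously sent batch; the send blocks until the consumer is free.
	for i := 0; i < n; i++ {
		ready <- prepare(i)
	}
	close(ready)
	<-done // results is only read after the compute goroutine has exited
	return results
}

func main() {
	fmt.Println(runPipelined(3))
}
```

The unbuffered channel keeps exactly one batch in flight ahead of the compute step, which is the simplest way to overlap the two stages without letting preparation run arbitrarily far ahead.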

3 Commits