ollama

Commit Graph

Author	SHA1	Message	Date
Daniel Hiltgen	17a023f34b	Add v12 + v13 cuda support (#12000 ) * Add support for upcoming NVIDIA Jetsons The latest Jetsons with JetPack 7 are moving to an SBSA compatible model and will not require building a JetPack specific variant. * cuda: bring back dual versions This adds back dual CUDA versions for our releases, with v11 and v13 to cover a broad set of GPUs and driver versions. * win: break up native builds in build_windows.ps1 * v11 build working on windows and linux * switch to cuda v12.8 not JIT * Set CUDA compression to size * enhance manual install linux docs	2025-09-10 12:05:18 -07:00
Daniel Hiltgen	7ccfd97a93	doc: clarify both rocm and main bundle necessary (#11900 ) Some users expect the rocm bundles to be self-sufficient, but are designed to be additive.	2025-08-14 12:54:55 -07:00
ycomiti	4151ef8cf7	Update linux.md (#11462 )	2025-07-22 11:17:31 -07:00
Krzysztof Jeziorny	fc0309615e	docs: update link to AMD drivers in linux.md (#10973 )	2025-06-06 23:30:04 -04:00
Martin Häcker	25248f4bd5	Better WantedBy declaration The problem with default.target is that it always points to the target that is currently started. So if you boot into single user mode or the rescue mode still Ollama tries to start. I noticed this because either tried (and failed) to start all the time during a system update, where Ollama definitely is not wanted.	2025-03-07 10:26:31 +01:00
Azis Alvriyanto	b901a712c6	docs: improve syntax highlighting in code blocks (#8854 )	2025-02-07 09:55:07 -08:00
Abhinav Pant	7814019708	docs: add step for removing libraries in linux.md (#8897 )	2025-02-06 14:54:58 -08:00
Melroy van den Berg	bfdeffc375	docs: use OLLAMA_VERSION=0.5.7 for install version override (#8802 )	2025-02-03 13:54:08 -08:00
Daniel Hiltgen	4879a234c4	build: Make target improvements (#7499 ) * llama: wire up builtin runner This adds a new entrypoint into the ollama CLI to run the cgo built runner. On Mac arm64, this will have GPU support, but on all other platforms it will be the lowest common denominator CPU build. After we fully transition to the new Go runners more tech-debt can be removed and we can stop building the "default" runner via make and rely on the builtin always. * build: Make target improvements Add a few new targets and help for building locally. This also adjusts the runner lookup to favor local builds, then runners relative to the executable, and finally payloads. * Support customized CPU flags for runners This implements a simplified custom CPU flags pattern for the runners. When built without overrides, the runner name contains the vector flag we check for (AVX) to ensure we don't try to run on unsupported systems and crash. If the user builds a customized set, we omit the naming scheme and don't check for compatibility. This avoids checking requirements at runtime, so that logic has been removed as well. This can be used to build GPU runners with no vector flags, or CPU/GPU runners with additional flags (e.g. AVX512) enabled. * Use relative paths If the user checks out the repo in a path that contains spaces, make gets really confused so use relative paths for everything in-repo to avoid breakage. * Remove payloads from main binary * install: clean up prior libraries This removes support for v0.3.6 and older versions (before the tar bundle) and ensures we clean up prior libraries before extracting the bundle(s). Without this change, runners and dependent libraries could leak when we update and lead to subtle runtime errors.	2024-12-10 09:47:19 -08:00
Jeffrey Morgan	b42a596425	docs: add customization section in linux.md (#7709 )	2024-11-17 11:48:12 -08:00
Jeffrey Morgan	108fb6c1d1	docs: improve linux install documentation (#6683 ) Includes small improvements to document layout and code blocks	2024-09-06 22:05:37 -07:00
Tomoya Fujita	133770a548	docs: add group to manual Linux instructions and verify service is running (#6430 )	2024-09-04 14:45:09 -04:00
Daniel Hiltgen	4e1c4f6e0b	Update manual instructions with discrete ROCm bundle (#6445 )	2024-08-27 13:42:28 -07:00
Daniel Hiltgen	f9e31da946	Review comments	2024-08-19 10:36:15 -07:00
Daniel Hiltgen	88bb9e3328	Adjust layout to bin+lib/ollama	2024-08-19 09:38:53 -07:00
Napuh	896495de7b	Add instructions to easily install specific versions on faq.md (#4084 ) * Added instructions to easily install specific versions on faq.md * Small typo * Moved instructions on how to install specific version to linux.md * Update docs/linux.md * Update docs/linux.md --------- Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>	2024-06-09 10:49:03 -07:00
Mohamed A. Fouad	ee02f548c8	Update linux.md (#3847 ) Add -e to viewing logs in order to show end of ollama logs	2024-05-06 15:02:25 -07:00
Daniel Hiltgen	4a5c9b8035	Finish unwinding idempotent payload logic The recent ROCm change partially removed idempotent payloads, but the ggml-metal.metal file for mac was still idempotent. This finishes switching to always extract the payloads, and now that idempotency is gone, the version directory is no longer useful.	2024-03-09 08:34:39 -08:00
Daniel Hiltgen	6c5ccb11f9	Revamp ROCm support This refines where we extract the LLM libraries to by adding a new OLLAMA_HOME env var, that defaults to `~/.ollama`. The logic was already idempotent, so this should speed up startups after the first time a new release is deployed. It also cleans up after itself. We now build only a single ROCm version (latest major) on both windows and linux. Given the large size of ROCm's tensor files, we split the dependency out. It's bundled into the installer on windows, and a separate download on windows. The linux install script is now smart and detects the presence of AMD GPUs and looks to see if rocm v6 is already present, and if not, then downloads our dependency tar file. For Linux discovery, we now use sysfs and check each GPU against what ROCm supports so we can degrade to CPU gracefully instead of having llama.cpp+rocm assert/crash on us. For Windows, we now use go's windows dynamic library loading logic to access the amdhip64.dll APIs to query the GPU information.	2024-03-07 10:36:50 -08:00
Jeffrey Morgan	1c8435ffa9	Update domain name references in docs and install script (#2435 )	2024-02-09 15:19:30 -08:00
Tristram Oaten	40a0a90a88	Add group delete to uninstall instructions (#1924 ) After executing the `userdel ollama` command, I saw this message: ```sh $ sudo userdel ollama userdel: group ollama not removed because it has other members. ``` Which reminded me that I had to remove the dangling group too. For completeness, the uninstall instructions should do this too. Thanks!	2024-01-12 00:07:00 -05:00
Michael Yang	92119de9d8	update linux.md	2023-10-25 14:57:50 -07:00
Bruce MacDonald	cecf83141e	Linux uninstall instructions (#894 )	2023-10-24 14:07:05 -04:00
Jeffrey Morgan	f9b2f999ac	update readme with `docker` setup and link to `import.md`	2023-10-15 02:23:03 -04:00
Jiayu Liu	4fc10acce9	add some missing code directives in docs (#664 )	2023-10-01 11:51:01 -07:00
Jeffrey Morgan	5306b0269d	Update linux.md	2023-09-25 16:10:32 -07:00
Jeffrey Morgan	0fb5268496	Update linux.md	2023-09-25 10:06:23 -07:00
Jeffrey Morgan	ee3032ad89	improvements to `docs/linux.md`	2023-09-24 21:50:07 -07:00
Jeffrey Morgan	5b7a27281d	improvements to `docs/linux.md`	2023-09-24 21:38:23 -07:00
Jeffrey Morgan	d2a784e33e	add `docs/linux.md`	2023-09-24 21:34:44 -07:00

30 Commits