fedf71635e 
								
							 
						 
						
							
							
								
								Extend api/show and ollama show to return more model info ( #4881 )  
							
							
							
							
							* API Show Extended
* Initial Draft of Information
Co-Authored-By: Patrick Devine <pdevine@sonic.net>
* Clean Up
* Descriptive arg error messages and other fixes
* Second Draft of Show with Projectors Included
* Remove Chat Template
* Touches
* Prevent wrapping from files
* Verbose functionality
* Docs
* Address Feedback
* Lint
* Resolve Conflicts
* Function Name
* Tests for api/show model info
* Show Test File
* Add Projector Test
* Clean routes
* Projector Check
* Move Show Test
* Touches
* Doc update
---------
Co-authored-by: Patrick Devine <pdevine@sonic.net> 
							
						 
						
							2024-06-19 14:19:02 -07:00  
				
					
						
							
							
								 
						
							
								171796791f 
								
							 
						 
						
							
							
								
								Adjust mmap logic for cuda windows for faster model load  
							
							
							
							
							On Windows, recent llama.cpp changes make mmap slower in most
cases, so default to off.  This also implements a tri-state for
use_mmap so we can detect the difference between a user provided
value of true/false, or unspecified. 
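
A minimal sketch of the tri-state idea, assuming a nullable `*bool` is used to tell "unspecified" apart from an explicit true/false. The helper name `resolveMmap` and its parameters are hypothetical and not taken from the commit:

```go
package main

import "fmt"

// resolveMmap is a hypothetical helper: a nil pointer means the user did not
// specify use_mmap, so the platform default applies. A non-nil pointer is an
// explicit user choice and always wins.
func resolveMmap(userValue *bool, platformDefault bool) bool {
	if userValue == nil {
		return platformDefault
	}
	return *userValue
}

func main() {
	on, off := true, false

	// Unspecified: fall back to the platform default (off in this sketch).
	fmt.Println(resolveMmap(nil, false))
	// Explicit true overrides a default of off.
	fmt.Println(resolveMmap(&on, false))
	// Explicit false overrides a default of on.
	fmt.Println(resolveMmap(&off, true))
}
```

A plain `bool` cannot express the third state, which is why this sketch uses a pointer; the actual commit may represent the tri-state differently.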
							
						 
						
							2024-06-17 16:54:30 -07:00  
				
					
						
							
							
								 
						
							
								89c79bec8c 
								
							 
						 
						
							
							
								
								Add ModifiedAt Field to /api/show ( #5033 )  
							
							
							
							
							* Add Mod Time to Show
* Error Handling 
							
						 
						
							2024-06-15 20:53:56 -07:00  
				
					
						
							
							
								 
						
							
								c69bc19e46 
								
							 
						 
						
							
							
								
								move OLLAMA_HOST to envconfig ( #5009 )  
							
							
							
						 
						
							2024-06-12 18:48:16 -04:00  
				
					
						
							
							
								 
						
							
								4bf1da4944 
								
							 
						 
						
							
							
								
								Separate ListResponse and ModelResponse for api/tags vs api/ps ( #4842 )  
							
							
							
							
							* Remove false time fields
* Struct Separation for List and Process
* Remove Marshaler 
							
						 
						
							2024-06-06 10:11:45 -07:00  
				
					
						
							
							
								 
						
							
								c895a7d13f 
								
							 
						 
						
							
							
								
								some gocritic  
							
							
							
						 
						
							2024-06-04 11:13:30 -07:00  
				
					
						
							
							
								 
						
							
								6845988807 
								
							 
						 
						
							
							
								
								Ollama `ps` command for showing currently loaded models ( #4327 )  
							
							
							
						 
						
							2024-05-13 17:17:36 -07:00  
				
					
						
							
							
								 
						
							
								6602e793c0 
								
							 
						 
						
							
							
								
								Use `--quantize` flag and `quantize` api parameter ( #4321 )  
							
							
							
							
							* rename `--quantization` to `--quantize`
* backwards
* Update api/types.go
Co-authored-by: Michael Yang <mxyng@pm.me>
---------
Co-authored-by: Michael Yang <mxyng@pm.me> 
							
						 
						
							2024-05-10 13:06:13 -07:00  
				
					
						
							
							
								 
						
							
								c02db93243 
								
							 
						 
						
							
							
								
								omit empty done reason  
							
							
							
						 
						
							2024-05-09 16:45:29 -07:00  
				
					
						
							
							
								 
						
							
								cfa84b8470 
								
							 
						 
						
							
							
								
								add done_reason to the api ( #4235 )  
							
							
							
						 
						
							2024-05-09 13:30:14 -07:00  
				
					
						
							
							
								 
						
							
								d5eec16d23 
								
							 
						 
						
							
							
								
								use model defaults for `num_gqa`, `rope_frequency_base` and `rope_frequency_scale` ( #1983 )  
							
							
							
						 
						
							2024-05-09 09:06:13 -07:00  
				
					
						
							
							
								 
						
							
								d77c1c5f9d 
								
							 
						 
						
							
							
								
								api: fill up API documentation ( #3596 )  
							
							
							
							
							* api: fill up API documentation
Followup for #2878 
Now that the documentation is more complete, mention it in the README.
Updates #2840 
* fix typo/lint
* Update README.md
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com>
---------
Co-authored-by: Jeffrey Morgan <jmorganca@gmail.com> 
							
						 
						
							2024-05-07 16:27:46 -07:00  
				
					
						
							
							
								 
						
							
								af47413dba 
								
							 
						 
						
							
							
								
								Add MarshalJSON to Duration ( #3284 )  
							
							
							
							
							---------
Co-authored-by: Patrick Devine <patrick@infrahq.com> 
							
						 
						
							2024-05-06 15:59:18 -07:00  
				
					
						
							
							
								 
						
							
								9009bedf13 
								
							 
						 
						
							
							
								
								better checking for OLLAMA_HOST variable ( #3661 )  
							
							
							
						 
						
							2024-04-29 19:14:07 -04:00  
				
					
						
							
							
								 
						
							
								993cf8bf55 
								
							 
						 
						
							
							
								
								llm: limit generation to 10x context size to avoid run on generations ( #3918 )  
							
							
							
							
							* llm: limit generation to 10x context size to avoid run on generations
* add comment
* simplify condition statement 
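
One way such a cap could look, as a sketch only. The function name `capNumPredict` and the treatment of negative values as "unlimited" are assumptions for illustration, not details confirmed by the commit text:

```go
package main

import "fmt"

// capNumPredict is a hypothetical helper that bounds the number of tokens to
// generate at 10x the context size. Negative requests are assumed here to
// mean "no limit requested" and are clamped to the same ceiling.
func capNumPredict(numPredict, numCtx int) int {
	limit := 10 * numCtx
	if numPredict < 0 || numPredict > limit {
		return limit
	}
	return numPredict
}

func main() {
	fmt.Println(capNumPredict(128, 2048))    // within the limit, unchanged
	fmt.Println(capNumPredict(-1, 2048))     // unlimited request, capped
	fmt.Println(capNumPredict(100000, 2048)) // over the limit, capped
}
```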
							
						 
						
							2024-04-25 19:02:30 -04:00  
				
					
						
							
							
								 
						
							
								62be2050dd 
								
							 
						 
						
							
							
								
								chore: using errors.New to replace fmt.Errorf is much better ( #3789 )  
							
							
							
						 
						
							2024-04-20 22:11:06 -04:00  
				
					
						
							
							
								 
						
							
								ad90b9ab3d 
								
							 
						 
						
							
							
								
								api: start adding documentation to package api ( #2878 )  
							
							
							
							
							* api: start adding documentation to package api
Updates #2840 
* Fix lint typo report 
							
						 
						
							2024-04-10 13:31:55 -04:00  
				
					
						
							
							
								 
						
							
								01114b4526 
								
							 
						 
						
							
							
								
								fix: rope  
							
							
							
						 
						
							2024-04-09 16:15:24 -07:00  
				
					
						
							
							
								 
						
							
								9502e5661f 
								
							 
						 
						
							
							
								
								cgo quantize  
							
							
							
						 
						
							2024-04-08 15:31:08 -07:00  
				
					
						
							
							
								 
						
							
								be517e491c 
								
							 
						 
						
							
							
								
								no rope parameters  
							
							
							
						 
						
							2024-04-05 18:05:27 -07:00  
				
					
						
							
							
								 
						
							
								3b4bab3dc5 
								
							 
						 
						
							
							
								
								Fix embeddings load model behavior ( #2848 )  
							
							
							
						 
						
							2024-02-29 17:40:56 -08:00  
				
					
						
							
							
								 
						
							
								e95b896790 
								
							 
						 
						
							
							
								
								Update types.go ( #2744 )  
							
							
							
							
							specfied -> specified 
							
						 
						
							2024-02-25 13:41:25 -05:00  
				
					
						
							
							
								 
						
							
								caf2b13c10 
								
							 
						 
						
							
							
								
								Fix infinite keep_alive ( #2480 )  
							
							
							
						 
						
							2024-02-13 15:40:32 -08:00  
				
					
						
							
							
								 
						
							
								b5cf31b460 
								
							 
						 
						
							
							
								
								add keep_alive to generate/chat/embedding api endpoints ( #2146 )  
							
							
							
						 
						
							2024-01-26 14:28:02 -08:00  
				
					
						
							
							
								 
						
							
								7c40a67841 
								
							 
						 
						
							
							
								
								Save and load sessions ( #2063 )  
							
							
							
						 
						
							2024-01-25 12:12:36 -08:00  
				
					
						
							
							
								 
						
							
								745b5934fa 
								
							 
						 
						
							
							
								
								add model to ModelResponse  
							
							
							
						 
						
							2024-01-18 14:32:55 -08:00  
				
					
						
							
							
								 
						
							
								a38d88d828 
								
							 
						 
						
							
							
								
								api: add model for all requests  
							
							
							
							
							prefer using req.Model and fallback to req.Name 
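
The fallback described above can be sketched as follows. The `request` struct and `modelName` helper are hypothetical stand-ins; only the field names `Model` and `Name` come from the commit message:

```go
package main

import "fmt"

// request is a hypothetical stand-in for an API request type that carries
// both the newer Model field and the older Name field.
type request struct {
	Model string
	Name  string
}

// modelName prefers Model and falls back to Name when Model is empty.
func modelName(req request) string {
	if req.Model != "" {
		return req.Model
	}
	return req.Name
}

func main() {
	fmt.Println(modelName(request{Model: "llama2", Name: "legacy"})) // Model wins
	fmt.Println(modelName(request{Name: "legacy"}))                  // falls back to Name
}
```

Keeping the fallback in one helper means older clients that still send only `Name` continue to work without each handler repeating the check.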
							
						 
						
							2024-01-18 14:31:37 -08:00  
				
					
						
							
							
								 
						
							
								22e93efa41 
								
							 
						 
						
							
							
								
								add show info command and fix the modelfile  
							
							
							
						 
						
							2024-01-05 12:20:05 -08:00  
				
					
						
							
							
								 
						
							
								55978c1dc9 
								
							 
						 
						
							
							
								
								clean up cache api option  
							
							
							
						 
						
							2023-12-27 14:27:45 -05:00  
				
					
						
							
							
								 
						
							
								d4ebdadbe7 
								
							 
						 
						
							
							
								
								enable `cache_prompt` by default  
							
							
							
						 
						
							2023-12-27 14:23:42 -05:00  
				
					
						
							
							
								 
						
							
								10da41d677 
								
							 
						 
						
							
							
								
								Add Cache flag to api ( #1642 )  
							
							
							
						 
						
							2023-12-22 17:16:20 -05:00  
				
					
						
							
							
								 
						
							
								d99fa6ce0a 
								
							 
						 
						
							
							
								
								send empty messages on last chat response ( #1530 )  
							
							
							
						 
						
							2023-12-18 14:23:38 -05:00  
				
					
						
							
							
								 
						
							
								d9e60f634b 
								
							 
						 
						
							
							
								
								add image support to the chat api ( #1490 )  
							
							
							
						 
						
							2023-12-12 13:28:58 -08:00  
				
					
						
							
							
								 
						
							
								910e9401d0 
								
							 
						 
						
							
							
								
								Multimodal support ( #1216 )  
							
							
							
							
							---------
Co-authored-by: Matt Apperson <mattapperson@Matts-MacBook-Pro.local> 
							
						 
						
							2023-12-11 13:56:22 -08:00  
				
					
						
							
							
								 
						
							
								9e1406e4ed 
								
							 
						 
						
							
							
								
								Don't expose model information in `/api/generate`  
							
							
							
						 
						
							2023-12-09 02:05:43 -08:00  
				
					
						
							
							
								 
						
							
								5d75505ebd 
								
							 
						 
						
							
							
								
								return model configuration in generate  
							
							
							
						 
						
							2023-12-05 14:39:02 -08:00  
				
					
						
							
							
								 
						
							
								195e3d9dbd 
								
							 
						 
						
							
							
								
								chat api endpoint ( #1392 )  
							
							
							
						 
						
							2023-12-05 14:57:33 -05:00  
				
					
						
							
							
								 
						
							
								00d06619a1 
								
							 
						 
						
							
							
								
								Revert "chat api ( #991 )" while context variable is fixed  
							
							
							
							
							This reverts commit 7a0899d62d 
							
						 
						
							2023-12-04 21:16:27 -08:00  
				
					
						
							
							
								 
						
							
								7a0899d62d 
								
							 
						 
						
							
							
								
								chat api ( #991 )  
							
							
							
							
							- update chat docs
- add messages chat endpoint
- remove deprecated context and template generate parameters from docs
- context and template are still supported for the time being and will continue to work as expected
- add partial response to chat history 
							
						 
						
							2023-12-04 18:01:06 -05:00  
				
					
						
							
							
								 
						
							
								cde31cb220 
								
							 
						 
						
							
							
								
								Allow setting parameters in the REPL ( #1294 )  
							
							
							
						 
						
							2023-11-29 09:56:42 -08:00  
				
					
						
							
							
								 
						
							
								bc22d5a38b 
								
							 
						 
						
							
							
								
								no blob response  
							
							
							
						 
						
							2023-11-15 15:16:23 -08:00  
				
					
						
							
							
								 
						
							
								1552cee59f 
								
							 
						 
						
							
							
								
								client create modelfile  
							
							
							
						 
						
							2023-11-15 15:16:23 -08:00  
				
					
						
							
							
								 
						
							
								3ca56b5ada 
								
							 
						 
						
							
							
								
								add create modelfile field  
							
							
							
						 
						
							2023-11-15 15:16:23 -08:00  
				
					
						
							
							
								 
						
							
								5cba29b9d6 
								
							 
						 
						
							
							
								
								JSON mode: add `"format"` as an api parameter ( #1051 )  
							
							
							
							
							* add `"format": "json"` as an API parameter
---------
Co-authored-by: Bruce MacDonald <brucewmacdonald@gmail.com> 
							
						 
						
							2023-11-09 16:44:02 -08:00  
				
					
						
							
							
								 
						
							
								a49d6acc1e 
								
							 
						 
						
							
							
								
								add a complete /generate options example ( #1035 )  
							
							
							
						 
						
							2023-11-08 16:44:36 -08:00  
				
					
						
							
							
								 
						
							
								ec2a31e9b3 
								
							 
						 
						
							
							
								
								support raw generation requests ( #952 )  
							
							
							
							
							- add the optional `raw` generate request parameter to bypass prompt formatting and response context
- add raw request to docs 
							
						 
						
							2023-11-08 14:05:02 -08:00  
				
					
						
							
							
								 
						
							
								17678b7225 
								
							 
						 
						
							
							
								
								Restore system prompt on requests and default `num_keep` to `0`  
							
							
							
						 
						
							2023-11-03 13:25:25 -07:00  
				
					
						
							
							
								 
						
							
								06589a3b30 
								
							 
						 
						
							
							
								
								Set `NumKeep` to `4` by default ( #982 )  
							
							
							
						 
						
							2023-11-02 17:26:11 -07:00  
				
					
						
							
							
								 
						
							
								6db3691b8f 
								
							 
						 
						
							
							
								
								update default NumKeep  
							
							
							
						 
						
							2023-11-02 15:47:35 -07:00  
				
					
						
							
							
								 
						
							
								fe6f3b48f7 
								
							 
						 
						
							
							
								
								do not reload the running llm when runtime params change ( #840 )  
							
							
							
							
							- only reload the running llm if the model has changed, or the options for loading the running model have changed
- rename loaded llm to runner to differentiate from loaded model image
- remove logic which keeps the first system prompt in the generation context 
							
						 
						
							2023-10-19 10:39:58 -04:00