# OpenAI compatibility

> [!NOTE]
> OpenAI compatibility is experimental and is subject to major adjustments including breaking changes. For fully-featured access to the Ollama API, see the Ollama [Python library](https://github.com/ollama/ollama-python), [JavaScript library](https://github.com/ollama/ollama-js) and [REST API](https://github.com/ollama/ollama/blob/main/docs/api.md).

Ollama provides experimental compatibility with parts of the [OpenAI API](https://platform.openai.com/docs/api-reference) to help connect existing applications to Ollama.

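Since the compatibility layer is plain HTTP, existing applications can also talk to it without any SDK; a chat completion is just a POST to `/v1/chat/completions`. A standard-library sketch (the `build_request` helper and model name are illustrative; `chat()` assumes an Ollama server on the default port):

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/v1/chat/completions"

def build_request(model: str, prompt: str) -> urllib.request.Request:
    """Build the POST request the OpenAI-compatible endpoint expects."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

def chat(model: str, prompt: str) -> str:
    """Send the request and return the assistant's reply (needs a running server)."""
    with urllib.request.urlopen(build_request(model, prompt)) as resp:
        body = json.load(resp)
    return body["choices"][0]["message"]["content"]

req = build_request("llama3.2", "Say this is a test")
print(req.full_url)
```

The response body follows the OpenAI chat completion shape, so the reply text lives at `choices[0].message.content`.
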
## Usage

### OpenAI Python library

```python
from openai import OpenAI

client = OpenAI(
    base_url='http://localhost:11434/v1/',

    # required but ignored
    api_key='ollama',
)

chat_completion = client.chat.completions.create(
    messages=[
        {
            'role': 'user',
            'content': 'Say this is a test',
        }
    ],
    model='llama3.2',
)

response = client.chat.completions.create(
    model="llava",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What's in this image?"},
                {
                    "type": "image_url",
                    "image_url": "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAG0AAABmCAYAAADBPx+V...",  # base64 image data truncated for brevity
                },
            ],
        }
    ],
    max_tokens=300,
)

completion = client.completions.create(
    model="llama3.2",
    prompt="Say this is a test",
)

list_completion = client.models.list()

model = client.models.retrieve("llama3.2")

embeddings = client.embeddings.create(
    model="all-minilm",
    input=["why is the sky blue?", "why is the grass green?"],
)
```

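The chat and completion endpoints can also stream: passing `stream=True` makes the SDK yield chunks whose `choices[0].delta.content` fragments concatenate into the full reply. A minimal accumulator sketch, using plain dicts to stand in for the SDK's chunk objects (the sample chunks are hypothetical):

```python
def collect_stream(chunks) -> str:
    """Concatenate the content deltas from a streamed chat completion."""
    parts = []
    for chunk in chunks:
        delta = chunk["choices"][0]["delta"]
        # The final chunk may omit "content" (e.g. it carries finish_reason only).
        parts.append(delta.get("content") or "")
    return "".join(parts)

# Hypothetical chunks shaped like the OpenAI streaming format
sample = [
    {"choices": [{"delta": {"role": "assistant", "content": "This is"}}]},
    {"choices": [{"delta": {"content": " a test"}}]},
    {"choices": [{"delta": {}, "finish_reason": "stop"}]},
]
print(collect_stream(sample))
```

With the real SDK, the same loop iterates over `client.chat.completions.create(..., stream=True)` and reads `chunk.choices[0].delta.content`.
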
#### Structured outputs

```python
from pydantic import BaseModel
from openai import OpenAI

client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

# Define the schema for the response
class FriendInfo(BaseModel):
    name: str
    age: int
    is_available: bool

class FriendList(BaseModel):
    friends: list[FriendInfo]

try:
    completion = client.beta.chat.completions.parse(
        temperature=0,
        model="llama3.1:8b",
        messages=[
            {"role": "user", "content": "I have two friends. The first is Ollama 22 years old busy saving the world, and the second is Alonso 23 years old and wants to hang out. Return a list of friends in JSON format"}
        ],
        response_format=FriendList,
    )

    friends_response = completion.choices[0].message
    if friends_response.parsed:
        print(friends_response.parsed)
    elif friends_response.refusal:
        print(friends_response.refusal)
except Exception as e:
    print(f"Error: {e}")
```

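With `client.beta.chat.completions.parse`, the SDK derives a JSON schema from the pydantic model and validates the model's JSON reply against it. A standard-library sketch of that shape check, using a hypothetical reply string in place of a live response:

```python
import json

def parse_friend_list(reply: str) -> list[dict]:
    """Parse a structured-output reply and check the FriendList shape."""
    data = json.loads(reply)
    friends = data["friends"]
    for friend in friends:
        assert isinstance(friend["name"], str)
        assert isinstance(friend["age"], int)
        assert isinstance(friend["is_available"], bool)
    return friends

# Hypothetical reply text matching the FriendList schema
reply = '{"friends": [{"name": "Ollama", "age": 22, "is_available": false}]}'
friends = parse_friend_list(reply)
print(friends[0]["name"])
```

pydantic performs this validation (and richer coercion) automatically, exposing the result as `message.parsed`.
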
### OpenAI JavaScript library

```javascript
import OpenAI from 'openai'

const openai = new OpenAI({
  baseURL: 'http://localhost:11434/v1/',

  // required but ignored
  apiKey: 'ollama',
})

const chatCompletion = await openai.chat.completions.create({
    messages: [{ role: 'user', content: 'Say this is a test' }],
    model: 'llama3.2',
})

const response = await openai.chat.completions.create({
    model: "llava",
    messages: [
        {
        role: "user",
        content: [
            { type: "text", text: "What's in this image?" },
            {
            type: "image_url",
								            image_url: "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAG0AAABmCAYAAADBPx+VAAAACXBIWXMAAAsTAAALEwEAmpwYAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAA3VSURBVHgB7Z27r0zdG8fX743i1bi1ikMoFMQloXRpKFFIqI7LH4BEQ+NWIkjQuSWCRIEoULk0gsK1kCBI0IhrQVT7tz/7zZo888yz1r7MnDl7z5xvsjkzs2fP3uu71nNfa7lkAsm7d++Sffv2JbNmzUqcc8m0adOSzZs3Z+/XES4ZckAWJEGWPiCxjsQNLWmQsWjRIpMseaxcuTKpG/7HP27I8P79e7dq1ars/yL4/v27S0ejqwv+cUOGEGGpKHR37tzJCEpHV9tnT58+dXXCJDdECBE2Ojrqjh071hpNECjx4cMHVycM1Uhbv359B2F79+51586daxN/+pyRkRFXKyRDAqxEp4yMlDDzXG1NPnnyJKkThoK0VFd1ELZu3TrzXKxKfW7dMBQ6bcuWLW2v0VlHjx41z717927ba22U9APcw7Nnz1oGEPeL3m3p2mTAYYnFmMOMXybPPXv2bNIPpFZr1NHn4HMw0KRBjg9NuRw95s8PEcz/6DZELQd/09C9QGq5RsmSRybqkwHGjh07OsJSsYYm3ijPpyHzoiacg35MLdDSIS/O1yM778jOTwYUkKNHWUzUWaOsylE00MyI0fcnOwIdjvtNdW/HZwNLGg+sR1kMepSNJXmIwxBZiG8tDTpEZzKg0GItNsosY8USkxDhD0Rinuiko2gfL/RbiD2LZAjU9zKQJj8RDR0vJBR1/Phx9+PHj9Z7REF4nTZkxzX4LCXHrV271qXkBAPGfP/atWvu/PnzHe4C97F48eIsRLZ9+3a3f/9+87dwP1JxaF7/3r17ba+5l4EcaVo0lj3SBq5kGTJSQmLWMjgYNei2GPT1MuMqGTDEFHzeQSP2wi/jGnkmPJ/nhccs44jvDAxpVcxnq0F6eT8h4ni/iIWpR5lPyA6ETkNXoSukvpJAD3AsXLiwpZs49+fPn5ke4j10TqYvegSfn0OnafC+Tv9ooA/JPkgQysqQNBzagXY55nO/oa1F7qvIPWkRL12WRpMWUvpVDYmxAPehxWSe8ZEXL20sadYIozfmNch4QJPAfeJgW3rNsnzphBKNJM2KKODo1rVOMRYik5ETy3ix4qWNI81qAAirizgMIc+yhTytx0JWZuNI03qsrgWlGtwjoS9XwgUhWGyhUaRZZQNNIEwCiXD16tXcAHUs79co0vSD8rrJCIW98pzvxpAWyyo3HYwqS0+H0BjStClcZJT5coMm6D2LOF8TolGJtK9fvyZpyiC5ePFi9nc/oJU4eiEP0jVoAnHa9wyJycITMP78+eMeP37sXrx44d6+fdt6f82aNdkx1pg9e3Zb5W+RSRE+n+VjksQWifvVaTKFhn5O8my63K8Qabdv33b379/PiAP//vuvW7BggZszZ072/+TJk91YgkafPn166zXB1rQHFvouAWHq9z3SEevSUerqCn2/dDCeta2jxYbr69evk4MHDyY7d+7MjhMnTiTPnz9Pfv/+nfQT2ggpO2dMF8cghuoM7Ygj5iWCqRlGFml0QC/ftGmTmzt3rmsaKDsgBSPh0/8yPeLLBihLkOKJc0jp8H8vUzcxIA1k6QJ/c78tWEyj5P3o4u9+jywNPdJi5rAH9x0KHcl4Hg570eQp3+vHXGyrmEeigzQsQsjavXt38ujRo44LQuDDhw+TW7duRS1HGgMxhNXHgflaNTOsHyKvHK5Ijo2jbFjJBQK9YwFd6RVMzfgRBmEfP37suBBm/p49e1qjEP2mwTViNRo0VJWH1deMXcNK08uUjVUu7s/zRaL+oLNxz1bpANco4npUgX4G2eFbpDFyQoQxojBCpEGSytmOH8qrH5Q9vuzD6ofQyl
kCUmh8DBAr+q8JCyVNtWQIidKQE9wNtLSQnS4jDSsxNHogzFuQBw4cyM61UKVsjfr3ooBkPSqqQHesUPWVtzi9/vQi1T+rJj7WiTz4Pt/l3LxUkr5P2VYZaZ4URpsE+st/dujQoaBBYokbrz/8TJNQYLSonrPS9kUaSkPeZyj1AWSj+d+VBoy1pIWVNed8P0Ll/ee5HdGRhrHhR5GGN0r4LGZBaj8oFDJitBTJzIZgFcmU0Y8ytWMZMzJOaXUSrUs5RxKnrxmbb5YXO9VGUhtpXldhEUogFr3IzIsvlpmdosVcGVGXFWp2oU9kLFL3dEkSz6NHEY1sjSRdIuDFWEhd8KxFqsRi1uM/nz9/zpxnwlESONdg6dKlbsaMGS4EHFHtjFIDHwKOo46l4TxSuxgDzi+rE2jg+BaFruOX4HXa0Nnf1lwAPufZeF8/r6zD97WK2qFnGjBxTw5qNGPxT+5T/r7/7RawFC3j4vTp09koCxkeHjqbHJqArmH5UrFKKksnxrK7FuRIs8STfBZv+luugXZ2pR/pP9Ois4z+TiMzUUkUjD0iEi1fzX8GmXyuxUBRcaUfykV0YZnlJGKQpOiGB76x5GeWkWWJc3mOrK6S7xdND+W5N6XyaRgtWJFe13GkaZnKOsYqGdOVVVbGupsyA/l7emTLHi7vwTdirNEt0qxnzAvBFcnQF16xh/TMpUuXHDowhlA9vQVraQhkudRdzOnK+04ZSP3DUhVSP61YsaLtd/ks7ZgtPcXqPqEafHkdqa84X6aCeL7YWlv6edGFHb+ZFICPlljHhg0bKuk0CSvVznWsotRu433alNdFrqG45ejoaPCaUkWERpLXjzFL2Rpllp7PJU2a/v7Ab8N05/9t27Z16KUqoFGsxnI9EosS2niSYg9SpU6B4JgTrvVW1flt1sT+0ADIJU2maXzcUTraGCRaL1Wp9rUMk16PMom8QhruxzvZIegJjFU7LLCePfS8uaQdPny4jTTL0dbee5mYokQsXTIWNY46kuMbnt8Kmec+LGWtOVIl9cT1rCB0V8WqkjAsRwta93TbwNYoGKsUSChN44lgBNCoHLHzquYKrU6qZ8lolCIN0Rh6cP0Q3U6I6IXILYOQI513hJaSKAorFpuHXJNfVlpRtmYBk1Su1obZr5dnKAO+L10Hrj3WZW+E3qh6IszE37F6EB+68mGpvKm4eb9bFrlzrok7fvr0Kfv727dvWRmdVTJHw0qiiCUSZ6wCK+7XL/AcsgNyL74DQQ730sv78Su7+t/A36MdY0sW5o40ahslXr58aZ5HtZB8GH64m9EmMZ7FpYw4T6QnrZfgenrhFxaSiSGXtPnz57e9TkNZLvTjeqhr734CNtrK41L40sUQckmj1lGKQ0rC37x544r8eNXRpnVE3ZZY7zXo8NomiO0ZUCj2uHz58rbXoZ6gc0uA+F6ZeKS/jhRDUq8MKrTho9fEkihMmhxtBI1DxKFY9XLpVcSkfoi8JGnToZO5sU5aiDQIW716ddt7ZLYtMQlhECdBGXZZMWldY5BHm5xgAroWj4C0hbYkSc/jBmggIrXJWlZM6pSETsEPGqZOndr2uuuR5rF169a2HoHPdurUKZM4CO1WTPqaDaAd+GFGKdIQkxAn9RuEWcTRyN2KSUgiSgF5aWzPTeA/lN5rZubMmR2bE4SIC4nJoltgAV/dVefZm72AtctUCJU2CMJ327hxY9t7EHbkyJFseq+EJSY16RPo3Dkq1kkr7+q0bNmyDuLQcZBEPYmHVdOBiJyIlrRDq41YPWfXOxUysi5fvtyaj+2BpcnsUV/oSoEMOk2CQGlr4ckhBwaetBhjCwH0ZHtJROPJkyc7UjcYLDjmrH7ADTEBXFfOYmB0k9oYBOjJ8b4aOYSe7QkKcYhFlq3QYLQhSidNmtS2RATwy8YOM3EQJsUjKiaWZ+vZToUQgzhkHXudb/PW5YMHD9yZM2faPsMwoc7RciYJXbGuBqJ1UIGKKLv915
jsvgtJxCZDubdXr165mzdvtr1Hz5LONA8jrUwKPqsmVesKa49S3Q4WxmRPUEYdTjgiUcfUwLx589ySJUva3oMkP6IYddq6HMS4o55xB",
          },
        ],
      },
    ],
})

const completion = await openai.completions.create({
    model: "llama3.2",
    prompt: "Say this is a test.",
})

const listCompletion = await openai.models.list()

const model = await openai.models.retrieve("llama3.2")

const embedding = await openai.embeddings.create({
  model: "all-minilm",
  input: ["why is the sky blue?", "why is the grass green?"],
})
```
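
The vision examples above pass the image as a base64 `data:` URL. A minimal Python sketch of building that URL from raw PNG bytes (in practice the bytes would be read from a file such as `image.png`; the short byte string below is only a stand-in):

```python
import base64

def to_data_url(png_bytes: bytes) -> str:
    # Build the "data:image/png;base64,..." URL used in image_url content parts.
    encoded = base64.b64encode(png_bytes).decode("ascii")
    return f"data:image/png;base64,{encoded}"

# Normally: to_data_url(open("image.png", "rb").read());
# the PNG signature below is just a placeholder.
url = to_data_url(b"\x89PNG\r\n\x1a\n")
```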

### `curl`

```shell
curl http://localhost:11434/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "llama3.2",
        "messages": [
            {
                "role": "system",
                "content": "You are a helpful assistant."
            },
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    }'

curl http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llava",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What'\''s in this image?"
          },
          {
            "type": "image_url",
            "image_url": {
								               "url": "data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAG0AAABmCAYAAADBPx+VAAAACXBIWXMAAAsTAAALEwEAmpwYAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAA3VSURBVHgB7Z27r0zdG8fX743i1bi1ikMoFMQloXRpKFFIqI7LH4BEQ+NWIkjQuSWCRIEoULk0gsK1kCBI0IhrQVT7tz/7zZo888yz1r7MnDl7z5xvsjkzs2fP3uu71nNfa7lkAsm7d++Sffv2JbNmzUqcc8m0adOSzZs3Z+/XES4ZckAWJEGWPiCxjsQNLWmQsWjRIpMseaxcuTKpG/7HP27I8P79e7dq1ars/yL4/v27S0ejqwv+cUOGEGGpKHR37tzJCEpHV9tnT58+dXXCJDdECBE2Ojrqjh071hpNECjx4cMHVycM1Uhbv359B2F79+51586daxN/+pyRkRFXKyRDAqxEp4yMlDDzXG1NPnnyJKkThoK0VFd1ELZu3TrzXKxKfW7dMBQ6bcuWLW2v0VlHjx41z717927ba22U9APcw7Nnz1oGEPeL3m3p2mTAYYnFmMOMXybPPXv2bNIPpFZr1NHn4HMw0KRBjg9NuRw95s8PEcz/6DZELQd/09C9QGq5RsmSRybqkwHGjh07OsJSsYYm3ijPpyHzoiacg35MLdDSIS/O1yM778jOTwYUkKNHWUzUWaOsylE00MyI0fcnOwIdjvtNdW/HZwNLGg+sR1kMepSNJXmIwxBZiG8tDTpEZzKg0GItNsosY8USkxDhD0Rinuiko2gfL/RbiD2LZAjU9zKQJj8RDR0vJBR1/Phx9+PHj9Z7REF4nTZkxzX4LCXHrV271qXkBAPGfP/atWvu/PnzHe4C97F48eIsRLZ9+3a3f/9+87dwP1JxaF7/3r17ba+5l4EcaVo0lj3SBq5kGTJSQmLWMjgYNei2GPT1MuMqGTDEFHzeQSP2wi/jGnkmPJ/nhccs44jvDAxpVcxnq0F6eT8h4ni/iIWpR5lPyA6ETkNXoSukvpJAD3AsXLiwpZs49+fPn5ke4j10TqYvegSfn0OnafC+Tv9ooA/JPkgQysqQNBzagXY55nO/oa1F7qvIPWkRL12WRpMWUvpVDYmxAPehxWSe8ZEXL20sadYIozfmNch4QJPAfeJgW3rNsnzphBKNJM2KKODo1rVOMRYik5ETy3ix4qWNI81qAAirizgMIc+yhTytx0JWZuNI03qsrgWlGtwjoS9XwgUhWGyhUaRZZQNNIEwCiXD16tXcAHUs79co0vSD8rrJCIW98pzvxpAWyyo3HYwqS0+H0BjStClcZJT5coMm6D2LOF8TolGJtK9fvyZpyiC5ePFi9nc/oJU4eiEP0jVoAnHa9wyJycITMP78+eMeP37sXrx44d6+fdt6f82aNdkx1pg9e3Zb5W+RSRE+n+VjksQWifvVaTKFhn5O8my63K8Qabdv33b379/PiAP//vuvW7BggZszZ072/+TJk91YgkafPn166zXB1rQHFvouAWHq9z3SEevSUerqCn2/dDCeta2jxYbr69evk4MHDyY7d+7MjhMnTiTPnz9Pfv/+nfQT2ggpO2dMF8cghuoM7Ygj5iWCqRlGFml0QC/ftGmTmzt3rmsaKDsgBSPh0/8yPeLLBihLkOKJc0jp8H8vUzcxIA1k6QJ/c78tWEyj5P3o4u9+jywNPdJi5rAH9x0KHcl4Hg570eQp3+vHXGyrmEeigzQsQsjavXt38ujRo44LQuDDhw+TW7duRS1HGgMxhNXHgflaNTOsHyKvHK5Ijo2jbFjJBQK9YwFd6RVMzfgRBmEfP37suBBm/p49e1qjEP2mwTViNRo0VJWH1deMXcNK08uUjVUu7s/zRaL+oLNxz1bpANco4npUgX4G2eFbpDFyQoQxojBCpEGSytmOH8qrH5Q9vuzD6ofQylk
CUmh8DBAr+q8JCyVNtWQIidKQE9wNtLSQnS4jDSsxNHogzFuQBw4cyM61UKVsjfr3ooBkPSqqQHesUPWVtzi9/vQi1T+rJj7WiTz4Pt/l3LxUkr5P2VYZaZ4URpsE+st/dujQoaBBYokbrz/8TJNQYLSonrPS9kUaSkPeZyj1AWSj+d+VBoy1pIWVNed8P0Ll/ee5HdGRhrHhR5GGN0r4LGZBaj8oFDJitBTJzIZgFcmU0Y8ytWMZMzJOaXUSrUs5RxKnrxmbb5YXO9VGUhtpXldhEUogFr3IzIsvlpmdosVcGVGXFWp2oU9kLFL3dEkSz6NHEY1sjSRdIuDFWEhd8KxFqsRi1uM/nz9/zpxnwlESONdg6dKlbsaMGS4EHFHtjFIDHwKOo46l4TxSuxgDzi+rE2jg+BaFruOX4HXa0Nnf1lwAPufZeF8/r6zD97WK2qFnGjBxTw5qNGPxT+5T/r7/7RawFC3j4vTp09koCxkeHjqbHJqArmH5UrFKKksnxrK7FuRIs8STfBZv+luugXZ2pR/pP9Ois4z+TiMzUUkUjD0iEi1fzX8GmXyuxUBRcaUfykV0YZnlJGKQpOiGB76x5GeWkWWJc3mOrK6S7xdND+W5N6XyaRgtWJFe13GkaZnKOsYqGdOVVVbGupsyA/l7emTLHi7vwTdirNEt0qxnzAvBFcnQF16xh/TMpUuXHDowhlA9vQVraQhkudRdzOnK+04ZSP3DUhVSP61YsaLtd/ks7ZgtPcXqPqEafHkdqa84X6aCeL7YWlv6edGFHb+ZFICPlljHhg0bKuk0CSvVznWsotRu433alNdFrqG45ejoaPCaUkWERpLXjzFL2Rpllp7PJU2a/v7Ab8N05/9t27Z16KUqoFGsxnI9EosS2niSYg9SpU6B4JgTrvVW1flt1sT+0ADIJU2maXzcUTraGCRaL1Wp9rUMk16PMom8QhruxzvZIegJjFU7LLCePfS8uaQdPny4jTTL0dbee5mYokQsXTIWNY46kuMbnt8Kmec+LGWtOVIl9cT1rCB0V8WqkjAsRwta93TbwNYoGKsUSChN44lgBNCoHLHzquYKrU6qZ8lolCIN0Rh6cP0Q3U6I6IXILYOQI513hJaSKAorFpuHXJNfVlpRtmYBk1Su1obZr5dnKAO+L10Hrj3WZW+E3qh6IszE37F6EB+68mGpvKm4eb9bFrlzrok7fvr0Kfv727dvWRmdVTJHw0qiiCUSZ6wCK+7XL/AcsgNyL74DQQ730sv78Su7+t/A36MdY0sW5o40ahslXr58aZ5HtZB8GH64m9EmMZ7FpYw4T6QnrZfgenrhFxaSiSGXtPnz57e9TkNZLvTjeqhr734CNtrK41L40sUQckmj1lGKQ0rC37x544r8eNXRpnVE3ZZY7zXo8NomiO0ZUCj2uHz58rbXoZ6gc0uA+F6ZeKS/jhRDUq8MKrTho9fEkihMmhxtBI1DxKFY9XLpVcSkfoi8JGnToZO5sU5aiDQIW716ddt7ZLYtMQlhECdBGXZZMWldY5BHm5xgAroWj4C0hbYkSc/jBmggIrXJWlZM6pSETsEPGqZOndr2uuuR5rF169a2HoHPdurUKZM4CO1WTPqaDaAd+GFGKdIQkxAn9RuEWcTRyN2KSUgiSgF5aWzPTeA/lN5rZubMmR2bE4SIC4nJoltgAV/dVefZm72AtctUCJU2CMJ327hxY9t7EHbkyJFseq+EJSY16RPo3Dkq1kkr7+q0bNmyDuLQcZBEPYmHVdOBiJyIlrRDq41YPWfXOxUysi5fvtyaj+2BpcnsUV/oSoEMOk2CQGlr4ckhBwaetBhjCwH0ZHtJROPJkyc7UjcYLDjmrH7ADTEBXFfOYmB0k9oYBOjJ8b4aOYSe7QkKcYhFlq3QYLQhSidNmtS2RATwy8YOM3EQJsUjKiaWZ+vZToUQgzhkHXudb/PW5YMHD9yZM2faPsMwoc7RciYJXbGuBqJ1UIGKKLv915j
svgtJxCZDubdXr165mzdvtr1Hz5LONA8jrUwKPqsmVesKa49S3Q4WxmRPUEYdTjgiUcfUwLx589ySJUva3oMkP6IYddq6HMS4o55xBJ
            }
          }
        ]
      }
    ],
    "max_tokens": 300
  }'

curl http://localhost:11434/v1/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "llama3.2",
        "prompt": "Say this is a test"
    }'

curl http://localhost:11434/v1/models

curl http://localhost:11434/v1/models/llama3.2

curl http://localhost:11434/v1/embeddings \
    -H "Content-Type: application/json" \
    -d '{
        "model": "all-minilm",
        "input": ["why is the sky blue?", "why is the grass green?"]
    }'
```
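
The same requests can be issued from any HTTP client. A minimal Python sketch that builds the chat completions request with the standard library (a running server at `http://localhost:11434` is assumed and is not contacted here):

```python
import json
import urllib.request

def chat_request(model, messages):
    # Build a POST request equivalent to the /v1/chat/completions curl example.
    payload = {"model": model, "messages": messages}
    return urllib.request.Request(
        "http://localhost:11434/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = chat_request("llama3.2", [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
])
# urllib.request.urlopen(req) would send it to a running Ollama server.
```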

## Endpoints

### `/v1/chat/completions`

#### Supported features

- [x] Chat completions
- [x] Streaming
- [x] JSON mode
- [x] Reproducible outputs
- [x] Vision
- [x] Tools
- [ ] Logprobs
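
With `stream` enabled, responses arrive as OpenAI-style server-sent events: each line carries `data: {json chunk}` and the stream ends with `data: [DONE]`. A minimal Python sketch of reassembling streamed content (the canned lines below are illustrative, not captured server output):

```python
import json

def collect_stream(lines):
    # Concatenate the delta content fields from SSE lines of a streamed completion.
    parts = []
    for line in lines:
        if not line.startswith("data: "):
            continue
        data = line[len("data: "):]
        if data == "[DONE]":
            break
        delta = json.loads(data)["choices"][0]["delta"]
        if "content" in delta:
            parts.append(delta["content"])
    return "".join(parts)

canned = [
    'data: {"choices": [{"delta": {"role": "assistant", "content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo!"}}]}',
    "data: [DONE]",
]
# collect_stream(canned) reassembles "Hello!"
```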

#### Supported request fields

- [x] `model`
- [x] `messages`
  - [x] Text `content`
  - [x] Image `content`
    - [x] Base64 encoded image
    - [ ] Image URL
  - [x] Array of `content` parts
- [x] `frequency_penalty`
- [x] `presence_penalty`
- [x] `response_format`
- [x] `seed`
- [x] `stop`
- [x] `stream`
- [x] `stream_options`
  - [x] `include_usage`
- [x] `temperature`
- [x] `top_p`
- [x] `max_tokens`
- [x] `tools`
- [ ] `tool_choice`
- [ ] `logit_bias`
- [ ] `user`
- [ ] `n`
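
A request body exercising several of the ticked fields above might look like the following sketch (values are illustrative; fields left unticked, such as `logit_bias` and `n`, are listed as unsupported):

```python
import json

# Illustrative /v1/chat/completions payload using supported request fields.
payload = {
    "model": "llama3.2",
    "messages": [{"role": "user", "content": "Say this is a test"}],
    "temperature": 0.7,
    "top_p": 0.9,
    "seed": 42,        # a fixed seed enables reproducible outputs
    "stop": ["\n\n"],
    "max_tokens": 100,
    "stream": False,
}
body = json.dumps(payload)
```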

### `/v1/completions`

#### Supported features

- [x] Completions
- [x] Streaming
- [x] JSON mode
- [x] Reproducible outputs
- [ ] Logprobs

#### Supported request fields

- [x] `model`
- [x] `prompt`
- [x] `frequency_penalty`
- [x] `presence_penalty`
- [x] `seed`
- [x] `stop`
- [x] `stream`
- [x] `stream_options`
  - [x] `include_usage`
- [x] `temperature`
- [x] `top_p`
- [x] `max_tokens`
- [x] `suffix`
- [ ] `best_of`
- [ ] `echo`
- [ ] `logit_bias`
- [ ] `user`
- [ ] `n`
							 
						 
					
						
							
								
							 
							
								
							 
							
								 
							 
							
							
								
							 
						 
					
						
							
								
							 
							
								
							 
							
								 
							 
							
							

#### Notes

- `prompt` currently only accepts a string
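
The fields above can be exercised with a request like the following sketch (the model name `llama3.2` is an example; any locally pulled model works):

```shell
# Example completion request; llama3.2 is an assumed local model.
curl http://localhost:11434/v1/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "llama3.2",
        "prompt": "Say this is a test",
        "max_tokens": 20,
        "temperature": 0,
        "seed": 42
    }'
```

Fixing `seed` and using a low `temperature` should make repeated calls return reproducible output.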

### `/v1/models`

#### Notes

- `created` corresponds to when the model was last modified
- `owned_by` corresponds to the ollama username, defaulting to `"library"`

### `/v1/models/{model}`

#### Notes

- `created` corresponds to when the model was last modified
- `owned_by` corresponds to the ollama username, defaulting to `"library"`

### `/v1/embeddings`

#### Supported request fields

- [x] `model`
- [x] `input`
  - [x] string
  - [x] array of strings
  - [ ] array of tokens
  - [ ] array of token arrays
- [ ] `encoding_format`
- [ ] `dimensions`
- [ ] `user`
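
A sketch of an embeddings request with an array-of-strings input (the model name `all-minilm` is an example embedding model, assumed to be pulled locally):

```shell
# Example embeddings request with two input strings.
curl http://localhost:11434/v1/embeddings \
    -H "Content-Type: application/json" \
    -d '{
        "model": "all-minilm",
        "input": ["why is the sky blue?", "why is the grass green?"]
    }'
```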

## Models

Before using a model, pull it locally with `ollama pull`:

```shell
ollama pull llama3.2
```

### Default model names

For tooling that relies on default OpenAI model names such as `gpt-3.5-turbo`, use `ollama cp` to copy an existing model name to a temporary name:

```shell
ollama cp llama3.2 gpt-3.5-turbo
```

Afterwards, this new model name can be specified in the `model` field:

```shell
curl http://localhost:11434/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "gpt-3.5-turbo",
        "messages": [
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    }'
```

### Setting the context size

The OpenAI API does not have a way of setting the context size for a model. If you need to change the context size, create a `Modelfile` which looks like:

```
FROM <some model>
PARAMETER num_ctx <context size>
```

Use the `ollama create mymodel` command to create a new model with the updated context size. Call the API with the updated model name:

```shell
curl http://localhost:11434/v1/chat/completions \
    -H "Content-Type: application/json" \
    -d '{
        "model": "mymodel",
        "messages": [
            {
                "role": "user",
                "content": "Hello!"
            }
        ]
    }'
```