<div align="center">
  <picture>
    <source media="(prefers-color-scheme: dark)" height="200px" srcset="https://github.com/jmorganca/ollama/assets/3325447/56ea1849-1284-4645-8970-956de6e51c3c">
    <img alt="logo" height="200px" src="https://github.com/jmorganca/ollama/assets/3325447/0d0b44e2-8f4a-4e99-9b52-a5c1c741c8f7">
  </picture>
</div>

# Ollama

[Discord](https://discord.gg/ollama)

Create, run, and share large language models (LLMs). Ollama bundles a model’s weights, configuration, prompts, and more into self-contained packages that can run on any machine.

> Note: Ollama is in early preview. Please report any issues you find.
## Download

- [Download](https://ollama.ai/download) for macOS on Apple Silicon (Intel coming soon)
- Download for Windows and Linux (coming soon)
- Build [from source](#building)

## Quickstart
To run and chat with [Llama 2](https://ai.meta.com/llama), the new model by Meta:
```
ollama run llama2
```
## Model library

Ollama includes a library of open-source, pre-trained models, with more coming soon. You should have at least 8 GB of RAM to run the 3B models, 16 GB to run the 7B models, and 32 GB to run the 13B models. Larger variants of a model are pulled by tag, as shown in the example after the table.

| Model                    | Parameters | Size  | Download                    |
| ------------------------ | ---------- | ----- | --------------------------- |
| Llama 2                  | 7B         | 3.8GB | `ollama pull llama2`        |
| Llama 2 13B              | 13B        | 7.3GB | `ollama pull llama2:13b`    |
| Orca Mini                | 3B         | 1.9GB | `ollama pull orca`          |
| Vicuna                   | 7B         | 3.8GB | `ollama pull vicuna`        |
| Nous-Hermes              | 13B        | 7.3GB | `ollama pull nous-hermes`   |
| Wizard Vicuna Uncensored | 13B        | 7.3GB | `ollama pull wizard-vicuna` |
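For example, to fetch and chat with the 13B variant of Llama 2 (32 GB of RAM recommended, per the guidance above):

```
ollama pull llama2:13b
ollama run llama2:13b
```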
## Examples

### Run a model
```
ollama run llama2
>>> hi
Hello! How can I help you today?
```
### Create a custom character model

Pull a base model:
```
ollama pull llama2
```
Create a `Modelfile`:
```
FROM llama2
# set the temperature to 1 [higher is more creative, lower is more coherent]
PARAMETER temperature 1
# set the system prompt
SYSTEM """
You are Mario from Super Mario Bros. Answer as Mario, the assistant, only.
"""
```
Next, create and run the model:
```
ollama create mario -f ./Modelfile
ollama run mario
>>> hi
Hello! It's your friend Mario.
```
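To iterate on the character, edit the `Modelfile` and run the same `create` command again; the model should be rebuilt from the updated file:

```
ollama create mario -f ./Modelfile
```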
For more examples, see the [examples](./examples) directory.

### Pull a model from the registry

```
ollama pull orca
```
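Once pulled, the model can be run like any other:

```
ollama run orca
```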
## Building

```
go build .
```
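This assumes a working Go toolchain is installed. Assuming the repository lives at `github.com/jmorganca/ollama` (the account hosting the logo assets above), a build from a fresh clone looks like:

```
# clone the repository and build the ollama binary
git clone https://github.com/jmorganca/ollama
cd ollama
go build .
```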
To run it, first start the server:
```
./ollama serve &
```
Finally, run a model!

```
./ollama run llama2
```