| stage | group | description | info | title |
|---|---|---|---|---|
| AI-Powered | Custom Models | Configure your GitLab instance to use GitLab Duo Self-Hosted. | To determine the technical writer assigned to the Stage/Group associated with this page, see https://handbook.gitlab.com/handbook/product/ux/technical-writing/#assignments | Configure GitLab to access GitLab Duo Self-Hosted |
DETAILS:
- Tier: Ultimate with GitLab Duo Enterprise - Start a trial
- Offering: GitLab Self-Managed
- Status: Beta
- Introduced in GitLab 17.1 with a flag named `ai_custom_model`. Disabled by default.
- Enabled on GitLab Self-Managed in GitLab 17.6.
- Changed to require GitLab Duo add-on in GitLab 17.6 and later.
- Feature flag `ai_custom_model` removed in GitLab 17.8.
- Ability to set AI gateway URL using UI added in GitLab 17.9.
Prerequisites:
To configure your GitLab instance to access the available self-hosted models in your infrastructure:
- Confirm that a fully self-hosted configuration is appropriate for your use case.
- Configure your GitLab instance to access the AI gateway.
- Configure the self-hosted model.
- Configure the GitLab Duo features to use your self-hosted model.
Configure your GitLab instance to access the AI gateway
- On the left sidebar, at the bottom, select Admin.
- Select GitLab Duo.
- In the GitLab Duo section, select Change configuration.
- Under Local AI Gateway URL, enter the URL of your AI gateway.
- Select Save changes.
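
Before you paste a value into Local AI Gateway URL, a quick format check can catch typos. The following is a minimal sketch and not part of GitLab: the helper name `check_gateway_url` and the example hostname are hypothetical. It only confirms that the value is an absolute HTTP or HTTPS URL with a hostname, and it does not contact the AI gateway.

```python
from urllib.parse import urlsplit


def check_gateway_url(url: str) -> str:
    """Return the URL without surrounding whitespace, or raise ValueError.

    Hypothetical helper: it validates the form of the URL only and
    never opens a network connection.
    """
    candidate = url.strip()
    parts = urlsplit(candidate)
    if parts.scheme not in ("http", "https"):
        raise ValueError(f"expected an http or https URL, got {url!r}")
    if not parts.hostname:
        raise ValueError(f"no hostname found in {url!r}")
    return candidate


if __name__ == "__main__":
    # The hostname is a placeholder, not a real AI gateway address.
    print(check_gateway_url(" https://ai-gateway.example.com "))
```

A value that passes this check can still be wrong, for example if it points at a host where no AI gateway is running, so treat it as a typo guard only.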
Configure the self-hosted model
Prerequisites:
- You must be an administrator.
- You must have an Ultimate license.
- You must have a Duo Enterprise license add-on.
To configure a self-hosted model:
- On the left sidebar, at the bottom, select Admin.
- Select Self-hosted models.
  - If the Self-hosted models menu item is not available, synchronize your subscription after purchase:
    - On the left sidebar, select Subscription.
    - In Subscription details, to the right of Last sync, select synchronize subscription ({retry}).
- Select Add self-hosted model.
- Complete the fields:
  - Deployment name: Enter a name to uniquely identify the model deployment, for example, `Mixtral-8x7B-it-v0.1 on GCP`.
  - Model family: Select the model family the deployment belongs to. Only GitLab-approved models are in this list.
  - Endpoint: Enter the URL where the model is hosted.
    - For models hosted through vLLM, you must suffix the URL with `/v1`. The default port is `8000`, so the default endpoint URL is `https://<hostname>:8000/v1`.
  - API key: Optional. Add an API key if you need one to access the model.
  - Model identifier: This is a required field if your deployment method is vLLM, Bedrock or Azure. The value of this field is based on your deployment method, and should match the following structure:

    | Deployment method | Format | Example |
    |---|---|---|
    | vLLM | `custom_openai/<name of the model served through vLLM>` | `custom_openai/Mixtral-8x7B-Instruct-v0.1` |
    | Bedrock | `bedrock/<model ID of the model>` | `bedrock/mistral.mixtral-8x7b-instruct-v0:1` |
    | Azure OpenAI | `azure/<model ID of the model>` | `azure/gpt-35-turbo` |
    | Others | The field is optional | |
- Select Create self-hosted model.
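
The Endpoint and Model identifier rules in the field list can be sketched in code. This is an illustrative helper, not a GitLab API: the function names `vllm_endpoint` and `model_identifier`, the method keys (`vllm`, `bedrock`, `azure`), and the example hostname are hypothetical. Only the `/v1` suffix, the default port `8000`, and the identifier prefixes come from the field descriptions.

```python
from typing import Optional

# Prefixes taken from the Model identifier formats.
# The dictionary keys are hypothetical labels for the deployment methods.
IDENTIFIER_PREFIXES = {
    "vllm": "custom_openai",
    "bedrock": "bedrock",
    "azure": "azure",
}


def vllm_endpoint(hostname: str, port: int = 8000) -> str:
    """Build a vLLM endpoint URL that ends with the required /v1 suffix."""
    return f"https://{hostname}:{port}/v1"


def model_identifier(method: str, model: Optional[str] = None) -> Optional[str]:
    """Return '<prefix>/<model>' for vLLM, Bedrock, or Azure OpenAI.

    For any other deployment method the field is optional, so the
    given value is passed through unchanged (possibly None).
    """
    prefix = IDENTIFIER_PREFIXES.get(method.lower())
    if prefix is None:
        return model
    if not model:
        raise ValueError(f"a model identifier is required for {method!r}")
    return f"{prefix}/{model}"


if __name__ == "__main__":
    # Placeholder hostname; model names are the examples from the field list.
    print(vllm_endpoint("models.example.com"))
    print(model_identifier("vllm", "Mixtral-8x7B-Instruct-v0.1"))
    print(model_identifier("bedrock", "mistral.mixtral-8x7b-instruct-v0:1"))
    print(model_identifier("azure", "gpt-35-turbo"))
```

The helper raises an error when a required identifier is missing, which mirrors the rule that the field is required for vLLM, Bedrock, and Azure deployments and optional otherwise.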
Configure self-hosted beta models
Prerequisites:
- You must be an administrator.
- You must have an Ultimate license.
- You must have a Duo Enterprise license add-on.
To enable self-hosted beta models:
- On the left sidebar, at the bottom, select Admin.
- Select GitLab Duo.
- In the GitLab Duo section, select Change configuration.
- Under Self-hosted AI models, select Use beta self-hosted models features.
- Select Save changes.
NOTE: Turning on beta self-hosted models features also accepts the GitLab Testing Agreement.
For more information, see the list of available beta models under evaluation.
Configure GitLab Duo features to use self-hosted models
Prerequisites:
- You must be an administrator.
- You must have an Ultimate license.
- You must have a Duo Enterprise license add-on.
View configured features
- On the left sidebar, at the bottom, select Admin.
- Select Self-hosted models.
  - If the Self-hosted models menu item is not available, synchronize your subscription after purchase:
    - On the left sidebar, select Subscription.
    - In Subscription details, to the right of Last sync, select synchronize subscription ({retry}).
- Select the AI-powered features tab.
Configure the feature to use a self-hosted model
Configure the GitLab Duo feature to send queries to the configured self-hosted model:
- On the left sidebar, at the bottom, select Admin.
- Select Self-hosted models.
- Select the AI-powered features tab.
- For the feature you want to configure, from the dropdown list, choose the self-hosted model you want to use. For example, `Mistral`.