These settings are specific to the `alibabacloud-ai-search` service.
--
`api_key`:::
(Required, string)
A valid API key for the AlibabaCloud AI Search API.
`service_id`:::
(Required, string)
The name of the model service to use for the {infer} task.
+
--
Available service_ids for the `text_embedding` task:

* `ops-text-embedding-001`
* `ops-text-embedding-zh-001`
* `ops-text-embedding-en-001`
* `ops-text-embedding-002`

For the supported `text_embedding` service_ids, refer to the https://help.aliyun.com/zh/open-search/search-platform/developer-reference/text-embedding-api-details[documentation].

Available service_id for the `sparse_embedding` task:

* `ops-text-sparse-embedding-001`

For the supported `sparse_embedding` service_id, refer to the https://help.aliyun.com/zh/open-search/search-platform/developer-reference/text-sparse-embedding-api-details[documentation].

Available service_id for the `rerank` task:

* `ops-bge-reranker-larger`

For the supported `rerank` service_id, refer to the https://help.aliyun.com/zh/open-search/search-platform/developer-reference/ranker-api-details[documentation].
--
`host`:::
(Required, string)
The host address used for the {infer} task. You can find the host address in https://opensearch.console.aliyun.com/cn-shanghai/rag/api-key[the API keys section] of the documentation.
`workspace`:::
(Required, string)
The name of the workspace used for the {infer} task.
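+
--
As a rough illustration, the required settings described so far might be combined into a `service_settings` object like the sketch below. The angle-bracket values are placeholders, not real credentials or endpoints, and `ops-text-embedding-001` is only one of the `text_embedding` service_ids listed for `service_id`; substitute the one that matches your task.

[source,text]
----
"service_settings": {
    "api_key": "<api_key>",
    "service_id": "ops-text-embedding-001",
    "host": "<host>",
    "workspace": "<workspace>"
}
----
--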
`rate_limit`:::
(Optional, object)
By default, the `alibabacloud-ai-search` service sets the number of requests allowed per minute to `1000`.
This helps to minimize the number of rate limit errors returned from AlibabaCloud AI Search.
To modify this, set the `requests_per_minute` setting of this object in your service settings: