2023-09-29 16:12:07 +08:00
|
|
|
|
[role="xpack"]
|
|
|
|
|
[[inference-apis]]
|
|
|
|
|
== {infer-cap} APIs
|
|
|
|
|
|
2023-11-02 20:02:29 +08:00
|
|
|
|
experimental[]
|
|
|
|
|
|
2024-04-04 21:42:03 +08:00
|
|
|
|
IMPORTANT: The {infer} APIs enable you to use certain services, such as built-in
|
2024-08-14 22:04:11 +08:00
|
|
|
|
{ml} models (ELSER, E5), models uploaded through Eland, Cohere, OpenAI, Azure,
|
|
|
|
|
Google AI Studio or Hugging Face. For built-in models and models uploaded
|
|
|
|
|
through Eland, the {infer} APIs offer an alternative way to use and manage
|
|
|
|
|
trained models. However, if you do not plan to use the {infer} APIs to use these
|
|
|
|
|
models or if you want to use non-NLP models, use the
|
|
|
|
|
<<ml-df-trained-models-apis>>.
|
2024-01-09 22:46:46 +08:00
|
|
|
|
|
2024-04-04 21:42:03 +08:00
|
|
|
|
The {infer} APIs enable you to create {infer} endpoints and use {ml} models of
|
2024-08-14 22:04:11 +08:00
|
|
|
|
different providers - such as Amazon Bedrock, Anthropic, Azure AI Studio,
|
|
|
|
|
Cohere, Google AI, Mistral, OpenAI, or HuggingFace - as a service. Use
|
2024-04-04 21:42:03 +08:00
|
|
|
|
the following APIs to manage {infer} models and perform {infer}:
|
2023-09-29 16:12:07 +08:00
|
|
|
|
|
|
|
|
|
* <<delete-inference-api>>
|
|
|
|
|
* <<get-inference-api>>
|
|
|
|
|
* <<post-inference-api>>
|
|
|
|
|
* <<put-inference-api>>
|
|
|
|
|
|
2024-08-14 22:04:11 +08:00
|
|
|
|
[[inference-landscape]]
|
|
|
|
|
.A representation of the Elastic inference landscape
|
|
|
|
|
image::images/inference-landscape.png[A representation of the Elastic inference landscape,align="center"]
|
|
|
|
|
|
|
|
|
|
An {infer} endpoint enables you to use the corresponding {ml} model without
|
|
|
|
|
manual deployment and apply it to your data at ingestion time through
|
|
|
|
|
<<semantic-search-semantic-text, semantic text>>.
|
|
|
|
|
|
|
|
|
|
Choose a model from your provider or use ELSER – a retrieval model trained by
|
|
|
|
|
Elastic –, then create an {infer} endpoint by the <<put-inference-api>>.
|
|
|
|
|
Now use <<semantic-search-semantic-text, semantic text>> to perform
|
|
|
|
|
<<semantic-search, semantic search>> on your data.
|
2023-09-29 16:12:07 +08:00
|
|
|
|
|
|
|
|
|
include::delete-inference.asciidoc[]
|
|
|
|
|
include::get-inference.asciidoc[]
|
|
|
|
|
include::post-inference.asciidoc[]
|
2024-05-16 19:22:01 +08:00
|
|
|
|
include::put-inference.asciidoc[]
|
2024-08-29 15:17:27 +08:00
|
|
|
|
include::service-alibabacloud-ai-search.asciidoc[]
|
2024-07-12 22:14:54 +08:00
|
|
|
|
include::service-amazon-bedrock.asciidoc[]
|
2024-08-07 17:18:43 +08:00
|
|
|
|
include::service-anthropic.asciidoc[]
|
2024-07-01 16:12:16 +08:00
|
|
|
|
include::service-azure-ai-studio.asciidoc[]
|
|
|
|
|
include::service-azure-openai.asciidoc[]
|
|
|
|
|
include::service-cohere.asciidoc[]
|
|
|
|
|
include::service-elasticsearch.asciidoc[]
|
|
|
|
|
include::service-elser.asciidoc[]
|
|
|
|
|
include::service-google-ai-studio.asciidoc[]
|
2024-07-02 20:47:14 +08:00
|
|
|
|
include::service-google-vertex-ai.asciidoc[]
|
2024-07-01 16:12:16 +08:00
|
|
|
|
include::service-hugging-face.asciidoc[]
|
|
|
|
|
include::service-mistral.asciidoc[]
|
|
|
|
|
include::service-openai.asciidoc[]
|