Create an inference endpoint | Elasticsearch API documentation (original) (raw)
Dismiss highlight Show more
Path parameters
- The task type
Values aresparse_embedding
,text_embedding
,rerank
,completion
, orchat_completion
. - The inference Id
application/json
Body Required
- Hide chunking_settings attributes Show chunking_settings attributes object
- The maximum size of a chunk in words. This value cannot be higher than
300
or lower than20
(forsentence
strategy) or10
(forword
strategy). - The number of overlapping words for chunks. It is applicable only to a
word
chunking strategy. This value cannot be higher than half themax_chunk_size
value. - The number of overlapping sentences for chunks. It is applicable only for a
sentence
chunking strategy. It can be either1
or0
. - The chunking strategy:
sentence
orword
.
- The maximum size of a chunk in words. This value cannot be higher than
- The service type
Responses
- 200 application/json
Hide response attributes Show response attributes object- Hide chunking_settings attributes Show chunking_settings attributes object
* The maximum size of a chunk in words. This value cannot be higher than300
or lower than20
(forsentence
strategy) or10
(forword
strategy).
* The number of overlapping words for chunks. It is applicable only to aword
chunking strategy. This value cannot be higher than half themax_chunk_size
value.
* The number of overlapping sentences for chunks. It is applicable only for asentence
chunking strategy. It can be either1
or0
.
* The chunking strategy:sentence
orword
. - The service type
- The inference Id
- Values are
sparse_embedding
,text_embedding
,rerank
,completion
, orchat_completion
.
- Hide chunking_settings attributes Show chunking_settings attributes object