Optional
apiThe API token to use for authentication.
If not provided, it will be read from the DEEPINFRA_API_TOKEN
environment variable.
Optional
batchThe maximum number of texts to embed in a single request. This is limited by the DeepInfra API to a maximum of 1024.
Optional
maxThe maximum number of concurrent calls that can be made.
Defaults to Infinity
, which means no limit.
Optional
maxThe maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.
Optional
modelThe model ID to use for generating completions.
Default: sentence-transformers/clip-ViT-B-32
Optional
onCustom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.
Input parameters for the DeepInfra embeddings