Input parameters for DeepInfra embeddings.

interface DeepInfraEmbeddingsParams {
    apiToken?: string;
    batchSize?: number;
    maxConcurrency?: number;
    maxRetries?: number;
    modelName?: string;
    onFailedAttempt?: FailedAttemptHandler;
}

Hierarchy

  • EmbeddingsParams
    • DeepInfraEmbeddingsParams

Properties

apiToken?: string

The API token to use for authentication. If not provided, it will be read from the DEEPINFRA_API_TOKEN environment variable.
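The fallback described above can be sketched as follows. This is a minimal illustration, not the library's actual code: `resolveApiToken` is a hypothetical helper, the environment is passed in as a plain object so the snippet stays self-contained, and throwing when no token is found is an assumption.

```typescript
// Hypothetical helper illustrating the documented fallback order:
// an explicit apiToken wins, otherwise DEEPINFRA_API_TOKEN is used.
function resolveApiToken(
  apiToken: string | undefined,
  env: Record<string, string | undefined>
): string {
  const token = apiToken ?? env["DEEPINFRA_API_TOKEN"];
  if (!token) {
    // Assumption: a missing token is treated as a configuration error.
    throw new Error(
      "No DeepInfra API token: pass apiToken or set DEEPINFRA_API_TOKEN."
    );
  }
  return token;
}
```

In a Node.js program the second argument would typically be `process.env`.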

batchSize?: number

The maximum number of texts to embed in a single request. The DeepInfra API caps this at 1024.
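To make the effect of batchSize concrete, the sketch below splits a list of texts into request-sized batches. `chunkTexts` and `MAX_BATCH_SIZE` are hypothetical names, and clamping an oversized batchSize to 1024 is an assumption about reasonable behavior, not a statement about how the library handles it.

```typescript
// Upper bound stated in the documentation above.
const MAX_BATCH_SIZE = 1024;

// Hypothetical helper: split texts into batches of at most batchSize items.
function chunkTexts(texts: string[], batchSize: number): string[][] {
  // Assumption: out-of-range values are clamped to the range [1, 1024].
  const size = Math.max(1, Math.min(Math.floor(batchSize), MAX_BATCH_SIZE));
  const batches: string[][] = [];
  for (let i = 0; i < texts.length; i += size) {
    batches.push(texts.slice(i, i + size));
  }
  return batches;
}
```

Each inner array would then correspond to one embedding request.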

maxConcurrency?: number

The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.
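One way to picture a concurrency cap is a fixed pool of workers pulling tasks from a shared queue. The sketch below is illustrative only: `runWithConcurrency` is a hypothetical helper, not the mechanism the library uses internally.

```typescript
// Hypothetical helper: run async tasks with at most maxConcurrency in flight.
// With the default of Infinity, every task starts immediately.
async function runWithConcurrency<T>(
  tasks: Array<() => Promise<T>>,
  maxConcurrency: number = Infinity
): Promise<T[]> {
  const results = new Array<T>(tasks.length);
  let next = 0; // index of the next task that has not been started yet

  // Never start more workers than tasks, and always at least one.
  const workerCount = Math.max(1, Math.min(maxConcurrency, tasks.length));

  async function worker(): Promise<void> {
    while (next < tasks.length) {
      const index = next++;
      results[index] = await tasks[index]();
    }
  }

  await Promise.all(Array.from({ length: workerCount }, () => worker()));
  return results; // same order as the input tasks
}
```

Results keep the order of the input tasks even though completion order may differ.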

maxRetries?: number

The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.

modelName?: string

The model ID to use for generating embeddings. Defaults to sentence-transformers/clip-ViT-B-32.

onFailedAttempt?: FailedAttemptHandler

Custom handler for failed attempts. It receives the originally thrown error object and should itself throw an error if the input error is not retryable.
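The interaction between maxRetries and onFailedAttempt can be sketched with a small retry loop. Everything here is an assumption made for illustration: `callWithRetry`, `RetryHandler`, `baseDelayMs`, the injectable `sleep`, and the doubling delay schedule are hypothetical, and treating HTTP 401/403 as non-retryable is just one plausible policy. The library's real retry mechanism may differ.

```typescript
// Local stand-in for the handler type; the library's own definition may differ.
type RetryHandler = (error: unknown) => void;

// Example handler: rethrow authentication errors so they are not retried.
const failFastOnAuthError: RetryHandler = (error) => {
  const status = (error as { status?: number } | null)?.status;
  if (status === 401 || status === 403) {
    throw error;
  }
};

// Hypothetical retry loop with exponential backoff between attempts.
async function callWithRetry<T>(
  fn: () => Promise<T>,
  options: {
    maxRetries?: number;
    onFailedAttempt?: RetryHandler;
    baseDelayMs?: number; // assumed starting delay, doubled per retry
    sleep?: (ms: number) => Promise<void>;
  } = {}
): Promise<T> {
  const {
    maxRetries = 6, // default documented above
    onFailedAttempt = () => {},
    baseDelayMs = 1000,
    sleep = (ms: number) =>
      new Promise<void>((resolve) => setTimeout(resolve, ms)),
  } = options;

  let attempt = 0;
  while (true) {
    try {
      return await fn();
    } catch (error) {
      // The handler aborts retrying by throwing.
      onFailedAttempt(error);
      if (attempt >= maxRetries) {
        throw error; // retries exhausted
      }
      await sleep(baseDelayMs * 2 ** attempt);
      attempt++;
    }
  }
}
```

With maxRetries set to 6, a call that keeps failing with a retryable error is attempted seven times in total: the initial attempt plus six retries.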