Interface ChatVertexAIInput

Input to a Google Vertex AI chat model class.

interface ChatVertexAIInput {
    apiKey?: string;
    apiVersion?: string;
    authOptions?: WebGoogleAuthOptions;
    cache?: boolean | BaseCache<Generation[]>;
    callbackManager?: CallbackManager;
    callbacks?: Callbacks;
    convertSystemMessageToHumanContent?: boolean;
    endpoint?: string;
    location?: string;
    maxConcurrency?: number;
    maxOutputTokens?: number;
    maxRetries?: number;
    mediaManager?: MediaManager;
    metadata?: Record<string, unknown>;
    model?: string;
    modelName?: string;
    onFailedAttempt?: FailedAttemptHandler;
    platformType?: GooglePlatformType;
    responseMimeType?: GoogleAIResponseMimeType;
    safetyHandler?: GoogleAISafetyHandler;
    safetySettings?: GoogleAISafetySetting[];
    stopSequences?: string[];
    streamUsage?: boolean;
    streaming?: boolean;
    tags?: string[];
    temperature?: number;
    topK?: number;
    topP?: number;
    verbose?: boolean;
}

Hierarchy

ChatGoogleInput
- ChatVertexAIInput

Properties

`Optional`apiKey

apiKey?: string

Some APIs allow an API key instead

`Optional`apiVersion

apiVersion?: string

The version of the API functions. Part of the path.

`Optional`authOptions

authOptions?: WebGoogleAuthOptions

`Optional`cache

cache?: boolean | BaseCache<Generation[]>

`Optional`callbackManager

callbackManager?: CallbackManager

Deprecated

Use callbacks instead

`Optional`callbacks

callbacks?: Callbacks

`Optional`convertSystemMessageToHumanContent

convertSystemMessageToHumanContent?: boolean

`Optional`endpoint

endpoint?: string

Hostname for the API call (if this is running on GCP)

`Optional`location

location?: string

Region where the LLM is stored (if this is running on GCP)

`Optional`maxConcurrency

maxConcurrency?: number

The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.

`Optional`maxOutputTokens

maxOutputTokens?: number

Maximum number of tokens to generate in the completion.

`Optional`maxRetries

maxRetries?: number

The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.

`Optional`mediaManager

mediaManager?: MediaManager

`Optional`metadata

metadata?: Record<string, unknown>

`Optional`model

model?: string

Model to use

`Optional`modelName

modelName?: string

Model to use Alias for model

`Optional`onFailedAttempt

onFailedAttempt?: FailedAttemptHandler

Custom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.

`Optional`platformType

platformType?: GooglePlatformType

What platform to run the service on. If not specified, the class should determine this from other means. Either way, the platform actually used will be in the "platform" getter.

`Optional`responseMimeType

responseMimeType?: GoogleAIResponseMimeType

Available for gemini-1.5-pro. The output format of the generated candidate text. Supported MIME types:

text/plain: Text output.
application/json: JSON response in the candidates.

Default

"text/plain"

`Optional`safetyHandler

safetyHandler?: GoogleAISafetyHandler

`Optional`safetySettings

safetySettings?: GoogleAISafetySetting[]

`Optional`stopSequences

stopSequences?: string[]

`Optional`streamUsage

streamUsage?: boolean

Whether or not to include usage data, like token counts in the streamed response chunks.

Default

true

`Optional`streaming

streaming?: boolean

Whether or not to stream.

Default

false

`Optional`tags

tags?: string[]

`Optional`temperature

temperature?: number

Sampling temperature to use

`Optional`topK

topK?: number

Top-k changes how the model selects tokens for output.

A top-k of 1 means the selected token is the most probable among all tokens in the model’s vocabulary (also called greedy decoding), while a top-k of 3 means that the next token is selected from among the 3 most probable tokens (using temperature).

`Optional`topP

topP?: number

Top-p changes how the model selects tokens for output.

Tokens are selected from most probable to least until the sum of their probabilities equals the top-p value.

For example, if tokens A, B, and C have a probability of .3, .2, and .1 and the top-p value is .5, then the model will select either A or B as the next token (using temperature).

`Optional`verbose

verbose?: boolean

Interface ChatVertexAIInput

Hierarchy

Index

Properties

Properties

`Optional`apiKey

`Optional`apiVersion

`Optional`authOptions

`Optional`cache

`Optional`callbackManager

Deprecated

`Optional`callbacks

`Optional`convertSystemMessageToHumanContent

`Optional`endpoint

`Optional`location

`Optional`maxConcurrency

`Optional`maxOutputTokens

`Optional`maxRetries

`Optional`mediaManager

`Optional`metadata

`Optional`model

`Optional`modelName

`Optional`onFailedAttempt

`Optional`platformType

`Optional`responseMimeType

Default

`Optional`safetyHandler

`Optional`safetySettings

`Optional`stopSequences

`Optional`streamUsage

Default

`Optional`streaming

Default

`Optional`tags

`Optional`temperature

`Optional`topK

`Optional`topP

`Optional`verbose

Settings

On This Page

Interface ChatVertexAIInput

Hierarchy

Index

Properties

Properties

OptionalapiKey

OptionalapiVersion

OptionalauthOptions

Optionalcache

OptionalcallbackManager

Deprecated

Optionalcallbacks

OptionalconvertSystemMessageToHumanContent

Optionalendpoint

Optionallocation

OptionalmaxConcurrency

OptionalmaxOutputTokens

OptionalmaxRetries

OptionalmediaManager

Optionalmetadata

Optionalmodel

OptionalmodelName

OptionalonFailedAttempt

OptionalplatformType

OptionalresponseMimeType

Default

OptionalsafetyHandler

OptionalsafetySettings

OptionalstopSequences

OptionalstreamUsage

Default

Optionalstreaming

Default

Optionaltags

Optionaltemperature

OptionaltopK

OptionaltopP

Optionalverbose

Settings

On This Page

`Optional`apiKey

`Optional`apiVersion

`Optional`authOptions

`Optional`cache

`Optional`callbackManager

`Optional`callbacks

`Optional`convertSystemMessageToHumanContent

`Optional`endpoint

`Optional`location

`Optional`maxConcurrency

`Optional`maxOutputTokens

`Optional`maxRetries

`Optional`mediaManager

`Optional`metadata

`Optional`model

`Optional`modelName

`Optional`onFailedAttempt

`Optional`platformType

`Optional`responseMimeType

`Optional`safetyHandler

`Optional`safetySettings

`Optional`stopSequences

`Optional`streamUsage

`Optional`streaming

`Optional`tags

`Optional`temperature

`Optional`topK

`Optional`topP

`Optional`verbose