Class ChatGoogleVertexAI

Enables calls to the Google Cloud's Vertex AI API to access Large Language Models in a chat-like fashion.

To use, you will need to have one of the following authentication methods in place:

You are logged into an account permitted to the Google Cloud project using Vertex AI.
You are running this on a machine using a service account permitted to the Google Cloud project using Vertex AI.
The GOOGLE_APPLICATION_CREDENTIALS environment variable is set to the path of a credentials file for a service account permitted to the Google Cloud project using Vertex AI.

Example

const model = new ChatGoogleVertexAI({
  temperature: 0.7,
});
const result = await model.invoke("What is the capital of France?");

Hierarchy

BaseChatGoogleVertexAI<GoogleAuthOptions>
- ChatGoogleVertexAI

Constructors

constructor

new ChatGoogleVertexAI(fields?): ChatGoogleVertexAI
Parameters
- Optional fields: GoogleVertexAIChatInput<GoogleAuthOptions<JSONClient>>
Returns ChatGoogleVertexAI
Overrides BaseChatGoogleVertexAI<GoogleAuthOptions>.constructor
- Defined in docs/api_refs/langchain/src/chat_models/googlevertexai/index.ts:33

Properties

CallOptions

CallOptions: BaseLanguageModelCallOptions

ParsedCallOptions

ParsedCallOptions: Omit<BaseLanguageModelCallOptions, never>

caller

caller: AsyncCaller

The async caller should be used by subclasses to make any async calls, which will thus benefit from the concurrency and retry logic.

connection

connection: GoogleVertexAILLMConnection<BaseLanguageModelCallOptions, GoogleVertexAIChatInstance, GoogleVertexAIChatPrediction, GoogleAuthOptions<JSONClient>>

examples

examples: ChatExample[] = []

maxOutputTokens

maxOutputTokens: number = 1024

model

model: string = "chat-bison"

streamedConnection

streamedConnection: GoogleVertexAILLMConnection<BaseLanguageModelCallOptions, GoogleVertexAIChatInstance, GoogleVertexAIChatPrediction, GoogleAuthOptions<JSONClient>>

temperature

temperature: number = 0.2

topK

topK: number = 40

topP

topP: number = 0.8

verbose

verbose: boolean

Whether to print out response text.

`Optional` cache

cache?: BaseCache<Generation[]>

`Optional` callbacks

callbacks?: Callbacks

`Optional` metadata

metadata?: Record<string, unknown>

`Optional` tags

tags?: string[]

Accessors

callKeys

get callKeys(): string[]
Keys that the language model accepts as call options.

Returns string[]
Inherited from BaseChatGoogleVertexAI.callKeys
- Defined in langchain-core/dist/language_models/base.d.ts:113

Methods

batch

batch(inputs, options?, batchOptions?): Promise<BaseMessageChunk[]>
Default implementation of batch, which calls invoke N times. Subclasses should override this method if they can batch more efficiently.
Parameters
- inputs: BaseLanguageModelInput[]
  
  Array of inputs to each batch call.
- Optional options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]
  
  Either a single call options object to apply to each batch call or an array for each call.
- Optional batchOptions: RunnableBatchOptions & {
  returnExceptions?: false;
  }
Returns Promise<BaseMessageChunk[]>
An array of RunOutputs, or mixed RunOutputs and errors if batchOptions.returnExceptions is set
Inherited from BaseChatGoogleVertexAI.batch
- Defined in langchain-core/dist/runnables/base.d.ts:76
batch(inputs, options?, batchOptions?): Promise<(Error | BaseMessageChunk)[]>
Parameters
- inputs: BaseLanguageModelInput[]
- Optional options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]
- Optional batchOptions: RunnableBatchOptions & {
  returnExceptions: true;
  }
Returns Promise<(Error | BaseMessageChunk)[]>
Inherited from BaseChatGoogleVertexAI.batch
- Defined in langchain-core/dist/runnables/base.d.ts:79
batch(inputs, options?, batchOptions?): Promise<(Error | BaseMessageChunk)[]>
Parameters
- inputs: BaseLanguageModelInput[]
- Optional options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]
- Optional batchOptions: RunnableBatchOptions
Returns Promise<(Error | BaseMessageChunk)[]>
Inherited from BaseChatGoogleVertexAI.batch
- Defined in langchain-core/dist/runnables/base.d.ts:82

bind

bind(kwargs): Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
Bind arguments to a Runnable, returning a new Runnable.
Parameters
- kwargs: Partial<BaseLanguageModelCallOptions>
Returns Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
A new RunnableBinding that, when invoked, will apply the bound args.
Inherited from BaseChatGoogleVertexAI.bind
- Defined in langchain-core/dist/runnables/base.d.ts:33

call

call(messages, options?, callbacks?): Promise<BaseMessage>
Makes a single call to the chat model.
Parameters
- messages: BaseMessageLike[]
  
  An array of BaseMessage instances.
- Optional options: string[] | BaseLanguageModelCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<BaseMessage>
A Promise that resolves to a BaseMessage.
Inherited from BaseChatGoogleVertexAI.call
- Defined in langchain-core/dist/language_models/chat_models.d.ts:99

callPrompt

callPrompt(promptValue, options?, callbacks?): Promise<BaseMessage>
Makes a single call to the chat model with a prompt value.
Parameters
- promptValue: BasePromptValue
  
  The value of the prompt.
- Optional options: string[] | BaseLanguageModelCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<BaseMessage>
A Promise that resolves to a BaseMessage.
Inherited from BaseChatGoogleVertexAI.callPrompt
- Defined in langchain-core/dist/language_models/chat_models.d.ts:107

createInstance

createInstance(messages): GoogleVertexAIChatInstance
Creates an instance of the Google Vertex AI chat model.
Parameters
- messages: BaseMessage[]
  
  The messages for the model instance.
Returns GoogleVertexAIChatInstance
A new instance of the Google Vertex AI chat model.
Inherited from BaseChatGoogleVertexAI.createInstance
- Defined in docs/api_refs/langchain/src/chat_models/googlevertexai/common.ts:309

formatParameters

formatParameters(): GoogleVertexAIModelParams
Returns GoogleVertexAIModelParams
Inherited from BaseChatGoogleVertexAI.formatParameters
- Defined in docs/api_refs/langchain/src/chat_models/googlevertexai/common.ts:369

generate

generate(messages, options?, callbacks?): Promise<LLMResult>
Generates chat based on the input messages.
Parameters
- messages: BaseMessageLike[][]
  
  An array of arrays of BaseMessage instances.
- Optional options: string[] | BaseLanguageModelCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<LLMResult>
A Promise that resolves to an LLMResult.
Inherited from BaseChatGoogleVertexAI.generate
- Defined in langchain-core/dist/language_models/chat_models.d.ts:68

generatePrompt

generatePrompt(promptValues, options?, callbacks?): Promise<LLMResult>
Generates a prompt based on the input prompt values.
Parameters
- promptValues: BasePromptValue[]
  
  An array of BasePromptValue instances.
- Optional options: string[] | BaseLanguageModelCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<LLMResult>
A Promise that resolves to an LLMResult.
Inherited from BaseChatGoogleVertexAI.generatePrompt
- Defined in langchain-core/dist/language_models/chat_models.d.ts:89

getNumTokens

getNumTokens(content): Promise<number>
Parameters
- content: MessageContent
Returns Promise<number>
Inherited from BaseChatGoogleVertexAI.getNumTokens
- Defined in langchain-core/dist/language_models/base.d.ts:130

invocationParams

invocationParams(_options?): any
Get the parameters used to invoke the model
Parameters
- Optional _options: Omit<BaseLanguageModelCallOptions, never>
Returns any
Inherited from BaseChatGoogleVertexAI.invocationParams
- Defined in langchain-core/dist/language_models/chat_models.d.ts:72

invoke

invoke(input, options?): Promise<BaseMessageChunk>
Invokes the chat model with a single input.
Parameters
- input: BaseLanguageModelInput
  
  The input for the language model.
- Optional options: BaseLanguageModelCallOptions
  
  The call options.
Returns Promise<BaseMessageChunk>
A Promise that resolves to a BaseMessageChunk.
Inherited from BaseChatGoogleVertexAI.invoke
- Defined in langchain-core/dist/language_models/chat_models.d.ts:54

map

map(): Runnable<BaseLanguageModelInput[], BaseMessageChunk[], BaseLanguageModelCallOptions>
Return a new Runnable that maps a list of inputs to a list of outputs, by calling invoke() with each input.

Returns Runnable<BaseLanguageModelInput[], BaseMessageChunk[], BaseLanguageModelCallOptions>
Inherited from BaseChatGoogleVertexAI.map
- Defined in langchain-core/dist/runnables/base.d.ts:38

pipe

pipe<NewRunOutput>(coerceable): RunnableSequence<BaseLanguageModelInput, Exclude<NewRunOutput, Error>>
Create a new runnable sequence that runs each individual runnable in series, piping the output of one runnable into another runnable or runnable-like.
Type Parameters
- NewRunOutput
Parameters
- coerceable: RunnableLike<BaseMessageChunk, NewRunOutput>
  
  A runnable, function, or object whose values are functions or runnables.
Returns RunnableSequence<BaseLanguageModelInput, Exclude<NewRunOutput, Error>>
A new runnable sequence.
Inherited from BaseChatGoogleVertexAI.pipe
- Defined in langchain-core/dist/runnables/base.d.ts:136

predict

predict(text, options?, callbacks?): Promise<string>
Predicts the next message based on a text input.
Parameters
- text: string
  
  The text input.
- Optional options: string[] | BaseLanguageModelCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<string>
A Promise that resolves to a string.
Inherited from BaseChatGoogleVertexAI.predict
- Defined in langchain-core/dist/language_models/chat_models.d.ts:123

predictMessages

predictMessages(messages, options?, callbacks?): Promise<BaseMessage>
Predicts the next message based on the input messages.
Parameters
- messages: BaseMessage[]
  
  An array of BaseMessage instances.
- Optional options: string[] | BaseLanguageModelCallOptions
  
  The call options or an array of stop sequences.
- Optional callbacks: Callbacks
  
  The callbacks for the language model.
Returns Promise<BaseMessage>
A Promise that resolves to a BaseMessage.
Inherited from BaseChatGoogleVertexAI.predictMessages
- Defined in langchain-core/dist/language_models/chat_models.d.ts:115

serialize

serialize(): SerializedLLM
Returns SerializedLLM

Deprecated
Return a json-like object representing this LLM.
Inherited from BaseChatGoogleVertexAI.serialize
- Defined in langchain-core/dist/language_models/chat_models.d.ts:81

stream

stream(input, options?): Promise<IterableReadableStream<BaseMessageChunk>>
Stream output in chunks.
Parameters
- input: BaseLanguageModelInput
- Optional options: Partial<BaseLanguageModelCallOptions>
Returns Promise<IterableReadableStream<BaseMessageChunk>>
A readable stream that is also an iterable.
Inherited from BaseChatGoogleVertexAI.stream
- Defined in langchain-core/dist/runnables/base.d.ts:97

streamLog

streamLog(input, options?, streamOptions?): AsyncGenerator<RunLogPatch, any, unknown>
Stream all output from a runnable, as reported to the callback system. This includes all inner runs of LLMs, Retrievers, Tools, etc. Output is streamed as Log objects, which include a list of jsonpatch ops that describe how the state of the run has changed in each step, and the final state of the run. The jsonpatch ops can be applied in order to construct state.
Parameters
- input: BaseLanguageModelInput
- Optional options: Partial<BaseLanguageModelCallOptions>
- Optional streamOptions: Omit<LogStreamCallbackHandlerInput, "autoClose">
Returns AsyncGenerator<RunLogPatch, any, unknown>
Inherited from BaseChatGoogleVertexAI.streamLog
- Defined in langchain-core/dist/runnables/base.d.ts:156

toJSON

toJSON(): Serialized
Returns Serialized
Inherited from BaseChatGoogleVertexAI.toJSON
- Defined in langchain-core/dist/load/serializable.d.ts:72

toJSONNotImplemented

toJSONNotImplemented(): SerializedNotImplemented
Returns SerializedNotImplemented
Inherited from BaseChatGoogleVertexAI.toJSONNotImplemented
- Defined in langchain-core/dist/load/serializable.d.ts:73

transform

transform(generator, options): AsyncGenerator<BaseMessageChunk, any, unknown>
Default implementation of transform, which buffers input and then calls stream. Subclasses should override this method if they can start producing output while input is still being generated.
Parameters
- generator: AsyncGenerator<BaseLanguageModelInput, any, unknown>
- options: Partial<BaseLanguageModelCallOptions>
Returns AsyncGenerator<BaseMessageChunk, any, unknown>
Inherited from BaseChatGoogleVertexAI.transform
- Defined in langchain-core/dist/runnables/base.d.ts:144

withConfig

withConfig(config): RunnableBinding<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
Bind config to a Runnable, returning a new Runnable.
Parameters
- config: BaseCallbackConfig
  
  New configuration parameters to attach to the new runnable.
Returns RunnableBinding<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
A new RunnableBinding with a config matching what's passed.
Inherited from BaseChatGoogleVertexAI.withConfig
- Defined in langchain-core/dist/runnables/base.d.ts:53

withFallbacks

withFallbacks(fields): RunnableWithFallbacks<BaseLanguageModelInput, BaseMessageChunk>
Create a new runnable from the current one that will try invoking other passed fallback runnables if the initial invocation fails.
Parameters
- fields: {
  fallbacks: Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseCallbackConfig>[];
  }
  - fallbacks: Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseCallbackConfig>[]
    
    Other runnables to call if the runnable errors.
Returns RunnableWithFallbacks<BaseLanguageModelInput, BaseMessageChunk>
A new RunnableWithFallbacks.
Inherited from BaseChatGoogleVertexAI.withFallbacks
- Defined in langchain-core/dist/runnables/base.d.ts:60

withListeners

withListeners(params): Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
Bind lifecycle listeners to a Runnable, returning a new Runnable. The Run object contains information about the run, including its id, type, input, output, error, startTime, endTime, and any tags or metadata added to the run.
Parameters
- params: {
      onEnd?: ((run, config?) => void | Promise<void>);
      onError?: ((run, config?) => void | Promise<void>);
      onStart?: ((run, config?) => void | Promise<void>);
  }
  
  The object containing the callback functions.
  - Optional onEnd?: ((run, config?) => void | Promise<void>)
    - (run, config?): void | Promise<void>
      
      Called after the runnable finishes running, with the Run object.
      
      Parameters
      
      run: Run
      
      Optional config: BaseCallbackConfig
      
      Returns void | Promise<void>
  - Optional onError?: ((run, config?) => void | Promise<void>)
    - (run, config?): void | Promise<void>
      
      Called if the runnable throws an error, with the Run object.
      
      Parameters
      
      run: Run
      
      Optional config: BaseCallbackConfig
      
      Returns void | Promise<void>
  - Optional onStart?: ((run, config?) => void | Promise<void>)
    - (run, config?): void | Promise<void>
      
      Called before the runnable starts running, with the Run object.
      
      Parameters
      
      run: Run
      
      Optional config: BaseCallbackConfig
      
      Returns void | Promise<void>
Returns Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
Inherited from BaseChatGoogleVertexAI.withListeners
- Defined in langchain-core/dist/runnables/base.d.ts:169

withRetry

withRetry(fields?): RunnableRetry<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
Add retry logic to an existing runnable.
Parameters
- Optional fields: {
  onFailedAttempt?: RunnableRetryFailedAttemptHandler;
  stopAfterAttempt?: number;
  }
  - Optional onFailedAttempt?: RunnableRetryFailedAttemptHandler
  - Optional stopAfterAttempt?: number
Returns RunnableRetry<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>
A new RunnableRetry that, when invoked, will retry according to the parameters.
Inherited from BaseChatGoogleVertexAI.withRetry
- Defined in langchain-core/dist/runnables/base.d.ts:44

`Static` convertPrediction

convertPrediction(prediction): ChatGeneration
Converts a prediction from the Google Vertex AI chat model to a chat generation.
Parameters
- prediction: GoogleVertexAIChatPrediction
  
  The prediction to convert.
Returns ChatGeneration
The converted chat generation.
Inherited from BaseChatGoogleVertexAI.convertPrediction
- Defined in docs/api_refs/langchain/src/chat_models/googlevertexai/common.ts:384

`Static` convertPredictionChunk

convertPredictionChunk(output): ChatGenerationChunk
Parameters
- output: any
Returns ChatGenerationChunk
Inherited from BaseChatGoogleVertexAI.convertPredictionChunk
- Defined in docs/api_refs/langchain/src/chat_models/googlevertexai/common.ts:396

`Static` deserialize

deserialize(_data): Promise<BaseLanguageModel<any, BaseLanguageModelCallOptions>>
Parameters
- _data: SerializedLLM
Returns Promise<BaseLanguageModel<any, BaseLanguageModelCallOptions>>

Deprecated
Load an LLM from a json-like object describing it.
Inherited from BaseChatGoogleVertexAI.deserialize
- Defined in langchain-core/dist/language_models/base.d.ts:154

`Static` isRunnable

isRunnable(thing): thing is Runnable<any, any, BaseCallbackConfig>
Parameters
- thing: any
Returns thing is Runnable<any, any, BaseCallbackConfig>
Inherited from BaseChatGoogleVertexAI.isRunnable
- Defined in langchain-core/dist/runnables/base.d.ts:157

Class ChatGoogleVertexAI

Example

Hierarchy

Index

Constructors

Properties

Accessors

Methods

Constructors

constructor

Parameters

Optional fields: GoogleVertexAIChatInput<GoogleAuthOptions<JSONClient>>

Returns ChatGoogleVertexAI

Properties

CallOptions

ParsedCallOptions

caller

connection

examples

maxOutputTokens

model

streamedConnection

temperature

topK

topP

verbose

Optional cache

Optional callbacks

Optional metadata

Optional tags

Accessors

callKeys

Returns string[]

Methods

batch

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]

Optional batchOptions: RunnableBatchOptions & { returnExceptions?: false; }

Returns Promise<BaseMessageChunk[]>

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]

Optional batchOptions: RunnableBatchOptions & { returnExceptions: true; }

Returns Promise<(Error | BaseMessageChunk)[]>

Parameters

inputs: BaseLanguageModelInput[]

Optional options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]

Optional batchOptions: RunnableBatchOptions

Returns Promise<(Error | BaseMessageChunk)[]>

bind

Parameters

kwargs: Partial<BaseLanguageModelCallOptions>

Returns Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseLanguageModelCallOptions>

call

Parameters

messages: BaseMessageLike[]

Optional options: string[] | BaseLanguageModelCallOptions

Optional callbacks: Callbacks

Returns Promise<BaseMessage>

callPrompt

Parameters

promptValue: BasePromptValue

Optional options: string[] | BaseLanguageModelCallOptions

Optional callbacks: Callbacks

Returns Promise<BaseMessage>

createInstance

Parameters

messages: BaseMessage[]

Returns GoogleVertexAIChatInstance

formatParameters

Returns GoogleVertexAIModelParams

generate

Parameters

messages: BaseMessageLike[][]

Optional options: string[] | BaseLanguageModelCallOptions

Optional callbacks: Callbacks

Returns Promise<LLMResult>

generatePrompt

Parameters

`Optional` fields: GoogleVertexAIChatInput<GoogleAuthOptions<JSONClient>>

`Optional` cache

`Optional` callbacks

`Optional` metadata

`Optional` tags

`Optional` options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions & {
returnExceptions?: false;
}

`Optional` options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions & {
returnExceptions: true;
}

`Optional` options: Partial<BaseLanguageModelCallOptions> | Partial<BaseLanguageModelCallOptions>[]

`Optional` batchOptions: RunnableBatchOptions

`Optional` options: string[] | BaseLanguageModelCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | BaseLanguageModelCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | BaseLanguageModelCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | BaseLanguageModelCallOptions

`Optional` callbacks: Callbacks

`Optional` _options: Omit<BaseLanguageModelCallOptions, never>

`Optional` options: BaseLanguageModelCallOptions

`Optional` options: string[] | BaseLanguageModelCallOptions

`Optional` callbacks: Callbacks

`Optional` options: string[] | BaseLanguageModelCallOptions

`Optional` callbacks: Callbacks

`Optional` options: Partial<BaseLanguageModelCallOptions>

`Optional` options: Partial<BaseLanguageModelCallOptions>

`Optional` streamOptions: Omit<LogStreamCallbackHandlerInput, "autoClose">

fields: {
fallbacks: Runnable<BaseLanguageModelInput, BaseMessageChunk, BaseCallbackConfig>[];
}