• Import from "@langchain/community/document_loaders/web/cheerio" instead. This entrypoint will be removed in 0.3.0.

Represents the parameters for configuring the CheerioWebBaseLoader. It extends the AsyncCallerParams interface and adds additional parameters specific to web-based loaders.

interface WebBaseLoaderParams {
    maxConcurrency?: number;
    maxRetries?: number;
    onFailedAttempt?: FailedAttemptHandler;
    selector?: SelectorType;
    textDecoder?: TextDecoder;
    timeout?: number;
}

Hierarchy (view full)

Properties

maxConcurrency?: number

The maximum number of concurrent calls that can be made. Defaults to Infinity, which means no limit.

maxRetries?: number

The maximum number of retries that can be made for a single call, with an exponential backoff between each attempt. Defaults to 6.

onFailedAttempt?: FailedAttemptHandler

Custom handler to handle failed attempts. Takes the originally thrown error object as input, and should itself throw an error if the input error is not retryable.

selector?: SelectorType

The selector to use to extract the text from the document. Defaults to "body".

textDecoder?: TextDecoder

The text decoder to use to decode the response. Defaults to UTF-8.

timeout?: number

The timeout in milliseconds for the fetch request. Defaults to 10s.