A method that loads the text file or blob and returns a promise that
resolves to an array of Document instances. It reads the text from the
file using the readFile function from the node:fs/promises module, or
from the blob using its text() method. It then parses the text using the
parse() method and creates a Document instance for each parsed page. The
metadata includes the source of the text (file path or blob) and, if
there are multiple pages, the line number of each page.
A promise that resolves to an array of Document instances.
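The loading flow described above can be sketched roughly as follows. This is a minimal illustration, not the library's code: DocSketch and loadSketch are hypothetical names, the parser is passed in as a callback, and labeling a blob's source with the literal string "blob" is an assumption.

```typescript
import { readFile } from "node:fs/promises";

// Hypothetical stand-in for the library's Document class.
interface DocSketch {
  pageContent: string;
  metadata: Record<string, unknown>;
}

// `parse` is supplied by the caller; in the loader this role is played by parse().
async function loadSketch(
  filePathOrBlob: string | Blob,
  parse: (raw: string) => Promise<string[]>
): Promise<DocSketch[]> {
  // Read the raw text from disk (string path) or from the blob itself.
  const text =
    typeof filePathOrBlob === "string"
      ? await readFile(filePathOrBlob, "utf8")
      : await filePathOrBlob.text();

  // Assumption: blobs are labeled with the literal string "blob".
  const source = typeof filePathOrBlob === "string" ? filePathOrBlob : "blob";

  const pages = await parse(text);
  return pages.map((pageContent, i) => ({
    pageContent,
    // Attach a 1-based line number only when there is more than one page.
    metadata: pages.length === 1 ? { source } : { source, line: i + 1 },
  }));
}
```

Passing the parser as an argument keeps the sketch independent of any particular file format; a subclass that overrides parse() would achieve the same effect.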
Optional splitter: BaseDocumentTransformer<DocumentInterface<Record<string, any>>[], DocumentInterface<Record<string, any>>[]>
A Promise that resolves with an array of Document instances, each split according to the provided TextSplitter.
Protected parse
A protected method that parses the raw CSV data and returns an array of
strings representing the pageContent of each document. It uses the
dsvFormat function from the d3-dsv module to parse the CSV data. If the
column option is specified, it checks that the column exists in the CSV
file and returns the values of that column as the pageContent. If the
column option is not specified, it converts each row of the CSV data
into key/value pairs and joins them with newline characters.
The raw CSV data to be parsed.
An array of strings representing the pageContent of each document.
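The two branches of the parsing logic can be illustrated with a small sketch. To stay self-contained it does not use d3-dsv: naiveCsvRows is a hypothetical, deliberately naive stand-in that splits on commas and ignores quoting. The "key: value" rendering and the error message are also assumptions.

```typescript
// Naive stand-in for a real CSV parser such as d3-dsv: no quoting or escaping.
function naiveCsvRows(raw: string): {
  columns: string[];
  rows: Record<string, string>[];
} {
  const [header, ...lines] = raw.trim().split(/\r?\n/);
  const columns = header.split(",");
  const rows = lines.map((line) => {
    const cells = line.split(",");
    const row: Record<string, string> = {};
    columns.forEach((name, i) => {
      row[name] = cells[i] ?? "";
    });
    return row;
  });
  return { columns, rows };
}

function parseSketch(raw: string, column?: string): string[] {
  const { columns, rows } = naiveCsvRows(raw);

  if (column !== undefined) {
    // The requested column must exist in the header row.
    if (!columns.includes(column)) {
      throw new Error(`Column ${column} not found in CSV file.`);
    }
    // One pageContent per row: just that column's value.
    return rows.map((row) => row[column]);
  }

  // No column given: render every row as newline-joined key/value pairs.
  // Assumption: each pair is formatted as "key: value".
  return rows.map((row) =>
    columns.map((key) => `${key.trim()}: ${row[key].trim()}`).join("\n")
  );
}
```

For the input "id,text\n1,hello\n2,world", calling parseSketch with the column "text" yields one string per row containing only that cell, while calling it without a column yields one multi-line string per row.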
Static imports
A static method that imports the readFile function from the
node:fs/promises module. It is used to dynamically import the function
when needed. If the import fails, it throws an error indicating that the
fs/promises module is not available in the current environment.
A promise that resolves to an object containing the readFile function
from the node:fs/promises module.
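A dynamic import with a fallback error, as described above, might look like the sketch below. The function name importReadFileSketch and the wording of the error message are illustrative assumptions, not the library's own.

```typescript
async function importReadFileSketch() {
  try {
    // Dynamic import defers loading the module until it is actually needed,
    // so environments without a file system can still load the caller.
    const { readFile } = await import("node:fs/promises");
    return { readFile };
  } catch {
    // Assumption: the exact message is illustrative.
    throw new Error(
      "Failed to load fs/promises; it is not available in this environment."
    );
  }
}
```

Callers can then destructure the result, for example `const { readFile } = await importReadFileSketch();`, and only pay for the import on code paths that read from disk.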
A class that extends the TextLoader class. It represents a document loader that loads documents from a CSV file. Its constructor takes a
filePathOrBlob parameter representing the path to the CSV file or a Blob object, and an optional options parameter that is either a
CSVLoaderOptions object or a string naming the column to use as the document's pageContent.
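Because the second constructor argument may be either an options object or a bare column name, the two forms have to be reconciled somewhere. One plausible way to do that is sketched below; CsvOptionsSketch and normalizeOptions are hypothetical names, and the real CSVLoaderOptions type may carry more fields than the single column shown here.

```typescript
// Hypothetical minimal shape; the real CSVLoaderOptions may have more fields.
interface CsvOptionsSketch {
  column?: string;
}

function normalizeOptions(
  options?: CsvOptionsSketch | string
): CsvOptionsSketch {
  // A bare string is treated as shorthand for { column: <string> }.
  if (typeof options === "string") {
    return { column: options };
  }
  // An object is passed through; a missing argument becomes empty options.
  return options ?? {};
}
```

Under this reading, passing the string "text" and passing the object { column: "text" } as the second argument would configure the loader identically.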