Pandas Excel Loader
This loader extracts the text from a column of a local .xlsx file using the pandas
Python package. A single local file is passed in each time you call load_data
.
Usage
To use this loader, you need to pass in a Path
to a local file, along with a sheet_name
from which sheet to extract data. The default sheet_name=None
, which means it will load all the sheets in the excel file. You can set sheet_name="Data1
to load only the sheet named "Data1". Or you can set sheet_name=0
to load the first sheet in the excel file. You can pass any additional pandas configuration options to the pandas_config
parameter, please see the pandas documentation.
from pathlib import Path
from llama_index import download_loader
PandasExcelReader = download_loader("PandasExcelReader")
loader = PandasExcelReader()
documents = loader.load_data(file=Path('./data.xlsx'), pandas_config={"header":0})
This loader is designed to be used as a way to load data into LlamaIndex and/or subsequently used as a Tool in a LangChain Agent. See here for examples.