Logan 62f94d0eba
add concat rows to pandas excel (#262)
Co-authored-by: Jerry Liu <jerryjliu98@gmail.com>
2023-05-17 09:16:05 -07:00
..
2023-03-06 19:49:08 -03:00
2023-05-17 09:16:05 -07:00
2023-04-29 21:49:31 -07:00
2023-03-06 19:49:08 -03:00

Pandas Excel Loader

This loader extracts the text from a column of a local .xlsx file using the pandas Python package. A single local file is passed in each time you call load_data.

Usage

To use this loader, you need to pass in a Path to a local file, along with a sheet_name from which sheet to extract data. The default sheet_name=None, which means it will load all the sheets in the excel file. You can set sheet_name="Data1 to load only the sheet named "Data1". Or you can set sheet_name=0 to load the first sheet in the excel file. You can pass any additional pandas configuration options to the pandas_config parameter, please see the pandas documentation.

from pathlib import Path
from llama_index import download_loader

PandasExcelReader = download_loader("PandasExcelReader")

loader = PandasExcelReader()
documents = loader.load_data(file=Path('./data.xlsx'), pandas_config={"header":0})

This loader is designed to be used as a way to load data into LlamaIndex and/or subsequently used as a Tool in a LangChain Agent. See here for examples.