llama-hub/loader_hub/youtube_transcript
gocampo 312a5a62b5
Fix bug #121 (#122)
Fixed the regex to take in account hyphens.
2023-03-16 11:56:11 -07:00
..
2023-02-03 00:05:28 -08:00
2023-03-16 11:56:11 -07:00
2023-03-10 15:02:18 +08:00

Youtube Transcript Loader

This loader fetches the text transcript of Youtube videos using the youtube_transcript_api Python package.

Usage

To use this loader, you need to pass in an array of Youtube links.

from llama_index import download_loader

YoutubeTranscriptReader = download_loader("YoutubeTranscriptReader")

loader = YoutubeTranscriptReader()
documents = loader.load_data(ytlinks=['https://www.youtube.com/watch?v=i3OYlaoj-BM'])

This loader is designed to be used as a way to load data into LlamaIndex and/or subsequently used as a Tool in a LangChain Agent. See here for examples.