mirror of
https://github.com/run-llama/llama-hub.git
synced 2025-08-14 11:41:56 +00:00
Reddit Reader
For any subreddit(s) you're interested in, search for relevant posts using keyword(s) and load the resulting text in the post and and top-level comments into LLMs/ LangChains.
Get your Reddit credentials ready
- Visit Reddit App Preferences (https://www.reddit.com/prefs/apps) or https://old.reddit.com/prefs/apps/
- Scroll to the bottom and click "create another app..."
- Fill out the name, description, and redirect url for your app, then click "create app"
- Now you should be able to see the personal use script, secret, and name of your app. Store those as environment variables REDDIT_CLIENT_ID, REDDIT_CLIENT_SECRET, and REDDIT_USER_AGENT respecitvely.
- Additionally store the environment variables REDDIT_USERNAME and REDDIT_PASSWORD, which correspond to the credentials for your Reddit account.
Usage
LlamaIndex
from llama_index import GPTVectorStoreIndex, download_loader
RedditReader = download_loader('RedditReader')
subreddits = ['MachineLearning']
search_keys = ['PyTorch', 'deploy']
post_limit = 10
loader = RedditReader()
documents = loader.load_data(subreddits=subreddits, search_keys=search_keys, post_limit=post_limit)
index = GPTVectorStoreIndex.from_documents(documents)
index.query("What are the pain points of PyTorch users?")
LangChain
from llama_index import GPTVectorStoreIndex, download_loader
from langchain.agents import initialize_agent, Tool
from langchain.llms import OpenAI
from langchain.chains.conversation.memory import ConversationBufferMemory
RedditReader = download_loader('RedditReader')
subreddits = ['MachineLearning']
search_keys = ['PyTorch', 'deploy']
post_limit = 10
loader = RedditReader()
documents = loader.load_data(subreddits=subreddits, search_keys=search_keys, post_limit=post_limit)
index = GPTVectorStoreIndex.from_documents(documents)
tools = [
Tool(
name="Reddit Index",
func=lambda q: index.query(q),
description=f"Useful when you want to read relevant posts and top-level comments in subreddits.",
),
]
llm = OpenAI(temperature=0)
memory = ConversationBufferMemory(memory_key="chat_history")
agent_chain = initialize_agent(
tools, llm, agent="zero-shot-react-description", memory=memory
)
output = agent_chain.run(input="What are the pain points of PyTorch users?")
print(output)
This loader is designed to be used as a way to load data into GPT Index and/or subsequently used as a Tool in a LangChain Agent. See here for examples.