LightRAG/examples/lightrag_ollama_demo.py

import asyncio
import os
import inspect
import logging
from lightrag import LightRAG, QueryParam
from lightrag.llm import ollama_model_complete, ollama_embedding
from lightrag.utils import EmbeddingFunc

WORKING_DIR = "./dickens"

logging.basicConfig(format="%(levelname)s:%(message)s", level=logging.INFO)

if not os.path.exists(WORKING_DIR):
    os.mkdir(WORKING_DIR)

rag = LightRAG(
    working_dir=WORKING_DIR,
    llm_model_func=ollama_model_complete,
    llm_model_name="gemma2:2b",
    llm_model_max_async=4,
    llm_model_max_token_size=32768,
    llm_model_kwargs={"host": "http://localhost:11434", "options": {"num_ctx": 32768}},
    embedding_func=EmbeddingFunc(
        embedding_dim=768,
        max_token_size=8192,
        func=lambda texts: ollama_embedding(
            texts, embed_model="nomic-embed-text", host="http://localhost:11434"
        ),
    ),
)

with open("./book.txt", "r", encoding="utf-8") as f:
    rag.insert(f.read())

# Perform naive search
print(
    rag.query("What are the top themes in this story?", param=QueryParam(mode="naive"))
)

# Perform local search
print(
    rag.query("What are the top themes in this story?", param=QueryParam(mode="local"))
)

# Perform global search
print(
    rag.query("What are the top themes in this story?", param=QueryParam(mode="global"))
)

# Perform hybrid search
print(
    rag.query("What are the top themes in this story?", param=QueryParam(mode="hybrid"))
)

# stream response
resp = rag.query(
    "What are the top themes in this story?",
    param=QueryParam(mode="hybrid", stream=True),
)


async def print_stream(stream):
    async for chunk in stream:
        print(chunk, end="", flush=True)


if inspect.isasyncgen(resp):
    asyncio.run(print_stream(resp))
else:
    print(resp)
Add support for Ollama streaming output and integrate Open-WebUI as the chat UI demo 2024-12-06 08:48:55 +08:00			`import asyncio`
ollama test 2024-10-16 15:15:10 +08:00			`import os`
Add support for Ollama streaming output and integrate Open-WebUI as the chat UI demo 2024-12-06 08:48:55 +08:00			`import inspect`
Add ability to passadditional parameters to ollama library like host and timeout 2024-10-21 11:53:06 +00:00			`import logging`
ollama test 2024-10-16 15:15:10 +08:00			`from lightrag import LightRAG, QueryParam`
			`from lightrag.llm import ollama_model_complete, ollama_embedding`
			`from lightrag.utils import EmbeddingFunc`

			`WORKING_DIR = "./dickens"`

Fix lint issue 2024-10-28 17:05:38 +02:00			`logging.basicConfig(format="%(levelname)s:%(message)s", level=logging.INFO)`

ollama test 2024-10-16 15:15:10 +08:00			`if not os.path.exists(WORKING_DIR):`
			`os.mkdir(WORKING_DIR)`

			`rag = LightRAG(`
			`working_dir=WORKING_DIR,`
Add ability to passadditional parameters to ollama library like host and timeout 2024-10-21 11:53:06 +00:00			`llm_model_func=ollama_model_complete,`
Finetune example to be able to run ollama example without need to tweak context size in Modelfile 2024-10-22 14:35:42 +00:00			`llm_model_name="gemma2:2b",`
			`llm_model_max_async=4,`
			`llm_model_max_token_size=32768,`
			`llm_model_kwargs={"host": "http://localhost:11434", "options": {"num_ctx": 32768}},`
ollama test 2024-10-16 15:15:10 +08:00			`embedding_func=EmbeddingFunc(`
			`embedding_dim=768,`
			`max_token_size=8192,`
			`func=lambda texts: ollama_embedding(`
Add ability to passadditional parameters to ollama library like host and timeout 2024-10-21 11:53:06 +00:00			`texts, embed_model="nomic-embed-text", host="http://localhost:11434"`
			`),`
ollama test 2024-10-16 15:15:10 +08:00			`),`
			`)`

set encoding as utf-8 when reading ./book.txt in examples 2024-10-22 16:01:40 +08:00			`with open("./book.txt", "r", encoding="utf-8") as f:`
ollama test 2024-10-16 15:15:10 +08:00			`rag.insert(f.read())`

			`# Perform naive search`
Add ability to passadditional parameters to ollama library like host and timeout 2024-10-21 11:53:06 +00:00			`print(`
			`rag.query("What are the top themes in this story?", param=QueryParam(mode="naive"))`
			`)`
ollama test 2024-10-16 15:15:10 +08:00
			`# Perform local search`
Add ability to passadditional parameters to ollama library like host and timeout 2024-10-21 11:53:06 +00:00			`print(`
			`rag.query("What are the top themes in this story?", param=QueryParam(mode="local"))`
			`)`
ollama test 2024-10-16 15:15:10 +08:00
			`# Perform global search`
Add ability to passadditional parameters to ollama library like host and timeout 2024-10-21 11:53:06 +00:00			`print(`
			`rag.query("What are the top themes in this story?", param=QueryParam(mode="global"))`
			`)`
ollama test 2024-10-16 15:15:10 +08:00
			`# Perform hybrid search`
Add ability to passadditional parameters to ollama library like host and timeout 2024-10-21 11:53:06 +00:00			`print(`
			`rag.query("What are the top themes in this story?", param=QueryParam(mode="hybrid"))`
			`)`
Add support for Ollama streaming output and integrate Open-WebUI as the chat UI demo 2024-12-06 08:48:55 +08:00
			`# stream response`
			`resp = rag.query(`
			`"What are the top themes in this story?",`
			`param=QueryParam(mode="hybrid", stream=True),`
			`)`


			`async def print_stream(stream):`
			`async for chunk in stream:`
			`print(chunk, end="", flush=True)`


			`if inspect.isasyncgen(resp):`
			`asyncio.run(print_stream(resp))`
			`else:`
			`print(resp)`