Add documentation about adding reasoning and indexing pipelines to the application (#138)

This commit is contained in:
Duc Nguyen (john) 2024-01-26 22:31:52 +07:00 committed by GitHub
parent 757aabca4d
commit 22c646e5c4
6 changed files with 257 additions and 10 deletions

docs/app.md Normal file
View File

@@ -0,0 +1,237 @@
# Add new indexing and reasoning pipelines to the application
@trducng
At a high level, to add a new indexing or reasoning pipeline:

1. You define your indexing or reasoning pipeline as a class that subclasses
   `BaseComponent`.
2. You declare that class in the settings file `flowsettings.py`.

Then, when you run `python launch.py`, the application will dynamically load those
pipelines.

The sections below describe in more detail how the pipelines should be
constructed.
## Define a pipeline as a class
In essence, a pipeline will subclass from `kotaemon.base.BaseComponent`.
Each pipeline has 2 main parts:
- All declared arguments and sub-pipelines.
- The logic inside the pipeline.
An example pipeline:
```python
from kotaemon.base import BaseComponent


class SoSimple(BaseComponent):
    arg1: int
    arg2: str

    def run(self, arg3: str):
        return self.arg1 * self.arg2 + arg3
```
This pipeline is named `SoSimple`. It takes `arg1` and `arg2` as init arguments,
and `arg3` as a run argument.
```python
>>> pipeline = SoSimple(arg1=2, arg2="ha")
>>> pipeline("x")
'hahax'
```
This pipeline is kept simple for demonstration purposes, but you can imagine
pipelines with many more arguments, pipelines that take other pipelines as
arguments, and pipelines with more complicated logic in the `run` method.
**_An indexing or reasoning pipeline is just a class that subclasses
`BaseComponent`, as above._**
## Run signatures
**Note**: this section is tentative at the moment. We will finalize the `run`
signature by early April at the latest.
The indexing pipeline:
```python
def run(
    self,
    file_paths: str | Path | list[str | Path],
    reindex: bool = False,
    **kwargs,
):
    """Index files into an intermediate representation (e.g. vector, database...)

    Args:
        file_paths: the list of paths to files
        reindex: if True, files in `file_paths` that already exist in the
            database should be reindexed
    """
```
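
For illustration, a minimal pipeline matching this signature might look like the
sketch below; the parsing and storage steps are placeholders, not part of the
actual API:

```python
from pathlib import Path

from kotaemon.base import BaseComponent


class NaiveIndexingPipeline(BaseComponent):
    """A sketch of an indexing pipeline; the storage logic is a stand-in."""

    def run(
        self,
        file_paths: str | Path | list[str | Path],
        reindex: bool = False,
        **kwargs,
    ):
        if not isinstance(file_paths, list):
            file_paths = [file_paths]
        for path in (Path(p) for p in file_paths):
            # placeholder steps: read the file, then chunk, embed and store it
            text = path.read_text()
            ...  # skip or overwrite existing entries depending on `reindex`
```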
The reasoning pipeline:
```python
def run(self, question: str, history: list, **kwargs) -> Document:
    """Answer the question

    Args:
        question: the user input
        history: the chat history [(user_msg1, bot_msg1), (user_msg2, bot_msg2)...]

    Returns:
        kotaemon.base.Document: the final answer
    """
```
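
Likewise, a minimal reasoning pipeline satisfying this signature (no retrieval or
LLM call, just the required shape):

```python
from kotaemon.base import BaseComponent, Document


class EchoPipeline(BaseComponent):
    """A sketch that answers by echoing the question."""

    def run(self, question: str, history: list, **kwargs) -> Document:
        # a real pipeline would retrieve evidence and call an LLM here
        return Document(text=f"You asked: {question}")
```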
## Register your pipeline with ktem

To register your pipelines with ktem, declare them in the `flowsettings.py`
file. This file is located in the current working directory where you start
ktem. In most use cases, it is this
[one](https://github.com/Cinnamon/kotaemon/blob/main/libs/ktem/flowsettings.py).
```python
KH_REASONINGS = ["<python.module.path.to.the.reasoning.class>"]
KH_INDEX = "<python.module.path.to.the.indexing.class>"
```
You can register multiple reasoning pipelines with ktem by populating the
`KH_REASONINGS` list. The user can then select which reasoning pipeline to use
on their Settings page.
For now, there's only one supported index option for `KH_INDEX`.
Make sure that your class is discoverable by Python.
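
For example, if your classes live in an importable package named, hypothetically,
`my_app` (installed or on `PYTHONPATH`), the declaration would be:

```python
# flowsettings.py -- `my_app` and the class names below are hypothetical
KH_REASONINGS = ["my_app.reasoning.MyQAPipeline"]
KH_INDEX = "my_app.indexing.MyIndexingPipeline"
```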
## Allow users to customize your pipeline in the app settings
To allow users to configure your pipeline, you declare what they are allowed to
configure as a dictionary, and `ktem` will include it in the application
settings.

In your pipeline class, add a classmethod `get_user_settings` that returns a
settings dictionary, and a classmethod `get_info` that returns an info
dictionary. Example:
```python
class SoSimple(BaseComponent):
    ...  # as above

    @classmethod
    def get_user_settings(cls) -> dict:
        """The settings exposed to the user"""
        return {
            "setting_1": {
                "name": "Human-friendly name",
                "value": "Default value",
                "choices": [
                    ("Human-friendly Choice 1", "choice1-id"),
                    ("HFC 2", "choice2-id"),
                ],  # optional
                "component": "which Gradio UI component to render; can be: "
                "text, number, checkbox, dropdown, radio, checkboxgroup",
            },
            "setting_2": {
                # follow the same rules as above
            },
        }

    @classmethod
    def get_info(cls) -> dict:
        """Pipeline information for bookkeeping purposes"""
        return {
            "id": "a unique id to differentiate this pipeline from others",
            "name": "Human-friendly name of the pipeline",
            "description": "a short description of this pipeline",
        }
```
Once you add these methods to your pipeline class, `ktem` will automatically
extract them and add them to the application settings.
## Construct the pipeline object
When `ktem` runs your pipeline, it calls your classmethod `get_pipeline` with
the full user settings and expects to receive the pipeline object. Within this
`get_pipeline` method, implement all the logic necessary to initiate the
pipeline object. Example:
```python
class SoSimple(BaseComponent):
    ...  # as above

    @classmethod
    def get_pipeline(cls, setting):
        obj = cls(arg1=setting["reasoning.id.setting_1"])
        return obj
```
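
Note that, judging from this example, the settings arrive as a flat dictionary
whose keys follow the pattern `reasoning.<pipeline id>.<setting name>`, with the
id being the one returned by `get_info`; treat this pattern as an assumption and
verify it against your registered pipeline.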
## Reasoning: Stream output to UI
For a fast user experience, you can stream the output directly to the UI. This
way, the user can start reading the output as soon as the LLM generates the
first token, rather than having to wait for the pipeline to finish before seeing
the whole message.

To stream the output, you need to:

1. Turn the `run` function into an async function.
2. Pass the output to a special queue with `self.report_output`.
```python
async def run(self, question: str, history: list, **kwargs) -> Document:
    for char in "This is a long message":
        self.report_output({"output": char})
```
The argument to `self.report_output` is a dictionary that contains one or both
of the keys `"output"` and `"evidence"`. The `"output"` string is streamed to
the chat message, and the `"evidence"` string is streamed to the information
panel.
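
Putting the two keys together, here is a sketch that streams answer tokens to
the chat message and a citation string to the information panel (the hard-coded
token list is a stand-in for your LLM's token stream):

```python
async def run(self, question: str, history: list, **kwargs) -> Document:
    answer = ""
    for token in ["The", " answer", " is", " 42."]:  # stand-in for an LLM stream
        answer += token
        self.report_output({"output": token})  # appended to the chat message
    self.report_output({"evidence": "Source: doc_1.pdf"})  # information panel
    return Document(text=answer)
```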
## Access application LLMs and embeddings
You can access users' collections of LLMs and embedding models with:
```python
from ktem.components import llms, embeddings
llm = llms.get_default()
embedding_model = embeddings.get_default()
```
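
These are ordinary kotaemon components, so (assuming they are callable on a
plain string, as kotaemon components generally are) you can invoke them
directly:

```python
from ktem.components import embeddings, llms

# assumption: the default components are callable on a plain string
llm = llms.get_default()
embedding_model = embeddings.get_default()

answer = llm("What is the capital of France?")
vectors = embedding_model("a passage of text to embed")
```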
You can also let users select which LLMs or embedding models they want to use
through the settings.
```python
@classmethod
def get_user_settings(cls) -> dict:
    from ktem.components import llms

    return {
        "citation_llm": {
            "name": "LLM for citation",
            "value": llms.get_lowest_cost_name(),
            "component": "dropdown",
            "choices": list(llms.options().keys()),
        },
        ...
    }
```
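
Inside `get_pipeline`, you can then resolve the user's choice back to a concrete
model. This sketch assumes the flat `reasoning.<id>.<setting name>` key pattern
from earlier and uses `llms.options()` as the lookup table; both are assumptions
to verify:

```python
@classmethod
def get_pipeline(cls, setting):
    from ktem.components import llms

    pipeline = cls()
    # assumptions: the key pattern, and that options() maps names to models
    chosen_name = setting["reasoning.simple.citation_llm"]
    pipeline.citation_llm = llms.options()[chosen_name]
    return pipeline
```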
## Optional: Access application data
You can access the user's application database and vector store as follows:
```python
# get the database that contains the source files
from ktem.db.models import Source, Index, Conversation, User
# get the vector store
```
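
As a sketch of querying the source-file database, assuming `ktem` exposes a
SQLModel engine alongside these models (the `engine` import and the session
usage are assumptions; adapt them to the actual `ktem.db` layout):

```python
from ktem.db.models import Source, engine
from sqlmodel import Session, select

# assumption: the models are SQLModel classes bound to `engine`
with Session(engine) as session:
    sources = session.exec(select(Source)).all()  # all indexed source files
```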

View File

@@ -19,6 +19,7 @@
- Install all
```shell
cd libs/kotaemon
pip install -e ".[dev]"
```

View File

@@ -68,9 +68,7 @@ KH_EMBEDDINGS = {
         "default": True,
     },
 }
-KH_REASONINGS = {
-    "simple": "ktem.reasoning.simple.FullQAPipeline",
-}
+KH_REASONINGS = ["ktem.reasoning.simple.FullQAPipeline"]
 
 SETTINGS_APP = {

View File

@@ -73,11 +73,12 @@ class BaseApp:
         if getattr(settings, "KH_REASONINGS", None) is None:
             return
 
-        for name, value in settings.KH_REASONINGS.items():
+        for value in settings.KH_REASONINGS:
             reasoning_cls = import_dotted_string(value, safe=False)
-            reasonings[name] = reasoning_cls
+            rid = reasoning_cls.get_info()["id"]
+            reasonings[rid] = reasoning_cls
             options = reasoning_cls().get_user_settings()
-            self.default_settings.reasoning.options[name] = BaseSettingGroup(
+            self.default_settings.reasoning.options[rid] = BaseSettingGroup(
                 settings=options
             )

View File

@@ -128,7 +128,7 @@ async def chat_fn(chat_history, files, settings):
     pipeline = create_pipeline(settings, files)
     pipeline.set_output_queue(queue)
 
-    asyncio.create_task(pipeline(chat_input))
+    asyncio.create_task(pipeline(chat_input, chat_history))
     text, refs = "", ""
 
     while True:

View File

@@ -3,7 +3,7 @@ import logging
 import warnings
 from collections import defaultdict
 from functools import partial
-from typing import Iterator, Optional
+from typing import Optional
 
 import tiktoken
 from ktem.components import embeddings, get_docstore, get_vectorstore, llms
@@ -278,7 +278,7 @@ class AnswerWithContextPipeline(BaseComponent):
     async def run(  # type: ignore
         self, question: str, evidence: str, evidence_mode: int = 0
-    ) -> Document | Iterator[Document]:
+    ) -> Document:
         """Answer the question based on the evidence
 
         In addition to the question and the evidence, this method also take into
@@ -342,7 +342,9 @@ class FullQAPipeline(BaseComponent):
     evidence_pipeline: PrepareEvidencePipeline = PrepareEvidencePipeline.withx()
     answering_pipeline: AnswerWithContextPipeline = AnswerWithContextPipeline.withx()
 
-    async def run(self, question: str, **kwargs) -> Document:  # type: ignore
+    async def run(  # type: ignore
+        self, question: str, history: list, **kwargs  # type: ignore
+    ) -> Document:  # type: ignore
         docs = self.retrieval_pipeline(text=question)
         evidence_mode, evidence = self.evidence_pipeline(docs).content
         answer = await self.answering_pipeline(
@@ -455,3 +457,11 @@ class FullQAPipeline(BaseComponent):
             "choices": main_llm_choices,
         },
     }
+
+    @classmethod
+    def get_info(cls) -> dict:
+        return {
+            "id": "simple",
+            "name": "Simple QA",
+            "description": "Simple QA pipeline",
+        }