diff --git a/config/json_yaml/index.html b/config/json_yaml/index.html index 4853e98e..3d4fd0d0 100644 --- a/config/json_yaml/index.html +++ b/config/json_yaml/index.html @@ -1402,7 +1402,7 @@
  • api_version str - The API version
  • organization str - The client organization.
  • proxy str - The proxy URL to use.
  • -
  • cognitive_services_endpoint str - The url endpoint for cognitive services.
  • +
  • audience str - (Azure OpenAI only) The URI of the target Azure resource/service for which a managed identity token is requested. Used if api_key is not defined. Default=https://cognitiveservices.azure.com/.default
  • deployment_name str - The deployment name to use (Azure).
  • model_supports_json bool - Whether the model supports JSON-mode output.
  • tokens_per_minute int - Set a leaky-bucket throttle on tokens-per-minute.
  • @@ -1430,9 +1430,17 @@
  • parallelization (see Parallelization top-level config)
  • async_mode (see Async Mode top-level config)
  • batch_size int - The maximum batch size to use.
  • -
  • batch_max_tokens int - The maximum batch #-tokens.
  • +
  • batch_max_tokens int - The maximum batch # of tokens.
  • target required|all - Determines which set of embeddings to emit.
  • skip list[str] - Which embeddings to skip.
  • +
  • vector_store dict - The vector store to use. Configured for lancedb by default.
  • +
  • type str - lancedb or azure_ai_search. Default=lancedb
  • +
  • db_uri str (only for lancedb) - The database uri. Default=storage.base_dir/lancedb
  • +
  • url str (only for AI Search) - AI Search endpoint
  • +
  • api_key str (optional - only for AI Search) - The AI Search api key to use.
  • +
  • audience str (only for AI Search) - Audience for managed identity token if managed identity authentication is used.
  • +
  • overwrite bool (only used at index creation time) - Overwrite collection if it exist. Default=True
  • +
  • collection_name str - The name of a vector collection. Default=entity_description_embeddings
  • strategy dict - Fully override the text-embedding strategy.
  • chunks

    @@ -1547,7 +1555,7 @@
  • top_level_nodes bool - Emit top-level-node snapshots.
  • encoding_model

    -

    str - The text encoding model to use. Default is cl100k_base.

    +

    str - The text encoding model to use. Default=cl100k_base.

    skip_workflows

    list[str] - Which workflow names to skip.

    diff --git a/examples_notebooks/global_search/index.html b/examples_notebooks/global_search/index.html index ec6851e9..e04af54c 100644 --- a/examples_notebooks/global_search/index.html +++ b/examples_notebooks/global_search/index.html @@ -2531,15 +2531,15 @@ print(result.response)
    ### Major Conflict
     
    -The central conflict in the story revolves around the Paranormal Military Squad's mission to establish contact with extraterrestrial intelligence. This mission involves deciphering alien signals and managing the potential implications of first contact. The conflict is marked by the secrecy and high stakes associated with the mission, as well as the challenges posed by the unknown nature of the extraterrestrial entities [Data: Reports (4, 5, 2, 3, 0)].
    +The central conflict in the story revolves around the Paranormal Military Squad's mission to establish contact with extraterrestrial intelligence. This involves deciphering alien signals and managing the potential implications of first contact. The mission is characterized by its secrecy and high stakes, as well as the challenges posed by the unknown nature of the extraterrestrial entities. The team must navigate these uncertainties and potential threats as they work towards their goal [Data: Reports (4, 5, 2, 3, 0)].
     
     ### Protagonists
     
    -The protagonists are the members of the Paranormal Military Squad, which includes key figures such as Taylor Cruz, Dr. Jordan Hayes, Alex Mercer, and Sam Rivera. These individuals play crucial roles in the mission, contributing their expertise in leadership, signal decryption, and communication with extraterrestrial beings [Data: Reports (4, 5, 2, 3, 0)].
    +The protagonists are the key members of the Paranormal Military Squad, including Taylor Cruz, Dr. Jordan Hayes, Alex Mercer, and Sam Rivera. Each of these individuals plays a crucial role in the mission, bringing their expertise in leadership, signal decryption, diplomatic engagement, and technical innovation to the forefront. Their combined efforts are essential in tackling the challenges posed by the mission [Data: Reports (4, 5, 2, 3, 0)].
     
     ### Antagonist
     
    -The antagonist in the story is not a single entity or character. Instead, it may be considered the unknown and potentially threatening nature of the extraterrestrial signals and the challenges they present to the Paranormal Military Squad's mission [Data: Reports (4, 5, 2, 3, 0)].
    +There is no clear antagonist in the traditional sense. Instead, the conflict primarily involves the challenges and uncertainties associated with extraterrestrial communication and the potential risks it poses. The antagonist could be interpreted as the unknown and potentially threatening nature of the extraterrestrial entities or the obstacles faced by the team in achieving their mission [Data: Reports (4, 5, 2, 3, 0)].
     
    @@ -2690,7 +2690,7 @@ print(f"LLM calls: {result.llm_calls}. LLM tokens: {result.prompt_tokens}")
    -
    LLM calls: 2. LLM tokens: 5274
    +
    LLM calls: 2. LLM tokens: 5283
     
    diff --git a/examples_notebooks/local_search/index.html b/examples_notebooks/local_search/index.html index c4a6b9c9..246a0738 100644 --- a/examples_notebooks/local_search/index.html +++ b/examples_notebooks/local_search/index.html @@ -2347,7 +2347,7 @@ entity_df.head()
    -
    [2024-10-21T18:05:01Z WARN  lance::dataset] No existing dataset at /home/runner/work/graphrag/graphrag/docs/examples_notebooks/inputs/operation dulce/lancedb/entity_description_embeddings.lance, it will be created
    +
    [2024-10-21T20:58:06Z WARN  lance::dataset] No existing dataset at /home/runner/work/graphrag/graphrag/docs/examples_notebooks/inputs/operation dulce/lancedb/entity_description_embeddings.lance, it will be created
     
    @@ -3335,21 +3335,21 @@ print(result.response)
    ### Overview of Agent Alex Mercer
     
    -Agent Alex Mercer is a central figure within the Paranormal Military Squad Team at Dulce Base, where he plays a pivotal role in the team's operations and mission objectives. His responsibilities are multifaceted, encompassing leadership, strategic oversight, and direct involvement in the analysis and interpretation of extraterrestrial signals. Mercer's military background and experience are crucial to his role, as he guides the team through complex scenarios involving potential first contact with alien intelligence [Data: Entities (0, 209); Relationships (5, 8, 6)].
    +Agent Alex Mercer is a central figure within the Paranormal Military Squad Team at Dulce Base, where he plays a pivotal role in the team's operations and mission objectives. His responsibilities are multifaceted, encompassing leadership, strategic oversight, and direct involvement in the analysis and interpretation of extraterrestrial signals. Mercer's military background and experience are crucial to his role, as he guides the team through complex scenarios involving potential first contact with alien intelligence [Data: Entities (0, 209); Relationships (5, 8, 65)].
     
     ### Leadership and Responsibilities
     
    -As a leader, Alex Mercer is instrumental in directing the Paranormal Military Squad's efforts to engage with extraterrestrial intelligence. His leadership style is characterized by a mix of caution and anticipation, reflecting the gravity of the mission at hand. Mercer is responsible for ensuring a cautious approach to interspecies communication, unraveling galactic mysteries, and engaging with alien signals. His role involves not only overseeing the team but also participating in the decryption and analysis of alien messages, which are critical to understanding extraterrestrial societies [Data: Entities (0); Claims (73, 82, 67)].
    +As a leader, Alex Mercer is instrumental in overseeing the Paranormal Military Squad's efforts to engage with extraterrestrial intelligence. His leadership style is characterized by a mix of caution and anticipation, reflecting the gravity of the mission at hand. Mercer is responsible for ensuring a cautious approach to interspecies communication and unraveling galactic mysteries, which involves both strategic planning and hands-on participation in decryption efforts [Data: Entities (0); Relationships (5, 6, 8)].
     
     ### Collaboration and Team Dynamics
     
    -Agent Mercer works closely with other key members of the team, such as Dr. Jordan Hayes, with whom he shares a mutual respect and understanding of the mission's significance. Their collaboration focuses on decrypting and communicating with extraterrestrial intelligence, highlighting the importance of teamwork in achieving their objectives. Mercer also interacts with other team members like Sam Rivera and Taylor Cruz, fostering a collaborative environment that is essential for the success of their mission [Data: Relationships (1, 4, 26, 67); Reports (0)].
    +Agent Mercer works closely with other key members of the team, such as Dr. Jordan Hayes, with whom he collaborates on deciphering alien signals and managing interspecies communication. Their partnership is built on mutual respect and recognition of each other's analytical skills, which is essential for the success of their mission. Additionally, Mercer interacts with other team members like Sam Rivera and Taylor Cruz, highlighting his role in fostering teamwork and collaboration within the squad [Data: Reports (0); Relationships (1, 4, 26, 67)].
     
    -### Strategic Approach and Challenges
    +### Involvement in Extraterrestrial Communication
     
    -Mercer's strategic approach to the mission involves balancing the need for caution with the potential for groundbreaking discoveries. He is aware of the complexities and potential risks associated with extraterrestrial contact, and he emphasizes the importance of understanding and engaging with alien signals. This strategic foresight is crucial as the team navigates the challenges of deciphering alien code and preparing for first contact scenarios [Data: Claims (50, 57, 60); Reports (0)].
    +Mercer's involvement in the decryption and analysis of alien signals is a significant aspect of his role. He is seen as a key figure in establishing contact and dialogue with alien intelligence, acting as a representative of humanity. This responsibility underscores the importance of his work in the broader context of interstellar communication and exploration [Data: Claims (73, 82); Reports (0)].
     
    -In summary, Agent Alex Mercer is a key leader within the Paranormal Military Squad Team, guiding the team through the intricacies of extraterrestrial communication and ensuring that their mission is conducted with strategic foresight and caution. His collaboration with team members and his leadership in the face of potential first contact highlight his critical role in the team's efforts at Dulce Base.
    +In summary, Agent Alex Mercer is a vital member of the Paranormal Military Squad Team, whose leadership and expertise are crucial to the team's mission of engaging with extraterrestrial intelligence. His role involves a delicate balance of strategic oversight, collaboration, and direct involvement in the analysis of alien signals, making him a central figure in the unfolding narrative of human-alien relations.
     
    @@ -3395,21 +3395,21 @@ print(result.response)
    ## Overview of Dr. Jordan Hayes
     
    -Dr. Jordan Hayes is a prominent member of the Paranormal Military Squad Team stationed at Dulce Base. Their primary role involves deciphering alien code and interpreting extraterrestrial patterns, which are crucial for the team's mission of understanding and interacting with extraterrestrial entities [Data: Entities (104, 2); Reports (0)]. Dr. Hayes is known for their analytical mindset and skepticism, which they balance with a willingness to explore new possibilities during their mission [Data: Entities (124); Claims (12)].
    +Dr. Jordan Hayes is a prominent member of the Paranormal Military Squad, a specialized team stationed at Dulce Base. This team is dedicated to the analysis and interpretation of extraterrestrial signals and patterns, with Dr. Hayes playing a crucial role in deciphering alien code and facilitating interspecies communication [Data: Entities (104, 2); Reports (0)].
     
    -## Role and Contributions
    +## Role and Expertise
     
    -Dr. Hayes plays a pivotal role in the Paranormal Military Squad's efforts to communicate with alien intelligence. This involves isolating signal harmonics, decrypting alien messages, and interpreting alien signals for further analysis [Data: Entities (2, 192, 180)]. Their expertise in decryption algorithms and signal analysis is vital to the team's mission, as they work on deciphering extraterrestrial signals and engaging in interstellar communication [Data: Entities (166, 180); Claims (68, 79)].
    +Dr. Hayes is known for their analytical mindset and expertise in decryption algorithms, which are essential for interpreting alien signals. Their work involves isolating signal harmonics and decrypting alien messages, which are critical components of the team's mission to understand and interact with extraterrestrial entities [Data: Entities (2, 180, 192, 166); Claims (61, 68, 79)]. Dr. Hayes's focus on empirical evidence and adaptability is a key aspect of their approach to the complex challenges posed by extraterrestrial communication [Data: Entities (2); Claims (12)].
     
    -## Collaboration and Relationships
    +## Contributions to the Paranormal Military Squad
     
    -Dr. Hayes collaborates closely with other team members, including Agent Alex Mercer, Sam Rivera, and Taylor Cruz. Their partnership with Alex Mercer is particularly significant, as they work together on managing interspecies communication and interpreting signals crucial to the team's operations [Data: Relationships (1, 4, 26, 67); Reports (0)]. Despite occasional differing views with Taylor Cruz, their collaboration is essential for the success of the mission [Data: Relationships (9, 15)].
    +Within the Paranormal Military Squad, Dr. Hayes collaborates closely with other team members, including Agent Alex Mercer, to manage interspecies communication and analyze extraterrestrial patterns. Their partnership is marked by mutual respect and a shared commitment to the mission's objectives [Data: Relationships (1, 4, 26, 67); Reports (0)]. Dr. Hayes's work is pivotal in the team's efforts to prepare for potential first contact scenarios and to decipher alien signals that could represent both threats and opportunities for untapped wisdom [Data: Reports (0); Claims (84)].
     
     ## Scientific Breakthroughs and Challenges
     
    -Dr. Hayes is on the verge of a scientific breakthrough, as they analyze evolving alien signals and consider the implications of a tandem evolution with extraterrestrial intelligence [Data: Claims (49, 74)]. They have discovered hidden technology and deciphered extraterrestrial patterns that could represent potential threats or untapped wisdom, highlighting the complex nature of their mission [Data: Claims (18, 84)].
    +Dr. Hayes is on the verge of significant scientific breakthroughs, as they analyze evolving alien signals and consider the implications of these patterns. Their work suggests a potential tandem evolution with extraterrestrial intelligence, highlighting the profound impact of their research on the understanding of alien thought patterns and communication [Data: Claims (49, 74)]. Despite the challenges, Dr. Hayes remains focused on the mission, contemplating the skepticism and the need to accept other possibilities as they navigate the unknown [Data: Claims (12, 54)].
     
    -In summary, Dr. Jordan Hayes is a central figure in the Paranormal Military Squad's mission at Dulce Base, contributing significantly to the understanding and communication with extraterrestrial entities. Their analytical skills, collaboration with team members, and potential scientific breakthroughs underscore their importance in the team's efforts to navigate the complexities of interstellar communication.
    +In summary, Dr. Jordan Hayes is a central figure in the Paranormal Military Squad's mission to engage with extraterrestrial intelligence. Their expertise in signal analysis and decryption, combined with their analytical approach, makes them an invaluable asset to the team as they explore the frontiers of interstellar communication and diplomacy.
     
    @@ -3971,7 +3971,7 @@ print(candidate_questions.response)
    -
    ['- What is the role of Alex Mercer in Operation: Dulce?', '- How does the Paranormal Military Squad interact with extraterrestrial intelligence at Dulce Base?', '- What are the main objectives of Operation: Dulce?', '- How does the environment of Dulce Military Base affect the team members?', '- What challenges does the Paranormal Military Squad face during their mission at Dulce Base?']
    +
    ['- What is the role of Alex Mercer in Operation: Dulce?', '- How does the Paranormal Military Squad interact with extraterrestrial intelligence at Dulce Base?', '- What are the main objectives of Operation: Dulce?', '- How does the environment of the Dulce Military Base affect the team members?', '- What is the significance of New Mexico in the context of Operation: Dulce?']
     
    diff --git a/search/search_index.json b/search/search_index.json index 3531bcf7..d444f92a 100644 --- a/search/search_index.json +++ b/search/search_index.json @@ -1 +1 @@ -{"config": {"lang": ["en"], "separator": "[\\s\\-]+", "pipeline": ["stopWordFilter"]}, "docs": [{"location": "", "title": "Welcome to GraphRAG", "text": "

    \ud83d\udc49 Microsoft Research Blog Post \ud83d\udc49 GraphRAG Accelerator \ud83d\udc49 GraphRAG Arxiv

    Figure 1: An LLM-generated knowledge graph built using GPT-4 Turbo.

    GraphRAG is a structured, hierarchical approach to Retrieval Augmented Generation (RAG), as opposed to naive semantic-search approaches using plain text snippets. The GraphRAG process involves extracting a knowledge graph out of raw text, building a community hierarchy, generating summaries for these communities, and then leveraging these structures when perform RAG-based tasks.

    To learn more about GraphRAG and how it can be used to enhance your LLMs ability to reason about your private data, please visit the Microsoft Research Blog Post.

    "}, {"location": "#solution-accelerator", "title": "Solution Accelerator \ud83d\ude80", "text": "

    To quickstart the GraphRAG system we recommend trying the Solution Accelerator package. This provides a user-friendly end-to-end experience with Azure resources.

    "}, {"location": "#get-started-with-graphrag", "title": "Get Started with GraphRAG \ud83d\ude80", "text": "

    To start using GraphRAG, check out the Get Started guide. For a deeper dive into the main sub-systems, please visit the docpages for the Indexer and Query packages.

    "}, {"location": "#graphrag-vs-baseline-rag", "title": "GraphRAG vs Baseline RAG \ud83d\udd0d", "text": "

    Retrieval-Augmented Generation (RAG) is a technique to improve LLM outputs using real-world information. This technique is an important part of most LLM-based tools and the majority of RAG approaches use vector similarity as the search technique, which we call Baseline RAG. GraphRAG uses knowledge graphs to provide substantial improvements in question-and-answer performance when reasoning about complex information. RAG techniques have shown promise in helping LLMs to reason about private datasets - data that the LLM is not trained on and has never seen before, such as an enterprise\u2019s proprietary research, business documents, or communications. Baseline RAG was created to help solve this problem, but we observe situations where baseline RAG performs very poorly. For example:

    To address this, the tech community is working to develop methods that extend and enhance RAG. Microsoft Research\u2019s new approach, GraphRAG, uses LLMs to create a knowledge graph based on an input corpus. This graph, along with community summaries and graph machine learning outputs, are used to augment prompts at query time. GraphRAG shows substantial improvement in answering the two classes of questions described above, demonstrating intelligence or mastery that outperforms other approaches previously applied to private datasets.

    "}, {"location": "#the-graphrag-process", "title": "The GraphRAG Process \ud83e\udd16", "text": "

    GraphRAG builds upon our prior research and tooling using graph machine learning. The basic steps of the GraphRAG process are as follows:

    "}, {"location": "#index", "title": "Index", "text": ""}, {"location": "#query", "title": "Query", "text": "

    At query time, these structures are used to provide materials for the LLM context window when answering a question. The primary query modes are:

    "}, {"location": "#prompt-tuning", "title": "Prompt Tuning", "text": "

    Using GraphRAG with your data out of the box may not yield the best possible results. We strongly recommend to fine-tune your prompts following the Prompt Tuning Guide in our documentation.

    "}, {"location": "blog_posts/", "title": "Microsoft Research Blog", "text": "