diff --git a/config/json_yaml/index.html b/config/json_yaml/index.html
index 0643a214..97ce32f0 100644
--- a/config/json_yaml/index.html
+++ b/config/json_yaml/index.html
@@ -1401,8 +1401,8 @@
  • async_mode (see Async Mode top-level config)
  • batch_size int - The maximum batch size to use.
  • batch_max_tokens int - The maximum batch # of tokens.
  • -
  • target required|all - Determines which set of embeddings to emit.
  • -
  • skip list[str] - Which embeddings to skip.
  • +
  • target required|all|none - Determines which set of embeddings to emit.
  • +
  • skip list[str] - Which embeddings to skip. Only useful if target=all to customize the list.
  • vector_store dict - The vector store to use. Configured for lancedb by default.
  • type str - lancedb or azure_ai_search. Default=lancedb
  • db_uri str (only for lancedb) - The database uri. Default=storage.base_dir/lancedb
  • @@ -1410,7 +1410,7 @@
  • api_key str (optional - only for AI Search) - The AI Search api key to use.
  • audience str (only for AI Search) - Audience for managed identity token if managed identity authentication is used.
  • overwrite bool (only used at index creation time) - Overwrite the collection if it exists. Default=True
  • -
  • collection_name str - The name of a vector collection. Default=entity_description_embeddings
  • +
  • container_name str - The name of a vector container. This stores all indexes (tables) for a given dataset ingest. Default=default
  • strategy dict - Fully override the text-embedding strategy.
  • chunks
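The interaction between `target` and `skip` described in the hunk above can be sketched in Python. This is only an illustration of the documented semantics, not GraphRAG's implementation: the helper `resolve_embeddings` is hypothetical, and so are the embedding names other than `entity.description` (the one name that appears in this diff). It also assumes that `skip` is ignored unless `target` is `all`, which is one reading of "only useful if target=all".

```python
from typing import Iterable

# Hypothetical embedding names for illustration. Only "entity.description"
# appears in this diff; the other two are placeholders.
ALL_EMBEDDINGS = ["entity.description", "example.a", "example.b"]
REQUIRED_EMBEDDINGS = ["entity.description"]


def resolve_embeddings(target: str, skip: Iterable[str] = ()) -> list[str]:
    """Return the embedding names to emit for a given target/skip pair."""
    if target == "none":
        # Emit nothing at all.
        return []
    if target == "required":
        # Assumption: skip has no effect here, since the docs say it is
        # only useful when target=all.
        return list(REQUIRED_EMBEDDINGS)
    if target == "all":
        # skip trims the full list down to a custom subset.
        skipped = set(skip)
        return [name for name in ALL_EMBEDDINGS if name not in skipped]
    raise ValueError(f"target must be required, all, or none; got {target!r}")


print(resolve_embeddings("all", skip=["example.a"]))
```

Under these assumptions, `target="all"` with a `skip` list is the only combination in which `skip` changes the result.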

    diff --git a/examples_notebooks/global_search/index.html b/examples_notebooks/global_search/index.html
    index 930cbc0e..9972d5b6 100644
    --- a/examples_notebooks/global_search/index.html
    +++ b/examples_notebooks/global_search/index.html
    @@ -2501,15 +2501,15 @@ print(result.response)
    ### Major Conflict
     
    -The central conflict in the story revolves around the Paranormal Military Squad's mission to establish contact with extraterrestrial intelligence. This mission involves deciphering alien signals and managing the potential implications of first contact. The conflict is marked by the secrecy and high stakes associated with the mission, as well as the challenges posed by the unknown nature of the extraterrestrial entities [Data: Reports (4, 5, 2, 3, 0)].
    +The central conflict in the story revolves around the Paranormal Military Squad's mission to establish contact with extraterrestrial intelligence. This mission involves deciphering alien signals and managing the potential implications of first contact. The conflict is marked by the secrecy and high stakes associated with the mission, as well as the challenges posed by the unknown nature of the extraterrestrial entities [Data: Reports (4, 5, 2, 3)].
     
     ### Protagonists
     
    -The protagonists are the key members of the Paranormal Military Squad, including Taylor Cruz, Dr. Jordan Hayes, Alex Mercer, and Sam Rivera. Each of these individuals plays a crucial role in the mission, contributing their unique expertise to the squad's efforts in extraterrestrial communication and engagement [Data: Reports (4, 5, 2, 3, 0)].
    +The protagonists are the key members of the Paranormal Military Squad, including Taylor Cruz, Dr. Jordan Hayes, Alex Mercer, and Sam Rivera. Each of these individuals plays a crucial role in the mission, bringing their expertise in leadership, signal decryption, diplomatic engagement, and technical innovation to the forefront [Data: Reports (4, 5, 2, 3)].
     
     ### Antagonist
     
    -There is no clear antagonist in the traditional sense. Instead, the conflict primarily involves the challenges and uncertainties associated with extraterrestrial communication rather than a specific opposing force. The unknown nature of the extraterrestrial entities and the potential risks they pose could be considered an abstract antagonist in the story [Data: Reports (4, 5, 2, 3, 0)].
    +The antagonist in the story is not a single entity or character. Instead, it may be considered the unknown and potentially threatening nature of the extraterrestrial signals and the challenges they present to the Paranormal Military Squad's mission [Data: Reports (4, 5, 2, 3)].
     
    @@ -2660,7 +2660,7 @@ print(f"LLM calls: {result.llm_calls}. LLM tokens: {result.prompt_tokens}")
    -
    LLM calls: 2. LLM tokens: 5279
    +
    LLM calls: 2. LLM tokens: 5261
     
    diff --git a/examples_notebooks/local_search/index.html b/examples_notebooks/local_search/index.html
    index 71dabc3e..279427a9 100644
    --- a/examples_notebooks/local_search/index.html
    +++ b/examples_notebooks/local_search/index.html
    @@ -2271,7 +2271,7 @@ COMMUNITY_LEVEL = 2
     # load description embeddings to an in-memory lancedb vectorstore
     # to connect to a remote db, specify url and port values.
     description_embedding_store = LanceDBVectorStore(
    -    collection_name="entity_description_embeddings",
    +    collection_name="entity.description",
     )
     description_embedding_store.connect(db_uri=LANCEDB_URI)
     entity_description_embeddings = store_entity_semantic_embeddings(
    @@ -2290,7 +2290,7 @@ entities = read_indexer_entities(entity_df, entity_embedding_df, COMMUNITY_LEVEL
     # load description embeddings to an in-memory lancedb vectorstore
     # to connect to a remote db, specify url and port values.
     description_embedding_store = LanceDBVectorStore(
    -    collection_name="entity_description_embeddings",
    +    collection_name="entity.description",
     )
     description_embedding_store.connect(db_uri=LANCEDB_URI)
     entity_description_embeddings = store_entity_semantic_embeddings(
    @@ -2317,7 +2317,7 @@ entity_df.head()
    -
    [2024-10-30T21:50:55Z WARN  lance::dataset] No existing dataset at /home/runner/work/graphrag/graphrag/docs/examples_notebooks/inputs/operation dulce/lancedb/entity_description_embeddings.lance, it will be created
    +
    [2024-11-01T22:03:01Z WARN  lance::dataset] No existing dataset at /home/runner/work/graphrag/graphrag/docs/examples_notebooks/inputs/operation dulce/lancedb/entity.description.lance, it will be created
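The two log lines above suggest that the on-disk dataset name follows the collection name, with a `.lance` suffix under the database directory. A minimal sketch of that mapping follows. The helper `lance_dataset_path` is hypothetical and inferred only from these log lines; it is not part of the LanceDB or GraphRAG API.

```python
from pathlib import PurePosixPath


def lance_dataset_path(db_uri: str, collection_name: str) -> str:
    """Build the dataset path that the warning in the log refers to.

    Assumption: the dataset lives at <db_uri>/<collection_name>.lance,
    as the logged paths in this diff suggest.
    """
    return str(PurePosixPath(db_uri) / f"{collection_name}.lance")


# Old and new collection names from the diff, with a shortened example db_uri.
print(lance_dataset_path("inputs/operation dulce/lancedb", "entity_description_embeddings"))
print(lance_dataset_path("inputs/operation dulce/lancedb", "entity.description"))
```

If this reading is right, a store indexed under the old collection name will not be found under the new one, which would explain why the log reports that the dataset will be created.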
     
    @@ -3305,21 +3305,21 @@ print(result.response)
    ### Overview of Agent Alex Mercer
     
    -Agent Alex Mercer is a key figure within the Paranormal Military Squad Team at Dulce Base, where he plays a significant leadership role. His responsibilities include overseeing operations and making critical decisions during potential first contact scenarios with extraterrestrial intelligence. Mercer's military background and experience are crucial to his role, as he guides the team in efforts to establish contact with alien intelligence, ensuring that such engagements are handled with strategic foresight and caution [Data: Entities (0, 209); Relationships (5, 8, 6)].
    +Agent Alex Mercer is a central figure within the Paranormal Military Squad Team at Dulce Base, where he plays a pivotal role in the team's operations and mission objectives. His responsibilities are multifaceted, encompassing leadership, strategic oversight, and direct involvement in the analysis and interpretation of extraterrestrial signals. Mercer's military background and experience are crucial to his role, as he guides the team through complex scenarios involving potential first contact with alien intelligence [Data: Entities (0, 209); Relationships (5, 8, 6)].
     
    -### Role and Responsibilities
    +### Leadership and Responsibilities
     
    -Mercer's role is multifaceted, encompassing the analysis of paranormal activities and participation in the operational aspects of Dulce Base's command center. He is deeply involved in the decryption and analysis of alien signals, contributing to the understanding of extraterrestrial societies. His leadership is characterized by a diplomatic approach, balancing the need for caution with the potential for groundbreaking discoveries [Data: Entities (0, 209); Claims (73, 85, 50, 67, 82)].
    +As a leader, Alex Mercer is instrumental in overseeing the team's efforts to engage with extraterrestrial intelligence. His role involves ensuring a cautious and strategic approach to interspecies communication, which is vital for the success of their mission. Mercer's leadership is characterized by a mix of concern and anticipation, reflecting the gravity of the mission at hand. He is responsible for guiding the team's response to extraterrestrial contact, ensuring that the engagement is handled with care and foresight [Data: Entities (0); Relationships (5, 8, 6)].
     
     ### Collaboration and Team Dynamics
     
    -Agent Mercer works closely with other key members of the Paranormal Military Squad, such as Dr. Jordan Hayes. Their collaboration focuses on decrypting and communicating with extraterrestrial intelligence, highlighting the importance of teamwork in achieving their mission's objectives. Mercer is also involved in discussions with colleagues like Sam Rivera and Taylor Cruz, emphasizing the need for intuition and trust alongside technological advancements [Data: Relationships (1, 4, 26, 67); Claims (8, 9)].
    +Agent Mercer works closely with other key members of the Paranormal Military Squad, such as Dr. Jordan Hayes and Sam Rivera. His collaboration with Dr. Hayes is particularly significant, as they jointly focus on decrypting and communicating with extraterrestrial intelligence. This partnership is built on mutual respect and recognition of each other's analytical skills, which are essential for the team's success. Mercer's interactions with Sam Rivera highlight his role as a mentor, providing guidance and fostering a relationship based on intuition and trust [Data: Reports (0); Relationships (1, 4, 2)].
     
    -### Strategic Approach and Challenges
    +### Involvement in Extraterrestrial Communication
     
    -Mercer is known for his strategic approach to potential extraterrestrial contact, contemplating the transformation of their mission from a simple recovery operation to a complex interstellar engagement. He acknowledges the structured nature of the patterns they analyze, suggesting a readiness for unexpected developments. His leadership is crucial in navigating the complexities of interspecies communication and ensuring the team's preparedness for any eventualities [Data: Claims (22, 37, 41, 45, 57, 60)].
    +Mercer's involvement in the decryption and analysis of alien signals is a critical aspect of his role. He is actively engaged in interpreting first contact data and ensuring that the team's efforts align with the broader objectives of Operation: Dulce. His work is not only focused on overseeing the team but also on unraveling galactic mysteries and engaging with alien signals. This involvement underscores his position as a key figure in humanity's efforts to establish contact and dialogue with extraterrestrial intelligence [Data: Claims (73, 85, 82); Reports (0)].
     
    -In summary, Agent Alex Mercer is a pivotal leader within the Paranormal Military Squad, guiding his team through the challenges of extraterrestrial communication and ensuring that their mission is conducted with both caution and ambition. His role is essential in bridging the gap between humanity and potential alien civilizations, marking a significant chapter in interstellar exploration [Data: Reports (0); Entities (0, 209); Relationships (5, 8, 6)].
    +In summary, Agent Alex Mercer is a vital member of the Paranormal Military Squad Team, whose leadership and expertise are crucial to the team's mission at Dulce Base. His role in guiding the team through the complexities of extraterrestrial communication highlights his importance in the broader narrative of interstellar exploration and contact.
     
    @@ -3363,23 +3363,23 @@ print(result.response)
    -
    ## Overview of Dr. Jordan Hayes
    +
    # Overview of Dr. Jordan Hayes
     
    -Dr. Jordan Hayes is a prominent member of the Paranormal Military Squad, a specialized team based at Dulce Base, dedicated to the study and communication with extraterrestrial entities. Dr. Hayes plays a crucial role in the team's mission, focusing on deciphering alien code and interpreting extraterrestrial patterns. This expertise is vital to the squad's efforts in understanding and interacting with alien intelligence [Data: Entities (104, 2); Reports (0)].
    +Dr. Jordan Hayes is a prominent member of the Paranormal Military Squad Team stationed at Dulce Base, where they play a crucial role in the team's mission to understand and interact with extraterrestrial entities. Dr. Hayes is primarily focused on deciphering alien code and interpreting extraterrestrial patterns, which are essential tasks for the team's efforts in interspecies communication [Data: Entities (104, 2); Reports (0)].
     
    -## Role and Responsibilities
    +## Role and Expertise
     
    -Dr. Hayes is deeply involved in the analysis and interpretation of alien signals, which includes isolating signal harmonics and decrypting alien messages. This work is central to the team's operations at Dulce Base, where they aim to establish interspecies communication. Dr. Hayes' analytical mindset and expertise in decryption algorithms are pivotal in deciphering extraterrestrial signals, which is a cornerstone of the team's mission [Data: Entities (2, 166, 180); Claims (61, 68, 79)].
    +Dr. Hayes is known for their analytical mindset and expertise in decryption algorithms, which are vital for interpreting alien signals and engaging in interstellar communication. Their work involves isolating signal harmonics, decrypting alien messages, and analyzing data related to star alignments. This expertise is critical to the team's mission at Dulce Base, as they work on deciphering extraterrestrial signals and preparing for potential first contact scenarios [Data: Entities (2, 166, 180, 140); Claims (36, 61, 68)].
     
    -## Collaboration and Team Dynamics
    +## Collaboration and Relationships
     
    -Dr. Hayes works closely with other key members of the Paranormal Military Squad, such as Agent Alex Mercer. Their collaboration is essential for managing interspecies communication and interpreting alien signals. The mutual respect and understanding between Dr. Hayes and Agent Mercer highlight the importance of teamwork in their mission. Additionally, Dr. Hayes collaborates with colleagues like Sam Rivera and Taylor Cruz, further emphasizing the collective effort required to achieve the team's objectives [Data: Reports (0); Relationships (1, 4, 21, 9, 15)].
    +Dr. Hayes collaborates closely with other key members of the Paranormal Military Squad, including Agent Alex Mercer. Their partnership is characterized by mutual respect and a shared focus on managing interspecies communication. Together, they work on decrypting and responding to alien messages, which is a cornerstone of their operation at Dulce Base. Dr. Hayes also works with colleagues like Sam Rivera and Taylor Cruz, contributing to the team's collective effort in understanding alien communication [Data: Relationships (1, 4, 21, 9); Reports (0)].
     
    -## Scientific Contributions and Challenges
    +## Challenges and Achievements
     
    -Dr. Hayes is known for their analytical and skeptical approach, which is crucial in the complex task of deciphering alien code. This skepticism is balanced with a willingness to explore other possibilities, reflecting the challenges faced in their mission. Dr. Hayes' work has led to significant discoveries, such as deciphering extraterrestrial patterns that could represent potential threats or untapped wisdom. These contributions underscore the scientific breakthroughs that Dr. Hayes is on the verge of achieving [Data: Claims (12, 49, 84); Reports (0)].
    +Throughout their mission, Dr. Hayes has faced the challenge of balancing skepticism with the need to accept other possibilities, reflecting the complex nature of their work. They have been involved in significant discoveries, such as deciphering extraterrestrial patterns that could represent potential threats or untapped wisdom. These achievements highlight Dr. Hayes' critical role in the team's success and their potential for scientific breakthroughs in understanding extraterrestrial intelligence [Data: Claims (12, 18, 49, 84); Reports (0)].
     
    -In summary, Dr. Jordan Hayes is a central figure in the Paranormal Military Squad, contributing significantly to the understanding and communication with extraterrestrial entities. Their expertise and collaborative efforts are vital to the success of the team's mission at Dulce Base.
    +In summary, Dr. Jordan Hayes is a pivotal figure within the Paranormal Military Squad, contributing their expertise and analytical skills to the team's mission of deciphering alien communication and preparing for interstellar engagement. Their work at Dulce Base underscores the importance of their role in advancing humanity's understanding of extraterrestrial entities.
     
    @@ -3941,7 +3941,7 @@ print(candidate_questions.response)
    -
    ['- What is the role of Alex Mercer in Operation: Dulce?', '- How does the Paranormal Military Squad interact with extraterrestrial intelligence at Dulce Base?', '- What are the main objectives of Operation: Dulce?', '- How does the environment of Dulce Military Base affect the team members?', '- What is the significance of New Mexico in the context of Operation: Dulce?']
    +
    ['- What is the role of Alex Mercer in Operation: Dulce?', '- How does the Paranormal Military Squad engage with extraterrestrial intelligence at Dulce Base?', '- What are the main challenges faced by the team at Dulce Military Base during their mission?', '- How does the environment of New Mexico impact the operations at Dulce Base?', '- What is the significance of the alien signals intercepted at Dulce Base?']
     
    diff --git a/search/search_index.json b/search/search_index.json
    index 3022d379..78998e45 100644
    --- a/search/search_index.json
    +++ b/search/search_index.json
    @@ -1 +1 @@
    -{"config": {"lang": ["en"], "separator": "[\\s\\-]+", "pipeline": ["stopWordFilter"]}, "docs": [{"location": "", "title": "Welcome to GraphRAG", "text": "

    \ud83d\udc49 Microsoft Research Blog Post \ud83d\udc49 GraphRAG Accelerator \ud83d\udc49 GraphRAG Arxiv

    Figure 1: An LLM-generated knowledge graph built using GPT-4 Turbo.

    GraphRAG is a structured, hierarchical approach to Retrieval Augmented Generation (RAG), as opposed to naive semantic-search approaches using plain text snippets. The GraphRAG process involves extracting a knowledge graph out of raw text, building a community hierarchy, generating summaries for these communities, and then leveraging these structures when performing RAG-based tasks.

    To learn more about GraphRAG and how it can be used to enhance your LLM's ability to reason about your private data, please visit the Microsoft Research Blog Post.

    "}, {"location": "#solution-accelerator", "title": "Solution Accelerator \ud83d\ude80", "text": "

    To quickstart the GraphRAG system we recommend trying the Solution Accelerator package. This provides a user-friendly end-to-end experience with Azure resources.

    "}, {"location": "#get-started-with-graphrag", "title": "Get Started with GraphRAG \ud83d\ude80", "text": "

    To start using GraphRAG, check out the Get Started guide. For a deeper dive into the main sub-systems, please visit the docpages for the Indexer and Query packages.

    "}, {"location": "#graphrag-vs-baseline-rag", "title": "GraphRAG vs Baseline RAG \ud83d\udd0d", "text": "

    Retrieval-Augmented Generation (RAG) is a technique to improve LLM outputs using real-world information. This technique is an important part of most LLM-based tools and the majority of RAG approaches use vector similarity as the search technique, which we call Baseline RAG. GraphRAG uses knowledge graphs to provide substantial improvements in question-and-answer performance when reasoning about complex information. RAG techniques have shown promise in helping LLMs to reason about private datasets - data that the LLM is not trained on and has never seen before, such as an enterprise\u2019s proprietary research, business documents, or communications. Baseline RAG was created to help solve this problem, but we observe situations where baseline RAG performs very poorly. For example:

    To address this, the tech community is working to develop methods that extend and enhance RAG. Microsoft Research\u2019s new approach, GraphRAG, uses LLMs to create a knowledge graph based on an input corpus. This graph, along with community summaries and graph machine learning outputs, is used to augment prompts at query time. GraphRAG shows substantial improvement in answering the two classes of questions described above, demonstrating intelligence or mastery that outperforms other approaches previously applied to private datasets.

    "}, {"location": "#the-graphrag-process", "title": "The GraphRAG Process \ud83e\udd16", "text": "

    GraphRAG builds upon our prior research and tooling using graph machine learning. The basic steps of the GraphRAG process are as follows:

    "}, {"location": "#index", "title": "Index", "text": ""}, {"location": "#query", "title": "Query", "text": "

    At query time, these structures are used to provide materials for the LLM context window when answering a question. The primary query modes are:

    "}, {"location": "#prompt-tuning", "title": "Prompt Tuning", "text": "

    Using GraphRAG with your data out of the box may not yield the best possible results. We strongly recommend fine-tuning your prompts by following the Prompt Tuning Guide in our documentation.

    "}, {"location": "blog_posts/", "title": "Microsoft Research Blog", "text": "