* Collapse create_final_communities
* Semver
* Spellcheck
* Clean up filtering
* Add space in title
* Format
* Cleanup imports and format
* Spruce up the tests
* Update dictionary.txt
* Spellcheck
---------
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* Setup basic verb test runner
* Replace join_text_units_to_entity_ids with subflow
* Update comments
* Replace join_text_units_to_relationship_ids subflow
* Roll in final select
* Reuse assertion util
* Small fix + format
* Format/typing
* Semver
* Format/typing
* Semver
* Revert format changes
* Fix smoke test subworkflow count
* Edit subworkflows for another smoke test
* Update test parquets for covariates
* Collapse covariate join
* Rework subtasks for per-flow customization
* Format
* Semver
* Fix smoke test
* Setup basic verb test runner
* Replace join_text_units_to_entity_ids with subflow
* Update comments
* Replace join_text_units_to_relationship_ids subflow
* Roll in final select
* Reuse assertion util
* Small fix + format
* Format/typing
* Semver
* Format/typing
* Semver
* Revert format changes
* Fix smoke test subworkflow count
* Edit subworkflows for another smoke test
* fix: fix the bug that community context builder will cause a report to be repeated twice in local mode.
* Fix duplicates in community context builder
* Small tweaks on code
---------
Co-authored-by: jarlor <zjl58960902@outlook.com>
Update factories.py to allow the usage of the request timeout ChatOpenAI parameter
allow the usage of the request timeout ChatOpenAI parameter
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
Correct links to verbs in comments
Updated the links in comments to reflect new paths for 'derive' and 'aggregate' verbs. This improves documentation and ensures that references are up to date for future developers.
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* Moved query loading from file to helper function
* added loading parquets from blob to function
* resolved adlfs async error
* debugging cleanup and small fixes
* added connection string support
* semversioner and ruff fixes
* completed testing for merge with main
* more ruff changes
* fixed unbound vars warning
* rewrote function to use storage utils
* removed unused vars
---------
Co-authored-by: Kenny Zhang <zhangken@microsoft.com>
* Create entypoint for cli and api (#1067)
* Add cli and api entrypoints for update index
* Semver
* Update docs
* Run tests on feature branch main
* Better /main handling in tests
* Clean and organize run index code
* Ruff fix
* Pyright fix
* Format fixes
* Pyright fix
* Format
* Fix integ tests
* Fix ruff
* Reorganize and clean up
* Optimized _build_text_unit_context function for improved time and space complexity
Refactored the _build_text_unit_context function to enhance its performance and efficiency. Key optimizations include:
1. Set for Text Unit IDs: Replaced list-based membership checks with a set (text_unit_ids_set) to achieve constant-time complexity for membership checks, reducing overall time complexity.
2. Direct Attribute Removal: Utilized pop with a default value (None) to directly remove attributes entity_order and num_relationships from text units, minimizing overhead and avoiding potential KeyError.
3. Default Dictionary for Entity Orders: Implemented defaultdict for managing entity orders, simplifying the ranking process and improving readability.
These improvements result in a more efficient function with better performance, especially when handling large datasets or numerous selected entities. The refactoring ensures that the core functionality remains unchanged while enhancing both time and space complexity.
* Format
* Ruff fixes
* semver
---------
Co-authored-by: arjun-234 <arjun.darji@yudiz.com>
Co-authored-by: Arjun D. <103405661+arjun-234@users.noreply.github.com>
* Added graphrag_import_neo4j_cypher Notebook
* changed to procedure for setting embedding property to save disk space
* Reformat and cleanup
* semver
* Poetry lock update
* Update AAIS docs
* Rename contrib folder
* Merge from main
* Revert "Merge from main"
This reverts commit a399dde97b689a5b5c62dc2e9c2290cb2503b3a4.
* Fix ruff check
* Add readme and fix tests
* Fix community reports
---------
Co-authored-by: Michael Hunger <github@jexp.de>
* changed placement of lancedb dir to under /artifacts
* ruff checks and semversioner
* added support for static paths
* added support for streaming
* more ruff changes
* ruff format changes
* removed string concat for path formation
* added more ruff checks
* removed os.join usage
* more ruff fixes and removed unneccesary path creations
* replaced cast calls with str()
---------
Co-authored-by: Kenny Zhang <zhangken@microsoft.com>