403 Commits

Author SHA1 Message Date
Kenny Zhang
d220fa63ab Merge branch 'kennyzhang/sql-server-support' of github.com:microsoft/graphrag into kennyzhang/sql-server-support 2025-05-06 16:32:11 -04:00
Kenny Zhang
26c4a70ce4 prep storage class for silicon indexing 2025-05-06 16:31:53 -04:00
Kenny Zhang
4cd3186135 allow embedding to be stored as vectors 2025-05-01 15:49:18 -04:00
Kenny Zhang
d14c609d71 cicd linting errors 2025-04-17 15:54:03 -04:00
Kenny Zhang
1917791ddc added tracking to log for row insertion 2025-04-17 15:29:02 -04:00
Kenny Zhang
c8f008285f generated new lockfile 2025-04-17 15:19:32 -04:00
Kenny Zhang
ab29d2c8ba modified tests 2025-04-17 14:02:43 -04:00
Kenny Zhang
d487c743bf sem 2025-04-17 13:45:30 -04:00
Kenny Zhang
77c70a7bec added TODO for create_tables 2025-04-14 16:45:31 -04:00
Kenny Zhang
abb5f5b8c9 added outline for manual table creation 2025-04-11 16:46:23 -04:00
Kenny Zhang
37f9350822 introduced autogenerate tables functionality 2025-04-11 15:42:33 -04:00
Kenny Zhang
c3f8849493 added support for ManagedIdentityCredential 2025-04-11 14:08:33 -04:00
Kenny Zhang
7c6e441303 refactored logging for other storage functions 2025-04-10 15:53:32 -04:00
Kenny Zhang
d4aa8079c6 added more logging for row insertion 2025-04-10 15:48:07 -04:00
Kenny Zhang
93b772e84c Merge branch 'main' of github.com:microsoft/graphrag into kennyzhang/sql-server-support 2025-04-10 15:13:51 -04:00
Dayenne Souza
74ad1d4a0c
Update .vsts-ci.yml (#1874) 2025-04-10 10:31:03 -06:00
Dayenne Souza
89381296c3
fix yaml path in unified-search-app (#1873) 2025-04-10 13:02:00 -03:00
Kenny Zhang
d378bdfe3a cleaned up comments and formatting 2025-04-08 18:03:28 -04:00
Kenny Zhang
73282a6f4f re-added overwrite table flag 2025-04-08 18:01:42 -04:00
Dayenne Souza
66aab4267e
add vsts deploy file for unified search app (#1869)
* add vsts deploy file for unified search app

* fix file name

* remove unused tasks

* remove unused file
2025-04-08 17:08:02 -03:00
Kenny Zhang
31529476af confirmed successful run of index pipeline 2025-04-08 14:08:09 -04:00
Kenny Zhang
8f6b349d05 add overwrite param 2025-04-08 13:44:02 -04:00
Kenny Zhang
5b54709f30 added list serialization and deserialization 2025-04-08 10:57:22 -04:00
gaudyb
0e1a6e3770
Unified search added to graphrag (#1862)
* unified search app added to graphrag repository

* ignore print statements

* update words for unified-search

* fix lint errors

* fix lint error

* fix module name

---------

Co-authored-by: Gaudy Blanco <gaudy-microsoft@MacBook-Pro-m4-Gaudy-For-Work.local>
2025-04-07 11:59:02 -06:00
Kenny Zhang
b6c1edf867 hook up sql server class to rest of graphrag 2025-04-07 10:59:47 -04:00
Kenny Zhang
a843c6c479 refactored comments 2025-04-04 16:37:26 -04:00
Kenny Zhang
4cded8111d ruff formatting 2025-04-04 16:33:01 -04:00
Kenny Zhang
f1e5a0c96c cleaned up formatting and linting 2025-04-04 16:32:33 -04:00
Kenny Zhang
652cf84d6d debugged CRUD methods for sql server storage class 2025-04-04 16:21:54 -04:00
Kenny Zhang
febd55fb33 added more comments and logging 2025-04-03 15:27:49 -04:00
Kenny Zhang
6fad33f316 added warnings for non-parquet calls 2025-04-03 15:23:14 -04:00
Kenny Zhang
849009d84c cleaned up SQL server storage implementation 2025-04-03 14:20:37 -04:00
Kenny Zhang
b24de53cbe generated initial implementation for sql server support 2025-04-03 12:55:58 -04:00
Kenny Zhang
d896830b70 Merge branch 'main' of github.com:microsoft/graphrag 2025-04-01 15:34:26 -04:00
KennyZhang1
61769dd47e
Vector Store Integration Tests (#1856)
* Add vector store id reference to embeddings config.

* generated initial vector store pytests

* cleaned up cosmosdb vector store test

* fixed class name typo and debugged cosmosdb vector store test

* reset emulator connection string

* remove unnecessary comments

* removed extra comments from azure ai search test

* ruff

* semversioner

* fix cicd issues

* bypass diskANN policy for test env

* handle floating point imprecision

---------

Co-authored-by: Derek Worthen <worthend.derek@gmail.com>
2025-04-01 11:05:04 -04:00
Gabriel Nieves-Ponce
ffd8db7104
Gnievesponce prompt tune embedd chunking (#1826)
* Added support for embeddings chunking as defined by the config.

* ran semvisor -t patch

* Eliminated redundant code by using the embed_text strategy directly

* Added fix to support brackets within the corpus text; for example, inline LaTeX within a markdown file

---------

Co-authored-by: Gabriel Nieves <gnievesponce@microsoft.com>
2025-03-31 12:38:01 -04:00
Kenny Zhang
dbd53622ed Merge branch 'main' of github.com:microsoft/graphrag 2025-03-25 15:30:47 -04:00
Alonso Guevara
b7b2b562ce
fnllm version fix (#1835)
* Fix fnllm version

* Semver
2025-03-21 22:13:56 -07:00
Nathan Evans
3b1e70c06b
Update config docs (2.1.0) (#1818)
* Align docs with config

* Semver

* Spelling

* Format

* Spelling
2025-03-18 12:39:30 -07:00
Nathan Evans
813b4de99f
Fix API key reference for gh-pages (#1821) 2025-03-18 11:10:11 -07:00
Nathan Evans
ddc6541ab6
Add docs page about input formats (#1784)
* Add docs page about input formats

* Add json example

* Spelling
2025-03-11 17:37:46 -07:00
Nathan Evans
321d479ab6
Update notebooks for 2.0 (#1785)
* Update API overview

* Fix global search example

* Fix local search example

* Fix global dynamic example

* Fix drift example

* Update multi-index example

* Semver
2025-03-11 17:23:49 -07:00
Alonso Guevara
0d363e6957
Release v2.1.0 (#1800) v2.1.0 2025-03-11 18:16:08 -06:00
Alonso Guevara
53950f8442
Fix/model provider key injection check (#1799)
* Check available models for type validation

* Semver

* Fix ruff and pyright

* Apply feedback
2025-03-11 17:48:30 -06:00
Gabriel Nieves-Ponce
e39d869bed
Added support for verbose logging and csv-metadata to the prompt tune… (#1789)
* Added support for verbose logging and csv-metadata to the prompt tune client.

* Updated community report summarization file name and prompt template

* updated semversioner

* ran ruff linter

* Ran poe format

* Fix Ruff complaints

* Fix a new ruff complaint :P

* Pyright

* Fix tests

---------

Co-authored-by: Gabriel Nieves <gnievesponce@microsoft.com>
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2025-03-11 14:55:02 -06:00
Nathan Evans
66c2cfb3ce
Support JSON input files (#1777)
* Add csv loader tests

* Add test loader tests

* Add json input support

* Remove temp path constraint

* Reuse loader code

* Semver

* Set file pattern automatically based on type, if empty

* Remove pattern from smoke test config

* Spelling

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2025-03-10 14:04:07 -07:00
Kenny Zhang
ed4c77cd01 Merge branch 'main' of github.com:microsoft/graphrag 2025-03-04 13:53:48 -05:00
Nathan Evans
bcb74789f1
Next release docs (#1627)
* Wording updates

* Update yaml config and add notes to "deprecated" env

* Add basic search section

* Update versioning docs

* Minor edits for clarity

* Update init command

* Update init to add --force in docs

* Add NLP extraction params

* Move vector_store to root

* Add workflows to config

* Add FastGraphRAG docs

* add metadata column changes

* Added documentation for multi index search.

* Minor fixes.

* Add config and table renames

* Update migration notebook and comments to specify v1

* Add frequency to entity table docs

* add new chunking options for metadata

* Update output docs

* Minor edits and cleanup

* Add model ids to search configs

* Spruce up migration notebook

* Lint/format multi-index notebook

* SpaCy model note

* Update SpaCy footnote

* Updated multi_index_search.ipynb to remove ruff errors.

* add spacy to dictionary

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
Co-authored-by: Dayenne Souza <ddesouza@microsoft.com>
Co-authored-by: dorbaker <dorbaker@microsoft.com>
2025-03-03 14:46:00 -08:00
Nathan Evans
bd06d8b4f0
Context property bag ("state") (#1774)
* Add pipeline state property bag to run context

* Move state creation out of context util

* Move callbacks into PipelineRunContext

* Semver

* Rename state.json to context.json to avoid confusion with stats.json

* Expand smoke test row count

* Add util to create storage and cache
2025-02-28 09:31:48 -08:00
Nathan Evans
a15942629b
Add more verb tests (#1773)
* Add NLP verb test

* Add finalize_graph tests

* Add more thorough final column assertions
2025-02-27 09:31:46 -08:00