Corrected a misspelling of 'customizability' in the env_vars.md documentation. This change ensures clarity and accuracy in the description of input data handling configurations.
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* Migrate towards using static output directories
- Fixes load_config eagerly resolving directories.
Directories are only resolved when the output
directories are local.
- Add support for `--output` and `--reporting` flags
for the index CLI. To reproduce the previous output structure, run
`index --output run1/artifacts --reports run1/reports`.
- Use static output directories when initializing
a new project.
- Maintains backward compatibility for those using
timestamp outputs locally.
* fix smoke tests
* update query cli to work with static directories
* remove eager path resolution from load_config. Support CLI overrides that can be resolved.
* add docs and output logs/artifacts to same directory
* use match statement
* switch back to if statement
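
The lazy-resolution behavior described in the bullets above could look roughly like the sketch below. It is an illustration only, not the project's implementation: `resolve_output_dir`, the `storage_type` values, and the `${timestamp}` placeholder are all assumptions made for this example.

```python
from datetime import datetime
from pathlib import Path
from string import Template
from typing import Optional


def resolve_output_dir(
    base_dir: str,
    root: str,
    storage_type: str,
    now: Optional[datetime] = None,
) -> str:
    """Hypothetical helper: resolve an output directory only for local storage."""
    if storage_type != "file":
        # Non-local storage (e.g. blob): return the configured value untouched.
        return base_dir
    stamp = (now or datetime.now()).strftime("%Y%m%d-%H%M%S")
    # safe_substitute leaves static paths (no placeholder) unchanged, so both
    # static and timestamped layouts go through the same code path.
    expanded = Template(base_dir).safe_substitute(timestamp=stamp)
    return str((Path(root) / expanded).resolve())
```

Under these assumptions, a static value such as `output` and a timestamped value such as `output/${timestamp}/artifacts` both resolve against the project root, while a non-local configuration is passed through as written.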
---------
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
Correct links to verbs in comments
Updated the links in comments to reflect new paths for 'derive' and 'aggregate' verbs. This improves documentation and ensures that references are up to date for future developers.
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* Add streaming output support for global search. Introduce a `--streaming` flag to enable or disable streaming mode
* ran ruff format --preview
* update
* cleanup code and streaming api
* update cli argument
* remove whitespace
* checkpoint - add context data to streaming api
* cleanup help menu
* ruff format update
* add context data to streaming response
* add semversioner file
* rename variable for better readability
* rename variable for better readability
* ruff fixes
* fix abstract class type annotation
* add documentation for --streaming CLI flag
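
A minimal sketch of how a streaming response might carry context data alongside the generated text. The names `stream_response` and `consume` are hypothetical, and the choice to yield the context as the first item is an assumption of this example, not a description of the actual streaming API.

```python
from typing import Any, Dict, Iterable, Iterator, Tuple, Union


def stream_response(
    context: Dict[str, Any], chunks: Iterable[str]
) -> Iterator[Union[Dict[str, Any], str]]:
    """Hypothetical generator: yield the context data once, then text chunks."""
    yield context
    for chunk in chunks:
        yield chunk


def consume(
    stream: Iterator[Union[Dict[str, Any], str]],
) -> Tuple[Dict[str, Any], str]:
    """Read the context first, then join the remaining text chunks."""
    context = next(stream)
    text = "".join(stream)
    return context, text
```

A caller that wants incremental output would print each chunk as it arrives instead of joining them; `consume` only shows how the two kinds of items can be separated.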
---------
Co-authored-by: 6GOD <55304045+6ixGODD@users.noreply.github.com>
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* Remove outdated references to entity resolution
* Clarify covariate extraction
* Minor edits from other PR feedback
* Remove duplicate line
* Semver
---------
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* added default title_column and collection_name values for workflows using the vector store option
* incorporated vector database support to the query client
* Updated documentation to reflect the new query client param.
* Fixed ruff formatting
* added new poetry lock file
---------
Co-authored-by: Gabriel Nieves-Ponce <gnievesponce@microsoft.com>
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
* enlarge the chunk size, greatly reducing cost and time (to about 1/4)
* Change default gleanings, chunk_size and chunk_overlap
* Update patch-20240710114442871595.json
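
To illustrate why a larger chunk size lowers cost, the sketch below counts how many sliding-window chunks a document produces. `count_chunks` is a hypothetical helper and the token counts in the usage note are illustrative values, not the project's old or new defaults; the sketch also assumes the number of LLM calls grows with the number of chunks.

```python
def count_chunks(n_tokens: int, chunk_size: int, chunk_overlap: int) -> int:
    """Hypothetical helper: number of sliding-window chunks covering n_tokens."""
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must satisfy 0 <= chunk_overlap < chunk_size")
    if n_tokens <= chunk_size:
        return 1
    step = chunk_size - chunk_overlap
    # Ceiling division: windows needed after the first one to reach the end.
    return -(-(n_tokens - chunk_size) // step) + 1
```

With no overlap, a 12000-token document yields 40 chunks at size 300 and 10 chunks at size 1200, a fourfold reduction. With an overlap of 100 the same document yields 60 and 11 chunks respectively.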
---------
Co-authored-by: KylinMountain <kose2livs@gmail.com>
* fix community_report not working in settings.yaml
* add semversioner
* fix unit test to use community reports instead of community report in env
---------
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>