24 Commits

Author SHA1 Message Date
JunHo Kim (김준호)
ea468204bc
Fix typo in documentation for customizability (#1160)
Corrected a misspelling of 'customizability' in the env_vars.md documentation. This change ensures clarity and accuracy in the description of input data handling configurations.

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-09-20 14:52:44 -06:00
Derek Worthen
3b09df6e07
Migrate towards using static output directories (#1113)
* Migrate towards using static output directories

- Fixes load_config eagering resolving directories.
    Directories are only resolved when the output
    directories are local.
- Add support for `--output` and `--reporting` flags
    for index CLI. To achieve previous output structure
    `index --output run1/artifacts --reports run1/reports`.
- Use static output directories when initializing
    a new project.
- Maintains backward compatibility for those using
    timestamp outputs locally.

* fix smoke tests

* update query cli to work with static directories

* remove eager path resolution from load_config. Support CLI overrides that can be resolved.

* add docs and output logs/artifacts to same directory

* use match statement

* switch back to if statement

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-09-18 17:36:50 -06:00
JunHo Kim (김준호)
7b8f5ba51f
Correct links to datashaper verbs in comments (#1068)
Correct links to verbs in comments

Updated the links in comments to reflect new paths for 'derive' and 'aggregate' verbs. This improves documentation and ensures that references are up to date for future developers.

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-09-12 12:44:38 -06:00
Alonso Guevara
3f9800230f
Fix img width (#1061) 2024-08-29 17:02:47 -06:00
Alonso Guevara
7ffce8d7ba
Fix img for autotune (#1060)
* Fix img for autotune

* Add line breaks to tune docs

* More line breaks
2024-08-29 16:56:34 -06:00
Alonso Guevara
6fc452b954
Update bash example in docs for prompt tune (#1059)
* Semver

* Update bash command
2024-08-29 16:35:32 -06:00
Alonso Guevara
e023882033
Update Prompt Tuning docs (#1057)
* Update Prompt Tuning docs

* Semver
2024-08-29 16:00:07 -06:00
Ikko Eltociear Ashimine
26bcdf39ed
docs: update manual_prompt_tuning.md (#963)
paramater -> parameter

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-08-28 17:49:35 -06:00
Alonso Guevara
4c2f5376a8
Add missing config parameter for prompt tuning docs (#1017) 2024-08-26 14:38:59 -06:00
Josh Bradley
62546a3c14
Add streaming support for local/global search (#944)
* Added streaming output support for global search. Introduce `--streaming` flag to enable or disable streaming mode

* ran ruff format --preview

* update

* cleanup code and streaming api

* update cli argument

* remove whitespace

* checkpoint - add context data to streaming api

* cleanup help menu

* ruff format update

* add context data to streaming response

* add semversioner file

* rename variable for better readability

* rename variable for better readability

* ruff fixes

* fix abstract class type annotation

* add documentation for --streaming CLI flag

---------

Co-authored-by: 6GOD <55304045+6ixGODD@users.noreply.github.com>
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-08-20 13:44:48 -06:00
Nayeon Kim
84f9bae129
Update 0-architecture.md (#961) 2024-08-19 12:21:40 -06:00
Nathan Evans
c749fe2a15
Docs updates aug06 (#852)
* Remove outdated references to entity resolution

* Clarify covariate extraction

* Minor edits from other PR feedback

* Remove duplicate line

* Semver

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-08-06 16:31:47 -07:00
Gabriel Nieves-Ponce
d26491a622
Gnievesponce/query client vectore store (#771)
* added default title_column and collection_name values for workflows using the vector store option

* incorporated vector database support to the query client

* Updated docuemnatation to reflect the new query client param.

* Fixed ruff formatting

* added new poetry lock file

---------

Co-authored-by: Gabriel Nieves-Ponce <gnievesponce@microsoft.com>
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-07-30 17:59:04 -06:00
Chris Trevino
9d99f323ea
Add encoding model to entity/claim extraction config sections (#740)
* Add encoding-model configuration to entity & claim extraction

* add change note

* pr updates

* test fix

* disable GH-based smoke tests
2024-07-26 15:05:08 -07:00
Chris Trevino
4c229afec8
add encoding model to text-chunking config (#743)
* add encoding model to text-chunking config

* revert groupby fix, handled in other pr

* revert environment reader update for other pr
2024-07-26 14:15:17 -07:00
Alonso Guevara
7a9c9071c1
Change defaults for CHUNK_SIZE, CHUNK_OVERLAP and GLEANINGS (#499)
* enlarge the chunk size to reduce the cost and time greatly to 1/4

* Change default gleanings, chunk_size and chunk_overlap

* Update patch-20240710114442871595.json

---------

Co-authored-by: KylinMountain <kose2livs@gmail.com>
2024-07-11 10:22:27 -06:00
Alonso Guevara
c3852b0749
Fix/prompt tuning report rating (#496)
* add community report ranking to cli

* use tuned ratings in generated community report rankings

* update report generation template

* add new import paths to init

* add community report generation prompt and util function

* add rating generator, update init paths

* add report rating generation prompt

* update report generation template

* typo

* add semantic release

* Add missing docstring

* Format

* is is

* Format ruff

---------

Co-authored-by: Julian Whiting <j2whitin@gmail.com>
2024-07-10 15:55:21 -06:00
Alonso Guevara
aff2f79ae2
Feat/language in autotemplating (#468)
* Add language parameter on prompt tuning

* Semver and docs

* Fix typo
2024-07-10 13:56:54 -06:00
YangJiaqi
b1d1cb9b4b
Update auto_prompt_tuning.md (#406)
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-07-09 07:40:18 -06:00
Kylin
e2572c7fab
[bug fix]Fix community_report config doesn't work in settings.yaml (#405)
* fix community_report doesn't work in settings.yaml

* add semversioner

* fix unittest about community report to community reports of env

---------

Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-07-08 22:48:02 -06:00
Alonso Guevara
b912081f1b
Add N parameter support (#390)
* Add N parameter support

* Fix unit tests

* Add new env vars to param testing
2024-07-08 14:04:49 -06:00
AbdulHamid Merii
a22003c302
Fix/syntax error in overview.md (#364)
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-07-05 13:50:36 -06:00
Zi Nean Teoh
f865e66fa2
Fix dead link due to typo (#320)
Co-authored-by: Alonso Guevara <alonsog@microsoft.com>
2024-07-02 15:23:41 -06:00
Alonso Guevara
81b81cf60b Initial Release 2024-07-01 15:25:30 -06:00