ZanSara
cf79aa1485
feat: add support for single meta dict in TextFileToDocument
( #6606 )
...
* add support for single meta dict
* reno
* reno
* mypy
* extract to function
* docstring
* mypy
2023-12-21 14:21:17 +01:00
Stefano Fiorucci
7cc6080dfa
chore: replace metadata w meta in tests/examples ( #6612 )
...
* replace metadata w meta in tests/examples
* do not touch already broken e2e tests
* Revert "do not touch already broken e2e tests"
This reverts commit 1f911920d98954b57daacfe8d8ed02fd77d136db.
2023-12-21 14:09:31 +01:00
sahusiddharth
3d17e6ff76
changed metadata to meta ( #6605 )
2023-12-21 12:39:58 +01:00
Ashwin Mathur
fc88ef7076
feat: Add HuggingFace TEI Embedders - HuggingFaceTEITextEmbedder
and HuggingFaceTEIDocumentEmbedder
( #6602 )
...
* Add TEI Embedders
* Add release notes
* Update release notes with usage examples
2023-12-21 12:16:36 +01:00
Silvano Cerza
8a513f3b8c
test: Add fixture to block requests in tests ( #6585 )
...
* Add fixture to block requests in tests
* Mark tests making requests as integration
2023-12-21 08:51:54 +01:00
Silvano Cerza
5546c8144e
ci: Speed up tests.yml by caching dependencies ( #6417 )
...
* Speed up tests.yml by caching dependencies
* Trigger for testing
* Use restore only action to speedup restoring
* Use bash shell to get pip cache dir
* Set shell for caching step
* Cache correct path
* Remove trigger
2023-12-20 16:21:48 +01:00
Daria Fokina
a64cf25148
docs: fix formatting ( #6599 )
2023-12-20 12:30:27 +01:00
ZanSara
ae5297bfd7
example: self-correcting loop for RAG ( #6420 )
...
* add example
* docstrings
* reno
* use condrouter
* move functions
* tests
* reno
* add component
* reno
* add tests
* mypy
* pylint
* logger
* module name
* multiplexer
* draw
* query_multiplexer
* reno
* typo
2023-12-20 11:35:05 +01:00
ZanSara
5a68bb10c1
add doc config for others module ( #6604 )
2023-12-20 11:34:54 +01:00
ZanSara
5a0f0ce22f
feat: Multiplexer
( #6592 )
...
* move functions
* tests
* reno
* add component
* reno
* add tests
* mypy
* pylint
* logger
* module name
2023-12-20 11:03:22 +01:00
Silvano Cerza
e836fd6875
fix: Fix Pipeline.connect()
when multiple compatible sockets are found ( #6594 )
...
* Fix connect not picking the correct socket
* Add release notes
2023-12-20 11:01:18 +01:00
Silvano Cerza
f224f991be
Change DocumentWriter default policy from DuplicatePolicy.FAIL to DuplicatePolicy.NONE ( #6596 )
2023-12-19 17:46:16 +01:00
ZanSara
f877704839
chore: extract type serialization ( #6586 )
...
* move functions
* tests
* reno
2023-12-19 14:16:20 +01:00
Vladimir Blagojevic
2dd5a94b04
feat: Add RAG based OpenAPI service integration ( #6555 )
...
* Add OpenAPIServiceConnector and OpenAPIServiceToFunctions
* Add release note
* Add test deps
* Better docs on OpenAPI spec reqs, improve tests
* Silvano PR feedback
2023-12-19 13:27:41 +01:00
Stefano Fiorucci
94cfe5d9ae
feat!: HTMLToDocument
- allow choosing the boilerpy3 extractor ( #6582 )
...
* allow extractor customizability
* release note
* typo
2023-12-19 10:52:12 +01:00
Sebastian Husch Lee
dcf37c5173
feat: Extractive QA answer deduplication ( #6459 )
...
* Add answer deduplication
* Fix test
* Handle None case
* Release notes
* Handle cases where documents or answer spans could be None
* Adding checks for Nones and satisfying mypy
* Add option to turn off deduplication
* Adding unit tests
* Refactored tests to use fixtures
* Added overlap_threshold to run
* Update test
* Fixes related to the merge
* Remove casting, use direct variable names
* Move out if statement and add new test for it
* Update if statement to match comment
* Update how if statements work
2023-12-18 19:27:04 +01:00
Sebastian Husch Lee
c294b8ac8c
feat: Add auto device checks and model_kwargs
to TransformersSimilarityRanker
( #6561 )
...
* Add device checking and model_kwargs like we do in ExtractiveReader
* Add release notes
* Make a utility function for the device checking
* Better warning message and updated ExtractiveReader to use the util function
* Add unit tests for get_device
* Fix pylint
2023-12-18 15:13:42 +01:00
Ashwin Mathur
46b395eec3
feat: Add Eval and EvaluationResult ( #6505 )
...
* Add initial implementation for Eval and EvaluationResult
* Add release notes
* Update files with suggestions from review
* Remove serialization
* Add eval e2e tests
* Update eval e2e tests
2023-12-18 11:29:09 +01:00
Sebastian Husch Lee
3e0e81b1e0
feat: Add meta_fields_to_embed
to TransformersSimilarityRanker
( #6564 )
...
* Add initial implementation following SentenceTransformersDocumentEmbedder
* Add test for embedding metadata
* Add release notes
* Update name
* Fix tests and to dict
* Fix release notes
2023-12-18 11:28:16 +01:00
Massimiliano Pippi
0ac1bdc6a0
refactor!: uniform run api for LocalWhisperTranscriber ( #6542 )
...
* uniform run api for LocalWhisperTranscriber
* add relnote
* fix linter
2023-12-18 10:47:46 +01:00
Massimiliano Pippi
00fed32024
build: depend on haystack_bm25
instead of rank_bm25
( #6578 )
...
* use the forked package
* switch package dependency
* relnote
* fix package name
2023-12-18 10:47:15 +01:00
Massimiliano Pippi
4fdbcfaf56
bump version ( #6569 )
v2.0.0-beta.3
2023-12-15 17:54:39 +01:00
Massimiliano Pippi
3b4c461bfc
use the right version ( #6568 )
2023-12-15 17:48:12 +01:00
Stefano Fiorucci
f583102932
docs: Converters - add usage examples ( #6556 )
...
* standardize converters inputs: first draft
* fix precommit
* fix precommit 2
* fix precommit 3
* add default for optional param
* rm leftover
* install boilerpy in linting workflow
* add boilerpy3 to the core dependencies
* add reno
* remove boilerpy3 installation from test workflow
* fix pylint: import order and unused import
* fix import order
* add release note
* better Tika docstring
* rm boilerpy from linting
* leftover
* first impl for html
* progressing on other components
* fix test
* add tests - run with meta
* release note
* reintroduce patches wrongly deleted
* add patch in test
* fix tika test
* add usage examples
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
2023-12-15 17:06:21 +01:00
Stefano Fiorucci
2f034d3c97
refactor!: Converters - standardize inputs ( #6540 )
...
* standardize converters inputs: first draft
* fix precommit
* fix precommit 2
* fix precommit 3
* add default for optional param
* rm leftover
* install boilerpy in linting workflow
* add boilerpy3 to the core dependencies
* add reno
* remove boilerpy3 installation from test workflow
* fix pylint: import order and unused import
* fix import order
* add release note
* better Tika docstring
* rm boilerpy from linting
* leftover
* md link brackets
* feat: Converters - allow passing `meta` in the `run` method (#6554 )
* first impl for html
* progressing on other components
* fix test
* add tests - run with meta
* release note
* reintroduce patches wrongly deleted
* add patch in test
* fix tika test
* Update haystack/components/converters/azure.py
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
* Update releasenotes/notes/converters-standardize-inputs-ed2ba9c97b762974.yaml
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
* simplify test
---------
Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
2023-12-15 16:41:35 +01:00
Stefano Fiorucci
cf47abdff5
chore: simplify the management of test dependencies ( #6559 )
...
* remove audio dep group
* extract dependencies
* beautify
* rm one step
2023-12-15 16:40:41 +01:00
Massimiliano Pippi
ae1f39a76e
custom release creator for beta versions ( #6552 )
2023-12-15 16:27:41 +01:00
dependabot[bot]
8ea891f357
chore(deps): bump actions/upload-artifact from 3 to 4 ( #6565 )
...
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact ) from 3 to 4.
- [Release notes](https://github.com/actions/upload-artifact/releases )
- [Commits](https://github.com/actions/upload-artifact/compare/v3...v4 )
---
updated-dependencies:
- dependency-name: actions/upload-artifact
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-12-15 16:14:07 +01:00
Vladimir Blagojevic
c642695ec0
feat: Add FileTypeRouter markdown support ( #6551 )
...
* Add FileTypeRouter markdown support
* Add releae note
2023-12-14 16:30:57 +01:00
Vladimir Blagojevic
3b172b0476
Clarify DynamicPromptBuilder Raison d’Être ( #6549 )
2023-12-14 14:53:00 +01:00
Massimiliano Pippi
bc45170f4e
chore: add boilerpy3 to the core dependencies ( #6544 )
...
* add boilerpy3 to the core dependencies
* remove boilerpy3 installation from test workflow
* fix pylint: import order and unused import
* fix import order
* add release note
---------
Co-authored-by: Julian Risch <julian.risch@deepset.ai>
2023-12-14 11:53:38 +01:00
Tuana Çelik
0ec2801c72
Fixing incorrect docstring ( #6536 )
2023-12-14 11:35:33 +01:00
ZanSara
d0c115fc9d
fix: make minor_version_release.yml
to run on the v1.x branch ( #6547 )
...
* Fix minor_version_release.yml to run on the v1.x branch
* review
2023-12-14 11:34:18 +01:00
Massimiliano Pippi
923efd88b2
chore: add version label to PRs ( #6548 )
2023-12-14 11:28:00 +01:00
Massimiliano Pippi
8d9c3de37e
Remove 'preview' from the release notes template ( #6543 )
2023-12-14 09:59:48 +01:00
Massimiliano Pippi
9867dd188a
update labeler to latest version ( #6541 )
2023-12-13 21:21:29 +01:00
Massimiliano Pippi
a55024bee7
fix: do not dump pipeline graph into the debug payload ( #6528 )
2023-12-12 18:24:23 +01:00
Massimiliano Pippi
09abcc1d4c
allow connecting the same components multiple times ( #6530 )
2023-12-12 16:01:09 +01:00
Julian Risch
25a6eaae05
feat!: Rename ExtractiveReader's confidence_threshold
to score_threshold
( #6532 )
...
* rename to score_threshold
* Update haystack/components/readers/extractive.py
Co-authored-by: Stefano Fiorucci <44616784+anakin87@users.noreply.github.com>
---------
Co-authored-by: Stefano Fiorucci <44616784+anakin87@users.noreply.github.com>
2023-12-12 15:12:28 +01:00
Silvano Cerza
82fe80ce68
Remove old Pylint plugin ( #6527 )
2023-12-12 09:59:01 +01:00
Silvano Cerza
18dbce25fc
refacotr: Refactor answer dataclasses ( #6523 )
...
* Refactor answer dataclasses
* Add release notes
* Fix tests
* Fix end to end tests
* Enhance ExtractiveReader
2023-12-11 18:50:49 +01:00
dependabot[bot]
820d9c37d5
chore(deps): bump actions/stale from 8 to 9 ( #6509 )
...
Bumps [actions/stale](https://github.com/actions/stale ) from 8 to 9.
- [Release notes](https://github.com/actions/stale/releases )
- [Changelog](https://github.com/actions/stale/blob/main/CHANGELOG.md )
- [Commits](https://github.com/actions/stale/compare/v8...v9 )
---
updated-dependencies:
- dependency-name: actions/stale
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-12-11 08:35:40 +01:00
Vladimir Blagojevic
628e8aa3d4
feat: Improve getting started examples ( #6510 )
...
* Improve rag and indexing pipelines
* Update examples
* Simplify user interface and code, improve embedder model
* Improve default vals for embedder
* resolve typing
* resolve typing 2
* Fix unit test
---------
Co-authored-by: Timo Möller <timo.moeller@deepset.ai>
2023-12-09 19:01:13 +01:00
bogdankostic
728383a149
fix: Make TransformersSimilarityRanker
run with single document list ( #6503 )
...
* Make `TransformersSimilarityRanker` run with single document list
* Add release note
* Remove unused import in test
2023-12-08 16:18:46 +01:00
Ashwin Mathur
2767cd2f01
Fix usage examples ( #6507 )
2023-12-07 14:01:32 +01:00
dependabot[bot]
51b49b838c
chore(deps): bump actions/setup-python from 4 to 5 ( #6498 )
...
Bumps [actions/setup-python](https://github.com/actions/setup-python ) from 4 to 5.
- [Release notes](https://github.com/actions/setup-python/releases )
- [Commits](https://github.com/actions/setup-python/compare/v4...v5 )
---
updated-dependencies:
- dependency-name: actions/setup-python
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2023-12-06 22:33:05 +01:00
Silvano Cerza
5f06e93573
Bump version to v2.0.0-beta.2 ( #6502 )
v2.0.0-beta.2
2023-12-06 18:30:23 +01:00
Julian Risch
dbe2319cae
feat: Add posthog api key for telemetry ( #6500 )
2023-12-06 18:18:16 +01:00
Massimiliano Pippi
38bd367b74
fix pydoc config ( #6496 )
2023-12-06 15:02:07 +01:00
Stefano Fiorucci
b520c77f67
ci: reintroduce readme_sync workflow ( #6490 )
...
* reintroduce readme_sync
* minimal workflow
* fix wrong id
* remove reference to branches other than main
2023-12-06 13:11:25 +01:00