3803 Commits

Author SHA1 Message Date
Madeesh Kannan
906177329b
fix: Enforce basic Python types restriction on serialized component data (#8473) 2024-10-22 17:08:36 +02:00
Alper
a556e11bf1
fix: window_size set during run instead of construction (#8463)
* window_size set during runtime

* revert init and update run with window_size

* improved doc, removed print

* adding release notes

* updating tests

* reverting docstring example

* Update haystack/components/retrievers/sentence_window_retriever.py

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>

* Update haystack/components/retrievers/sentence_window_retriever.py

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>

* Update haystack/components/retrievers/sentence_window_retriever.py

Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>

---------

Co-authored-by: David S. Batista <dsbatista@gmail.com>
Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>
2024-10-22 14:01:26 +00:00
Silvano Cerza
0157459a7b
Pin ddtrace test dependency to fix tests (#8478) 2024-10-22 10:19:25 +00:00
David S. Batista
3a50d35f06
feat: allow Generators to run with a system prompt defined at run time (#8423)
* initial import

* Update haystack/components/generators/openai.py

Co-authored-by: Sebastian Husch Lee <sjrl@users.noreply.github.com>

* docs: fixing

* supporting the three use cases: no system prompt, using system prompt defined at init, using system prompt defined at run time

* renaming 'run_time_system_prompt' to 'system_prompt'

* adding tests, converting methods to static

---------

Co-authored-by: Sebastian Husch Lee <sjrl@users.noreply.github.com>
2024-10-22 11:21:10 +02:00
Stefano Fiorucci
f6935d1456
ci: add pip to test dependencies (#8475)
* add pip to test dependencies

* trigger

* release note

* rm trigger
2024-10-22 08:35:30 +00:00
Stefano Fiorucci
322f63de6d
feat: Logging Tracer (#8447)
* logging tracer: first draft

* progress

* more tests

* license header

* avoid interference with other tests

* release note

* incorporate feedback from review

* Update haystack/tracing/logging_tracer.py

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>

---------

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
2024-10-21 09:47:46 +02:00
Ajit Singh
6cf13e8b98
enhancement: reduced usage of numpy and substituted built-in libraries (#8418)
* reduced usage of numpy and substituted built-in libraries

* added release note

* edited expit function to support both float as well as list (this case was giving error CI)

* revert code , numpy can't be removed here

* more cleaning

* fix relnote

---------

Co-authored-by: anakin87 <stefanofiorucci@gmail.com>
2024-10-18 15:42:19 +02:00
Stefano Fiorucci
ff584f1577
Update CONTRIBUTING.md (#8466) 2024-10-18 15:21:55 +02:00
Stefano Fiorucci
dfd339ca2d
ensure compatibility with huggingface_hub==0.26.0 (#8464) 2024-10-18 08:38:48 +00:00
Ikko Eltociear Ashimine
939698bb18
chore: update readme_sync.yml (#8462)
minor fix
2024-10-18 10:03:13 +02:00
Vladimir Blagojevic
54104e9cc6
Fix variable naming in deserialize_callable (#8461) 2024-10-17 11:41:10 +02:00
Stefano Fiorucci
7788bfe558
ci: upgrade Hatch to 1.13.0 and adopt uv as installer (#8313)
* try uv

* upgrade hatch

* rm unnecessary specification

* release note
2024-10-17 10:32:14 +02:00
jlonge4
78f378b34d
implement additional mime types (#8446)
* implement additional mime types

* correct typo

* reduce complexity

* add optional

* add missing release note

* yamllint

* yamllint

* Update file-router-additional-mime-types-47fe57e6816b83da.yaml

minor reno change for consistency

---------

Co-authored-by: Vladimir Blagojevic <dovlex@gmail.com>
2024-10-16 12:38:49 +02:00
tstadel
8613bb7653
fix: logs containing JSON getting lost (#8456)
* fix: logs getting lost

* add test

* add reno
2024-10-15 14:11:14 +02:00
Alper
b40f0c8b5d
feat: SentenceTransformersTextEmbedder supports config_kwargs (#8432)
* add config_kwargs

* disable PLR0913 for a specific function

* add a release note

* refer to AutoConfig in config_kwargs docstring

---------

Co-authored-by: David S. Batista <dsbatista@gmail.com>
Co-authored-by: Julian Risch <julianrisch@gmx.de>
2024-10-14 16:08:53 +00:00
David S. Batista
b81abc0c85
feat: SentenceTransformersDocumentEmbedder supports config_kwargs (#8433)
* initial import

* adding release notes
2024-10-14 17:43:04 +02:00
David S. Batista
5867fa1f34
fix: whisper transcription test use github url + update test (#8455)
* adding audio file

* changing URL

* updating tests

* temporary removing failing test

* updating tests

* removing failing test

* typo

* linting

* fixing URL

* updating tests
2024-10-14 16:24:52 +02:00
David S. Batista
a50593ede0
fix: whisper tests using audio file from our github repo (#8454)
* adding audio file

* temporary removing failing test

* removing failing test
2024-10-14 12:56:37 +02:00
Madeesh Kannan
e7bfd80f3b
fix: (Temporarily) Re-add suport for pre-2.6.0 YAMLs with PyPDFConverter (#8443) 2024-10-08 14:35:43 +02:00
David S. Batista
3b9a60b377
adding missing import (#8435) 2024-10-02 15:26:05 +02:00
Silvano Cerza
bdc6b2e290
Fixing linting of tracing/datadog.py (#8434) 2024-10-02 10:09:15 +00:00
Madeesh Kannan
ee89f6ad57
fix: PyPDFToDocument correctly serializes custom converters, deprecate DefaultConverter (#8430)
* fix: `PyPDFToDocument` correctly serializes custom converters, deprecate `DefaultConverter`

* Remove `auto` prefix from serde util function names, add unit tests
2024-10-01 16:35:38 +02:00
Julian Risch
08686d90af
feat: Add DocumentNDCGEvaluator component (#8419)
* draft new component and tests

* draft new component and tests

* fix tests, replace usage of get_attr

* improve docstrings, refactor tests

* add test for mixed documents w/wo scores

* add test with multiple lists and update docstring

* validate inputs, add tests, make methods static

* change fallback to binary relevance

* rename validate_init_parameters to validate_inputs
2024-10-01 16:15:02 +02:00
Silvano Cerza
d6f073f9b3
Revert "fix: make pypdf converter more robust (#8427)" (#8428)
This reverts commit d234c75168dcb49866a6714aa232f37d56f72cab.
2024-10-01 11:55:25 +02:00
Tobias Wochinger
d234c75168
fix: make pypdf converter more robust (#8427)
* fix: make `from_dict` of `PyPDFToDocument` more robust

* chore: drop trailing space

* converting method to static and making the comment shorter

* reverting method to static

---------

Co-authored-by: David S. Batista <dsbatista@gmail.com>
2024-09-30 16:47:23 +00:00
Julian Risch
651244225b
fix: fix release note indentation of #8302 (#8426) 2024-09-30 17:17:56 +02:00
Silvano Cerza
cf9a75a73e
Remove deprecated Pipeline init argument debug_path (#8424) 2024-09-30 17:11:49 +02:00
Ajit Singh
2dd8089409
chore: Removed deprecated max_loop_allowed argument from Pipeline init (#8409)
* Added equality check for sender and receiver in connection function of pipeline

* Update base.py

irrelevant changes reverted

* added release note

* removed deprecated param max_loops_allowed from pipeline init

* added release note

* revert non relevant test

* Delete releasenotes/notes/remove-support-to-connect-component-to-self-6eedfb287f2a2a02.yaml

* revery non relevant change

* Remove unused test_pipeline_deprecated.yaml

* Remove PipelineMaxLoops error

* Update release notes

---------

Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>
2024-09-30 15:58:05 +02:00
Ajit Singh
7ba30d5691
feat: Pipeline.connect() will now raise a PipelineConnectError if sender and receiver are the same Component (#8403)
* Added equality check for sender and receiver in connection function of pipeline

* Update base.py

irrelevant changes reverted

* added release note

* altered a walk with cycle test

* added a test to verify that pipeline raises PipelineConnectError when adding a component to itself

* Update release notes

* Remove self connection feature tests

* Tidy up connect unit test

---------

Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>
2024-09-30 15:52:36 +02:00
Haystack Bot
bbfd74fe27
chore: Update unstable version to 2.7.0-rc0 (#8420)
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2024-09-30 11:58:30 +02:00
Amna Mubashar
c68160c299
Expose raise_on_invalid_filter_syntax in haystack.utils (#8413) v2.6.0-rc1 v2.7.0-rc0 2024-09-26 17:01:35 +02:00
Stefano Fiorucci
c682a59f5e
ci: reintroduce macos-latest in tests (#8405)
* try testing with macos-latest

* trigger CI

* remove trigger
2024-09-25 16:43:03 +02:00
Silvano Cerza
29672d4b42
feat: Add JSONConverter Component (#8397)
* Add JSONConverter Component

* Handle some corner cases

* Add JSONConverter to pydoc config

* Add a way to extract all non content fields as metadata

* Small fix in docstring

* Fix tests

* docstrings upd

* Update json.py

---------

Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-09-25 12:34:51 +02:00
Silvano Cerza
1cc705c0fe
chore: Deprecate connecting a Component to itself (#8368)
* Deprecate connecting a Component to itself

* Apply suggestions from code review

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>

---------

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
2024-09-25 09:29:31 +00:00
Silvano Cerza
0df379e6a2
feat: Deprecate @component decorator is_greedy argument (#8400)
* Deprecate @component decorator is_greedy argument

* Fix some typos and docstrings

* Add _is_lazy_variadic test
2024-09-25 11:28:30 +02:00
Stefano Fiorucci
2cc76beacd
fix: fix PredefinedPipeline.CHAT_WITH_WEBSITE template (#8401)
* fix pipeline template

* reno

* update to gpt-4o-mini
2024-09-25 10:06:30 +02:00
Ikko Eltociear Ashimine
ef60016d79
chore: update pipeline.py (#8285)
minor fix
2024-09-24 16:17:31 +02:00
Sebastian Husch Lee
74f7c6fdfb
Set max_runs_per_component to 1 for pipelines that are linear (#8393) 2024-09-24 14:44:45 +02:00
Silvano Cerza
b1fe267dd3
Fix codespell pre commit hook (#8396) 2024-09-24 13:23:55 +02:00
Silvano Cerza
4b77ec1b6f
Fix codespell config (#8392) 2024-09-24 12:00:45 +02:00
Vladimir Blagojevic
09b95746a2
feat: HuggingFaceAPIChatGenerator add token usage data (#8375)
* Ensure HuggingFaceAPIChatGenerator has token usage data

* Add reno note

* Fix release note
2024-09-23 15:40:50 +02:00
Tuana Çelik
0b7f1fd114
chore: Update cookbook url in README.md (#8387)
Adding the new cookbook url
2024-09-20 14:26:45 +02:00
Sriniketh J
066e2e3ec5
Make api_key param optional in LLMEvaluator (#8340) 2024-09-20 10:47:13 +02:00
Silvano Cerza
b6cde2414b
Remove unused benchmark.yml workflow (#8380) 2024-09-19 18:34:14 +02:00
Sebastian Husch Lee
2235ce673f
test: Move pipeline test to behavorials (#8377) 2024-09-19 16:59:35 +02:00
Madeesh Kannan
151bd531a5
feat: Add utility function to check metadata filter syntax (#8386) 2024-09-19 15:03:04 +02:00
Daria Fokina
caf465b004
docs: add NLTKSplitter and ZeroShotClassifier to pydocs (#8384)
* Update preprocessors_api.yml

* Update classifiers_api.yml
2024-09-18 15:55:40 +02:00
Vladimir Blagojevic
514e0abc39
fix: Fix nltk imports (#8381) 2024-09-18 11:25:21 +00:00
Madeesh Kannan
b22014b915
fix: Prevent set_output_types from being called when the output_types decorator is used (#8376) 2024-09-18 13:05:31 +02:00
Vladimir Blagojevic
117c298145
NLTKDocumentSplitter: add 'function' split_by option (#8378) 2024-09-17 16:18:39 +02:00