haystack

mirror of https://github.com/deepset-ai/haystack.git synced 2025-12-06 11:57:14 +00:00

Author	SHA1	Message	Date
Stefano Fiorucci	35fb6c6f01	feat: improve `AnswerBuilder` to support showing RAG references (#9933 ) * draft * improve * refactor * improvs + usage ex * relnote * pipeline test fix	2025-10-24 12:47:16 +02:00
Arya Tayshete	f8d6757eab	feat(converters): CSVToDocument supports row-level conversion (#9773 ) * feat(converters): CSVToDocument row-level conversion (content_column, columns→meta) + tests + releasenote Signed-off-by: Arya Tayshete <avtayshete_b21@et.vjti.ac.in> * feat(converters): CSVToDocument row-mode hardening + tests Signed-off-by: Arya Tayshete <avtayshete_b21@et.vjti.ac.in> * test(converters): remove long commented line to satisfy ruff E501 Signed-off-by: Arya Tayshete <avtayshete_b21@et.vjti.ac.in> * fix(converters): avoid infinite loop Signed-off-by: Arya Tayshete <avtayshete_b21@et.vjti.ac.in> * feat(converters): require content_column in run() for row mode; remove fallbacks; improve docstrings; update tests Signed-off-by: Arya Tayshete <avtayshete_b21@et.vjti.ac.in> * feat(converters): content_column required in run method instead of init Signed-off-by: Arya Tayshete <avtayshete_b21@et.vjti.ac.in> * feat(csv): row-mode with required run() arg ; update BDD pipeline tests --------- Signed-off-by: Arya Tayshete <avtayshete_b21@et.vjti.ac.in>	2025-10-09 13:15:51 +00:00
Abdelrahman Kaseb	5f3c37d287	chore: adopt PEP 585 type hints (#9678 ) * chore(lint): enforce and apply PEP 585 type hinting * Run fmt fixes * Fix all typing imports using some regex * Fix all typing written in string in tests * undo changes in the e2e tests * make e2e test use list instead of List * type fixes * remove type:ignore * pylint * Remove typing from Usage example comments * Remove typing from most of comments * try to fix e2e tests on comm PRs * fix * Add tests typing.List in to adjust test compatiplity - test/components/agents/test_state_class.py - test/components/converters/test_output_adapter.py - test/components/joiners/test_list_joiner.py * simplify pyproject * improve relnote --------- Co-authored-by: anakin87 <stefanofiorucci@gmail.com>	2025-08-07 10:23:14 +02:00
Sebastian Husch Lee	7a63559faf	feat: Raise warning if the pipeline is unable to continue running due to a blocked component (#9569 ) * First pass at alerting a user that the pipeline is blocked * Change approach and add change to async pipeline * Fix check in run_async * Another fix * Somehow also fixed the max_runs_per_component * Align sync run and async run component * Update output type of component outputs to Mapping to align with our protocol * linting * ruff * Make it a core test * Add reno * Some refactoring * Linting * Fix mypy * Converting to warning * Small changes * Update release note * More cleanup * PR comment * PR comments	2025-07-15 14:02:39 +00:00
Sebastian Husch Lee	85258f0654	fix: Fix types and formatting pipeline test_run.py (#9575 ) * Fix types in test_run.py * Get test_run.py to pass fmt-check * Add test_run to mypy checks * Update test folder to pass ruff linting * Fix merge * Fix HF tests * Fix hf test * Try to fix tests * Another attempt * minor fix * fix SentenceTransformersDiversityRanker * skip integrations tests due to model unavailable on HF inference --------- Co-authored-by: anakin87 <stefanofiorucci@gmail.com>	2025-07-03 09:49:09 +02:00
David S. Batista	da60156174	chore: removing unused imports from tests (#9446 )	2025-05-26 16:22:51 +00:00
Vladimir Blagojevic	167229f328	feat: Extend AnswerBuilder for Agent (#9406 ) * Extend AnswerBuilder for Agent * Update tests * Add reno note * PR feedback * Add a better unit test * Update haystack/components/builders/answer_builder.py Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com> * Update haystack/components/builders/answer_builder.py Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com> * PR feedback * Remove copy --------- Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>	2025-05-22 14:32:36 +02:00
Stefano Fiorucci	f3c44be904	refactor!: remove `dataframe` field from `Document` and `ExtractedTableAnswer`; make `pandas` optional (#8906 ) * remove dataframe * release note * small fix * group imports * Update pyproject.toml Co-authored-by: Julian Risch <julian.risch@deepset.ai> * Update pyproject.toml Co-authored-by: Julian Risch <julian.risch@deepset.ai> * address feedback --------- Co-authored-by: Julian Risch <julian.risch@deepset.ai>	2025-03-04 11:06:07 +00:00
mathislucka	ee81570f37	fix: only overwrite existing socket inputs when we provide a new value (#8940 ) * fix: only overwrite existing socket inputs when we provide a new value * chore: add release notes * Apply suggestions from code review --------- Co-authored-by: Julian Risch <julian.risch@deepset.ai>	2025-02-27 09:13:41 +00:00
mathislucka	76753fd4c6	fix: reduce number of edge cases where lazy variadic components wait for inputs that can't arrive anymore (#8907 ) * wip * fix: running order with lazy variadic components * fix: tests * format * comment * fix: alternative approach to fixing running order * unused imports * revert fix * remove unneeded return * remove data based approach to tie breaking * release note * trailing spaces * newline eof * unused import * add more explanations to release note	2025-02-24 15:17:17 +00:00
mathislucka	8c54f06a19	fix: component checks failing for components that return dataframes (#8873 ) * fix: use is not to compare to sentinel value * chore: release notes * Update releasenotes/notes/fix-component-checks-with-ambiguous-truth-values-949c447b3702e427.yaml Co-authored-by: David S. Batista <dsbatista@gmail.com> * fix: another sentinel value * test: also test base class * add pandas as test dependency * format * Trigger CI * mark test with xfail strict=False --------- Co-authored-by: Sebastian Husch Lee <sjrl@users.noreply.github.com> Co-authored-by: David S. Batista <dsbatista@gmail.com> Co-authored-by: anakin87 <stefanofiorucci@gmail.com>	2025-02-19 09:10:48 +00:00
mathislucka	e5b9bdeb66	feat: AsyncPipeline that can schedule components to run concurrently (#8812 ) * add component checks * pipeline should run deterministically * add FIFOQueue * add agent tests * add order dependent tests * run new tests * remove code that is not needed * test: intermediate from cycle outputs are available outside cycle * add tests for component checks (Claude) * adapt tests for component checks (o1 review) * chore: format * remove tests that aren't needed anymore * add _calculate_priority tests * revert accidental change in pyproject.toml * test format conversion * adapt to naming convention * chore: proper docstrings and type hints for PQ * format * add more unit tests * rm unneeded comments * test input consumption * lint * fix: docstrings * lint * format * format * fix license header * fix license header * add component run tests * fix: pass correct input format to tracing * fix types * format * format * types * add defaults from Socket instead of signature - otherwise components with dynamic inputs would fail * fix test names * still wait for optional inputs on greedy variadic sockets - mirrors previous behavior * fix format * wip: warn for ambiguous running order * wip: alternative warning * fix license header * make code more readable Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com> * Introduce content tracing to a behavioral test * Fixing linting * Remove debug print statements * Fix tracer tests * remove print * test: test for component inputs * test: remove testing for run order * chore: update component checks from experimental * chore: update pipeline and base from experimental * refactor: remove unused method * refactor: remove unused method * refactor: outdated comment * refactor: inputs state is updated as side effect - to prepare for AsyncPipeline implementation * format * test: add file conversion test * format * fix: original implementation deepcopies outputs * lint * fix: from_dict was updated * fix: format * fix: test * test: add test for thread safety * remove unused imports * format * test: FIFOPriorityQueue * chore: add release note * feat: add AsyncPipeline * chore: Add release notes * fix: format * debug: switch run order to debug ubuntu and windows tests * fix: consider priorities of other components while waiting for DEFER * refactor: simplify code * fix: resolve merge conflict with mermaid changes * fix: format * fix: remove unused import * refactor: rename to avoid accidental conflicts * fix: track pipeline type * fix: and extend test * fix: format * style: sort alphabetically * Update test/core/pipeline/features/conftest.py Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com> * Update test/core/pipeline/features/conftest.py Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com> * Update releasenotes/notes/feat-async-pipeline-338856a142e1318c.yaml * fix: indentation, do not close loop * fix: use asyncio.run * fix: format --------- Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com> Co-authored-by: David S. Batista <dsbatista@gmail.com>	2025-02-07 16:37:29 +01:00
mathislucka	eec91824bc	fix: pipeline run bugs in cyclic and acyclic pipelines (#8707 ) * add component checks * pipeline should run deterministically * add FIFOQueue * add agent tests * add order dependent tests * run new tests * remove code that is not needed * test: intermediate from cycle outputs are available outside cycle * add tests for component checks (Claude) * adapt tests for component checks (o1 review) * chore: format * remove tests that aren't needed anymore * add _calculate_priority tests * revert accidental change in pyproject.toml * test format conversion * adapt to naming convention * chore: proper docstrings and type hints for PQ * format * add more unit tests * rm unneeded comments * test input consumption * lint * fix: docstrings * lint * format * format * fix license header * fix license header * add component run tests * fix: pass correct input format to tracing * fix types * format * format * types * add defaults from Socket instead of signature - otherwise components with dynamic inputs would fail * fix test names * still wait for optional inputs on greedy variadic sockets - mirrors previous behavior * fix format * wip: warn for ambiguous running order * wip: alternative warning * fix license header * make code more readable Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com> * Introduce content tracing to a behavioral test * Fixing linting * Remove debug print statements * Fix tracer tests * remove print * test: test for component inputs * test: remove testing for run order * chore: update component checks from experimental * chore: update pipeline and base from experimental * refactor: remove unused method * refactor: remove unused method * refactor: outdated comment * refactor: inputs state is updated as side effect - to prepare for AsyncPipeline implementation * format * test: add file conversion test * format * fix: original implementation deepcopies outputs * lint * fix: from_dict was updated * fix: format * fix: test * test: add test for thread safety * remove unused imports * format * test: FIFOPriorityQueue * chore: add release note * fix: resolve merge conflict with mermaid changes * fix: format * fix: remove unused import * refactor: rename to avoid accidental conflicts * chore: remove unused inputs, add missing license header * chore: extend release notes * Update releasenotes/notes/fix-pipeline-run-2fefeafc705a6d91.yaml Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com> * fix: format * fix: format * Update release note --------- Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com> Co-authored-by: David S. Batista <dsbatista@gmail.com>	2025-02-06 14:19:47 +00:00
mathislucka	fe9b1e29d4	CI: fix format after newly introduced formatting rules from ruff release (#8696 )	2025-01-09 16:25:55 +00:00
Stefano Fiorucci	ea3602643a	feat!: new `ChatMessage` (#8640 ) * draft * del HF token in tests * adaptations * progress * fix type * import sorting * more control on deserialization * release note * improvements * support name field * fix chatpromptbuilder test * Update chat_message.py --------- Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>	2024-12-17 17:02:04 +01:00
Stefano Fiorucci	c8685aa141	refactor: update components to access `ChatMessage.text` instead of `content` (#8589 ) * introduce text property and deprecate content * release note * use chatmessage.text * release note * linting	2024-11-28 10:16:07 +00:00
Sebastian Husch Lee	294a67e426	feat: Adding StringJoiner (#8357 ) * Adding StringJoiner * Release notes * Remove typing * Remove unused import * Try to fix header * Fix one test * Add to docs, move test to behavioral pipeline test * Undo changes * Fix test * Update haystack/components/joiners/string_joiner.py Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com> * Update haystack/components/joiners/string_joiner.py Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com> * Provide usage example * Apply suggestions from code review Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com> --------- Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com> Co-authored-by: Silvano Cerza <3314350+silvanocerza@users.noreply.github.com>	2024-10-30 15:03:41 +00:00
Silvano Cerza	8205724395	feat: Rework `Pipeline.run()` to better handle cycles (#8431 ) * draft * Enhance * Almost works * Simplify some parts and handle intermediate outputs * Handle connections with default * Handle cycles with multiple connections from two components * Update distributed outputs at the correct time * Remove Component inputs after it runs * Add agent pipeline test case * Fix infite loop test * Handle some corner cases with loops checking and inputs deletion * Fix tests * Add new behavioral test * Remove unused code in behavioural test * Fix behavioural test * Fix max run check * Simplify outputs distribution * Simplify subgraph run check * Remove unused _init_run_queue function * Remove commented code * Add some missing type hints * Simplify cycles breaking * Fix _distribute_output test * Fix _find_components_that_will_receive_no_input test * Fix validation test * Fix tracer losing Component inputs * Fix some linting issues * Remove ignore pylint rule * Rename method that break cycles and make it raise * Add docstring to _run_subgraph * Update Pipeline.run() docstring * Update comment to clarify cycles execution * Remove SelfLoop sample Component * Add behavioural test for unsupported cycles * Rename behavioural test to be more specific * Add new behavioural test * Add release notes * Remove commented out code and random pass * Use more efficient function to find cycles * Simplify _break_supported_cycles_in_graph by using defaultdict * Stop breaking edges as soon as we make the graph acyclic * Fix docstring and add some more comments * Fix _distribute_output docstring * Fix _find_receivers_from docstring * More detailed release notes * Minimize calls to networkx.is_directed_acyclic_graph * Add some more info on edges keys * Adjust components_in_cycles comment * Add new Pipeline behavioural test * Enhance _find_components_that_will_receive_no_input to cover more cases * Explain why run_queue is reset after running a subgraph cycle * Rename _init_inputs_state to _normalize_input_data * Better explain the subgraph output distribution * Remove for else * Fix some comments and docstrings * Fix linting * Add missing return type * Fix typo * Rename _normalize_input_data to _normalize_varidiac_input_data and add more documentation * Remove unused import --------- Co-authored-by: Sebastian Husch Lee <sjrl423@gmail.com>	2024-10-29 15:43:16 +01:00
Ajit Singh	7ba30d5691	feat: `Pipeline.connect()` will now raise a `PipelineConnectError` if `sender` and `receiver` are the same Component (#8403 ) * Added equality check for sender and receiver in connection function of pipeline * Update base.py irrelevant changes reverted * added release note * altered a walk with cycle test * added a test to verify that pipeline raises PipelineConnectError when adding a component to itself * Update release notes * Remove self connection feature tests * Tidy up connect unit test --------- Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>	2024-09-30 15:52:36 +02:00
Sebastian Husch Lee	74f7c6fdfb	Set max_runs_per_component to 1 for pipelines that are linear (#8393 )	2024-09-24 14:44:45 +02:00
Sebastian Husch Lee	2235ce673f	test: Move pipeline test to behavorials (#8377 )	2024-09-19 16:59:35 +02:00
Madeesh Kannan	672bcf7e03	fix: Add constraints to `set_input_type(s)` based on `run` method (#8358 ) * fix: Prevent the usage of `set_input_type(s)` when the `run` method doesn't have kwargs, raise if `set_input_type(s)` overrides `run` method parameters * fix: update components and tests * reno	2024-09-12 15:58:16 +02:00
Silvano Cerza	5514676b5e	feat: Deprecate `max_loops_allowed` in favour of new argument `max_runs_per_component` (#8354 ) * Deprecate max_loops_allowed in favour of new argument max_runs_per_component * Add missing test file * Some enhancements * Add version that will remove deprecate stuff	2024-09-12 11:00:12 +02:00
Silvano Cerza	4d67b552e1	Fix Pipeline skipping a Component with Variadic input (#8347 ) * Fix Pipeline skipping a Component with Variadic input * Simplify _find_components_that_will_receive_no_input	2024-09-10 14:59:53 +02:00
Silvano Cerza	c7e29a83c1	fix: Fix infinite loop when running Pipeline (#8123 ) * Fix infinite loop when running Pipeline * Simplify if	2024-07-30 15:00:12 +02:00
Amna Mubashar	499fbcc59f	Remove Multiplexer and related tests (#8020 )	2024-07-16 15:39:40 +02:00
Silvano Cerza	0411cd938a	Fix bug in Pipeline.run() executing Components in a wrong and unexpected order (#8021 ) * Fix bug in Pipeline.run() executing Components in a wrong and unexpected order * Update haystack/core/pipeline/base.py Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com> --------- Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>	2024-07-12 15:30:10 +00:00
Massimiliano Pippi	3a03fce71c	ci: Add code formatting checks (#7882 ) * ruff settings enable ruff format and re-format outdated files feat: `EvaluationRunResult` add parameter to specify columns to keep in the comparative `Dataframe` (#7879) * adding param to explictily state which cols to keep * adding param to explictily state which cols to keep * adding param to explictily state which cols to keep * updating tests * adding release notes * Update haystack/evaluation/eval_run_result.py Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com> * Update releasenotes/notes/add-keep-columns-to-EvalRunResult-comparative-be3e15ce45de3e0b.yaml Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com> * updating docstring --------- Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com> add format-check fail on format and linting failures fix string formatting reformat long lines fix tests fix typing linter pull from main * reformat * lint -> check * lint -> check	2024-06-18 15:52:46 +00:00
Silvano Cerza	3c8569e12c	fix: Fix running `Pipeline` with conditional branch and Component with default inputs (#7799 ) * Fix running Pipeline with conditional branch and Component with default inputs * Add release notes * Change arg name of _init_to_run so it's clearer * Enhance release note	2024-06-06 13:19:07 +00:00
Silvano Cerza	74df8ed937	test: Rework `Pipeline.run()` tests to ease declaration with dataclasses (#7790 ) * Rework boilerplate function that run Pipeline in scenarios testing * Update tests to use new dataclasses * Update README.md to reflect dataclass changes * Use absolute import from conftest	2024-06-03 15:59:42 +02:00
Silvano Cerza	07ae45e0c2	test: Migrate `Pipeline.run()` tests with run arguments (#7777 ) * Support Pipeline.run() arguments in tests * Move intermediate outputs	2024-06-03 12:36:04 +02:00
Silvano Cerza	d81af81fbb	test: Migrate pipeline run tests (#7775 ) * Move complex pipeline * Move pipeline with default * Move pipeline with distinct loops * Move pipeline with double loop * Move pipeline with dynamic inputs * Move fixed decision pipeline * Move fixed merging pipeline * Move fixed decision and merge pipeline * Remove test_joiners.py * Move looping and merge pipeline * Remove test_looping.py * Move mutable input pipeline * Move parallel branches pipeline * Move same input different components pipeline * Move test_run_with_greedy_variadic_after_component_with_default_input_simple * Remove test_run_raises_if_max_visits_reached * Move test_run_with_component_that_does_not_return_dict * Move test_correct_execution_order_of_components_with_only_defaults * Move test_pipeline_is_not_stuck_with_components_with_only_defaults * Move test_pipeline_is_not_stuck_with_components_with_only_defaults_as_first_components * Move self loop pipeline * Move variable decision and merge pipeline * Remove test_variable_decision_pipeline * Move variable merging pipeline * Add FakeComponent removed by mistake	2024-05-31 13:00:29 +02:00
Silvano Cerza	3dcc21fd73	test: Pipeline run tests rework (#7748 ) * Rework Pipeline.run() tests * Remove test_linear_pipeline.py * Add test for components execution order * Add new pytest-bdd tests dependency * Update README.md * Add function to dinamically add integration marker * Fix marking tests as integration	2024-05-28 15:42:47 +02:00

33 Commits