haystack

mirror of https://github.com/deepset-ai/haystack.git synced 2025-11-16 18:13:54 +00:00

Author	SHA1	Message	Date
Amna Mubashar	9ed0b9b0bc	fix: Update the de/serialization with schema utils (#9526 ) * Update the util methods * Update tests * fix tests * schema fix * Add json schema for tuples and sets * Add proper conversion for sets and tuples * Adjust typing * PR comments * Linting * Optimize deserialization * remove TODO * PR comments * PR comments * Update tests and deserialization error * Support legacy deserialization * Update deprecating warning * Update test	2025-06-24 13:10:12 +02:00
Sriniketh J	6198f0cba9	feat: adding support for torch xpu device (#9470 ) * feat: add support for torch xpu device support * test: xpu based tests ci/cd * test: add xpu code device support --------- Co-authored-by: Sebastian Husch Lee <10526848+sjrl@users.noreply.github.com> Co-authored-by: David S. Batista <dsbatista@gmail.com>	2025-06-17 14:15:19 +02:00
David S. Batista	b85c8e3382	feat: adding `deserialize_component_inplace()` (#9459 ) * adding tests * adding release notes * deserialize_chatgenerator_inplace uses deserialize_component_inplace * removing tests	2025-06-02 09:40:35 +02:00
Stefano Fiorucci	2616d4d55b	test: speed up some tests + minor refactorings (#9451 ) * this is an integration test * more improvements * rm redundant comments	2025-05-29 09:49:11 +02:00
atopx	3deaa20cb6	feat: Add HuggingFace API (text-embeddings-inference for rerank model) for component.rankers (#9414 ) * feat(component.rankers): Add HuggingFace API (text-embeddings-inference for rerank) ranker component * update test flow & doc loaders * Support run_async for HuggingFaceAPIRanker * Add release note for HuggingFace API support in component.rankers * Add release note for HuggingFace API support in component.rankers * Add release note for HuggingFace API support in component.rankers * Add release note for HuggingFace API support in component.rankers * fix: 1. `hugging_face_api.HuggingFaceAPIRanker` rename to `hugging_face_tei.HuggingFaceAPIRanker` 2. HuggingFaceAPIRanker: use our Secret API for token 3. add the missing modules for `docs/pydoc/config/rankers_api.yml` 4. added function `async_request_with_retry` for `haystack/utils/requests_utils.py` and added unittest on `test/utils/test_requests_utils.py` 4. HuggingFaceAPIRanker: refactor the retry function to support configuration based on attempts and status code. 5. HuggingFaceAPIRanker: refactor the test into unit tests using mocks * fix(HuggingFaceTEIRanker): change the token check logic to use the resolve_value method. * fix(format): run `hatch run format` * fix: - Force keyword-only arguments in __init__ method by adding , - Clarify token docstring that it's not always required - Copy documents to avoid modifying original objects - Remove test file from slow workflow - Add monkeypatch eånvironment variable cleanup in tests - Fix missing module in rankers_api.yml and sort modules alphabetically - Remove unnecessary test info from release notes fix HuggingFaceTEIRanker： - "None" of "Optional[Secret]" has no attribute "resolve_value" - run/run_async: too many parameters * fix(HuggingFaceTEIRanker) :Revise the docstring of the HuggingFaceTEIRanker, improve the parameter descriptions, ensure consistency and clarity. Add error handling information to enhance the readability of the API response. * fix：unit test for HuggingFaceTEIRanker raise message * fix fmt * minor refinements * refine release note --------- Co-authored-by: anakin87 <stefanofiorucci@gmail.com>	2025-05-27 12:44:54 +02:00
David S. Batista	da60156174	chore: removing unused imports from tests (#9446 )	2025-05-26 16:22:51 +00:00
Amna Mubashar	720cc19d7d	feat: add serialization to `State` / move `State` to agents.state (#9345 ) * Add serialization to State * Add release notes * Deprecate State in dataclasses * Fix tests * Remove state_utils test * Fix linting * Fix formating * Update tests and remove old state utils * Update agents test * Update deserilaization per review * Linting * Add tests for edge case (custom class types) * Fix type serialization * PR comments * Move State to agents * Fix tests * Update utils init * Improve seriliaztion/deser * Update the release notes * Minor fix in docstrings * PR comments * Add deprecation warnign for state utils * Recreate the serialization methods to use schema * Update key names * Make serialization methods private	2025-05-23 11:04:15 +02:00
Amna Mubashar	498637788a	feat: Allow OpenAI client config in `OpenAIChatGenerator` and `AzureOpenAIChatGenerator` (#9215 ) * Allow OpenAI client config in chat generator * Add init_http_client as a util method * Update azure chat gen * Fix linting	2025-04-16 18:32:13 +02:00
Stefano Fiorucci	81fbe546cb	feat: `ChatGenerator` protocol - do not require `to_dict` and `from_dict` methods (#9213 ) * minimize protocol * progress * rm unneeded test changes * reno * use keywords arguments for clarity	2025-04-11 10:30:48 +02:00
Stefano Fiorucci	c6df8d2c7a	test: monkeypatch OpenAI API key in some unit tests (#9173 )	2025-04-04 13:33:22 +02:00
Stefano Fiorucci	adc3dfc5d2	refactor: LLM evaluators - introduce `chat_generator` init param; deprecate `api`, `api_key` and `api_params` (#9122 ) * start * progress * tests for deserialize_chatgenerator_inplace * progress on llmevaluator + tests * update context relevance evaluator * update faithfulness evaluator + tests * release note * rm unused import * rm indentation	2025-03-31 15:35:03 +02:00
Sebastian Husch Lee	3d41c04130	fix: Fix type serialization and deserialization (#8993 ) * Expand tests * New version of type serialization * Adding more tests * More tests * Fix type serialization when using python 3.9 * Deserialization works with Optional now and we don't require 'typing.' to be present anymore * Don't worry about Literal * Add reno * Fix mypy * Pylint * Add additional test * Simplify * Add back comment * Fix types * Fix	2025-03-07 11:10:16 +01:00
Stefano Fiorucci	f3c44be904	refactor!: remove `dataframe` field from `Document` and `ExtractedTableAnswer`; make `pandas` optional (#8906 ) * remove dataframe * release note * small fix * group imports * Update pyproject.toml Co-authored-by: Julian Risch <julian.risch@deepset.ai> * Update pyproject.toml Co-authored-by: Julian Risch <julian.risch@deepset.ai> * address feedback --------- Co-authored-by: Julian Risch <julian.risch@deepset.ai>	2025-03-04 11:06:07 +00:00
Sebastian Husch Lee	93f361e1e1	fix: Fix serialization of `typing.Any` when using `serialize_type` utility (#8853 ) * Fix issue and expand tests * Add reno	2025-02-18 17:26:56 +01:00
Sebastian Husch Lee	1ee86b5041	fix: Fix filters to handle date times with timezones (loading and comparison) (#8800 ) * Fix on date time parsing with timezones. And comparing naive and aware date times. * Add release note * Add more filter tests	2025-02-04 14:51:06 +01:00
mathislucka	1a91365cc8	fix: callables can be deserialized from fully qualified import path (#8788 ) * fix: callables can be deserialized from fully qualified import path * fix: license header * fix: format * fix: types * fix? types * test: extend test case * format * add release notes	2025-02-03 12:35:37 +01:00
Stefano Fiorucci	5539f6c33f	refactor: improve serialization/deserialization of callables (to handle class methods and static methods) (#8683 ) * progress * refinements * tidy up * release note	2025-01-08 11:28:00 +01:00
Bohan Qu	8e3f64717f	feat: use importlib when deserializing callables (#8648 )	2025-01-03 15:06:58 +01:00
Stefano Fiorucci	2bc58d2987	feat: support for tools in `HuggingFaceAPIChatGenerator` (#8661 ) * message conversion function * hfapi w tools * right test file + hf_hub version * release note * feedback	2024-12-19 15:04:37 +01:00
Madeesh Kannan	ee89f6ad57	fix: `PyPDFToDocument` correctly serializes custom converters, deprecate `DefaultConverter` (#8430 ) * fix: `PyPDFToDocument` correctly serializes custom converters, deprecate `DefaultConverter` * Remove `auto` prefix from serde util function names, add unit tests	2024-10-01 16:35:38 +02:00
Vladimir Blagojevic	7e9f153e78	chore: Remove all references to old filter syntax (#8342 ) * Remove all references to old filter syntax * More removals * Lint * Do not remove test_filter_retriever.py * Add reno note * Update ValueError text to match text in haystack-core-integrations	2024-09-12 16:28:31 +02:00
Silvano Cerza	da49e782e2	chore: Make `arrow` an optional dependency (#8345 ) * Make arrow an optional dependency * Fix imports	2024-09-09 16:09:51 +02:00
Mo Sriha	75955922b9	feat: Add current date in UTC to PromptBuilder (#8233 ) * initial commit * add unit tests * add release notes * update function name	2024-09-09 09:47:03 +02:00
Alper	e614fa0c62	refactor: Rename deserialize_document_store_in_init_parameters (#8302 ) * 8259 * update function name * rename and update docstring * fix linting * add a release note	2024-09-02 11:42:23 +02:00
Stefano Fiorucci	bcc4104729	refactor: utility function for docstore deserialization (#8226 ) * refactor docstore deserialization * more tests * reno; headers * expose key	2024-08-14 13:29:27 +02:00
Vladimir Blagojevic	c8d53b3ebf	fix: Adjust serialization to handle PEP-585 generic types (#7690 ) * Adjust serialization to handle PEP-585 generic types * Add reno note * Simplify * PEP 585 serialization handling in sys.version_info < (3, 9)	2024-05-15 14:25:19 +02:00
Massimiliano Pippi	10c675d534	chore: add license header to all modules (#7675 ) * add license header to modules * check license header at linting time	2024-05-09 13:40:36 +00:00
Stefano Fiorucci	1d083861ff	feat: `HuggingFaceAPIGenerator` (#7464 ) * draft * docstrings and more tests * deprecation; reno * pydoc config * better error messages * rm unneeded else * make params mandatory * Apply suggestions from code review Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com> * document enum * Update haystack/utils/hf.py Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com> * fix test --------- Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>	2024-04-05 18:48:13 +02:00
Vladimir Blagojevic	08e97d874e	Export (de)serialize_callable and (de)serialize_type in haystack.utils (#7069 ) * Export (de)serialize_callable and (de)serialize_type in haystack.utils * Group imports where possible	2024-02-22 17:07:07 +01:00
Madeesh Kannan	b552b0b37c	fix: Ensure `Secret` types are immutable (#6994 )	2024-02-16 12:46:38 +01:00
Vladimir Blagojevic	a311d82593	feat: Externalize callable serialization so it can be reused (#6979 ) * Callback (de)serialization * Add unit tests * Replace callback handler sede with callable sede * Remove unused functions --------- Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>	2024-02-13 13:00:49 +01:00
Madeesh Kannan	b772c1127c	feat: Implement `Secret` for structured authentication (#6855 ) * feat: Implement `AuthPolicy` for structured authentication * Rename `AuthPolicy` to `Secret` * Update release notes, fix typo	2024-01-31 12:51:14 +01:00
Sebastian Husch Lee	ceda4cd655	feat: Add support for `device_map` (#6679 ) * Getting device_map working to support 8bit loading and multi device inference * Update to take account the device specified by the user * add release notes * Add device_map support for ExtractiveReader * Update test * Update to model that doesn't have issues * Update test * Update pytest approx * Update release notes * Start supporting device map * Update ExtractiveReader to use new ComponentDevice * Update similarity ranker to follow extractive reader implementation * Fixing pylint * Make mypy mostly happy * Add new unit test to test device_map * Adding unit tests * Some refactoring * Add more tests * Add more tests * Add another unit test * Update first_device property to return a ComponentDevice to be able to use the to methods * Updating tests for test_device * Update tests and now explicitly modify device_map in model_kwargs * Update haystack/utils/hf.py Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com> * Make mypy happy * mypy * Remove unneeded optional flag * Update ExtractiveReader with new logic * Update ranker to follow new logic * Removing unneeded code * Make mypy happy * fxi pylint * Fix test * Adding unit tests for device_map="auto" * Add unit tests for ranker * PR comments * Make util method * Adding unit tests * Fix type annotation * Fix pylint * Fix test --------- Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>	2024-01-30 13:47:57 +01:00
Madeesh Kannan	7376838922	feat!: Framework-agnostic device management (#6748 ) * feat: Framework-agnostic device management * Add release note * Linting * Fix test * Add `first_device` property, expand release notes, validate `ComponentDevice` state	2024-01-17 10:41:34 +01:00
ZanSara	f877704839	chore: extract type serialization (#6586 ) * move functions * tests * reno	2023-12-19 14:16:20 +01:00
Sebastian Husch Lee	c294b8ac8c	feat: Add auto device checks and `model_kwargs` to `TransformersSimilarityRanker` (#6561 ) * Add device checking and model_kwargs like we do in ExtractiveReader * Add release notes * Make a utility function for the device checking * Better warning message and updated ExtractiveReader to use the util function * Add unit tests for get_device * Fix pylint	2023-12-18 15:13:42 +01:00
Silvano Cerza	e6637f5ec2	Fix all tests	2023-11-24 14:48:43 +01:00
Massimiliano Pippi	8adb8bbab8	Remove preview folder in test/ --------- Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>	2023-11-24 11:52:55 +01:00
Massimiliano Pippi	09e7831f60	clean up 1.x code --------- Co-authored-by: Silvano Cerza <silvanocerza@gmail.com>	2023-11-24 11:47:47 +01:00
Tobias Wochinger	d5d3a9eef4	chore: adapt deepset cloud sdk endpoint format for saving pipelines (#5969 ) * chore: adapt to new endpoints formats * docs: add release notes	2023-10-05 08:56:28 +02:00
Christian Clauss	1bc03ddc73	ci: Fix all ruff pyflakes errors except unused imports (#5820 ) * ci: Fix all ruff pyflakes errors except unused imports * Delete releasenotes/notes/fix-some-pyflakes-errors-69a1106efa5d0203.yaml	2023-09-15 18:30:33 +02:00
ZanSara	462f3a5c99	feat: globally disable progress bars (#5207 ) * add SilenceableTqdm and update usage * pylint * rename module * add tests	2023-06-27 11:45:17 +02:00
Michael Feil	cfd703fa3e	fix: model_tokenizer in openai text completion tokenization details (#5104 ) * fix: model_tokenizer * Update test --------- Co-authored-by: Sebastian Husch Lee <sjrl423@gmail.com>	2023-06-22 14:23:19 +02:00
Ben Heckmann	1318ac5074	feat: Optional Content Moderation for OpenAI PromptNode & OpenAIAnswerGenerator (#5017 ) * #4071 implemented optional content moderation for OpenAI PromptNode * added two simple integration tests * improved documentation & renamed _invoke method to _execute_openai_request * added a flag to check_openai_policy_violation that will return a full dict of all text violations and their categories * re-implemented the tests as unit tests & without use of the OpenAI APIs * removed unused patch * changed check_openai_policy_violation back to only return a bool * fixed pylint and test error --------- Co-authored-by: Julian Risch <julian.risch@deepset.ai>	2023-06-19 13:27:11 +02:00
ZanSara	49e037a055	fix: rename `requests.py` into `requests_utils.py` (#5099 ) * requests.py -> requests_utils.py * fix tests * reimport requrests * fix more tests * review feedback	2023-06-12 12:40:21 +02:00
ZanSara	123ee55a5c	docstring (#4950 )	2023-05-18 16:00:02 +02:00
Julian Risch	8cfeed095d	build: Remove mmh3 dependency (#4896 ) * build: Remove mmh3 dependency * resolve circular import * pylint * make mmh3.py sibling of schema.py * pylint import order * pylint * undo example changes * increase coverage in modeling module * increase coverage further * rename new unit tests	2023-05-17 21:31:08 +02:00
Farzad E	6eb251d1f0	fix: Support for gpt-4-32k (#4825 ) * Add step to loook up tokenizers by prefix in openai_utils * Updated tiktoken min version + openai_utils test * Added test case for GPT-4 and Azure model naming * Broken down tests * Added default case --------- Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>	2023-05-12 19:02:12 +02:00
Silvano Cerza	f12e5a0127	fix: Fix missing error in openai_request retry strategy (#4802 ) * Fix missing error in openai_request retry strategy * Correctly handle OpenAIUnauthorizedError Co-authored-by: bogdankostic <bogdankostic@web.de> --------- Co-authored-by: bogdankostic <bogdankostic@web.de>	2023-05-10 10:31:07 +02:00
Silvano Cerza	5547e85bd5	feat: Add util method to make HTTP requests with configurable retry (#4627 ) * Add util method to make HTTP requests with configurable retry * Fix pylint * Remove unnecessary optional parameter	2023-04-11 10:35:39 +02:00

50 Commits