Agnieszka Marzec
d441c2faab
Docs: Update HuggingFaceAPIChatGEnertaor docstrings ( #8152 )
...
* update docstrings
* Update haystack/components/generators/chat/hugging_face_api.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/generators/chat/hugging_face_api.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-08-02 09:31:59 +00:00
Sebastian Husch Lee
c90495c2e8
feat: Add model and tokenizer kwargs to TransformersSimilarityRanker, SentenceTransformersDocumentEmbedder, SentenceTransformersTextEmbedder ( #8145 )
...
* Start adding model and tokenizer kwargs support
* Add model and tokenizer kwargs to doc embedder
* Some updates and fixes in tests
* Fix more tests
* Fix tests
* Add release note
* Fix test
* Add from_dict tests
2024-08-02 10:37:10 +02:00
Agnieszka Marzec
d9a7a7a4db
Docs: Update ConditionalRouter docstrings ( #8140 )
...
* update docstrings
* Update haystack/components/routers/conditional_router.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* add reviewer's comments
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-08-02 10:34:01 +02:00
Agnieszka Marzec
c670f0fbee
Docs: update SentenceWindowRetriever docstrings ( #8138 )
...
* update docstrings
* Update haystack/components/retrievers/sentence_window_retriever.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/retrievers/sentence_window_retriever.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-08-01 18:05:31 +02:00
Agnieszka Marzec
ffbaed85de
update docstrings ( #8142 )
2024-08-01 16:27:30 +02:00
Agnieszka Marzec
bec822c361
Docs: Update FilterRetriever docstrings ( #8133 )
...
* update docstrings
* Update haystack/components/retrievers/filter_retriever.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-08-01 13:16:44 +02:00
Vladimir Blagojevic
25d3520f5a
feat: Add AnswerJoiner new component ( #8122 )
...
* Initial AnswerJoiner
* Initial tests
* Add release note
* Resove mypy warning
* Add custom join function
* Serialize custom join function
* Handle all Answer types, add integration test, improve pydoc
* Make fixes
* Add to API docs
* Add more tests
* Update haystack/components/joiners/answer_joiner.py
Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com>
* Update docstrings and release notes
* update docstrings
---------
Co-authored-by: Sebastian Husch Lee <sjrl423@gmail.com>
Co-authored-by: Sebastian Husch Lee <sjrl@users.noreply.github.com>
Co-authored-by: Amna Mubashar <amnahkhan.ak@gmail.com>
Co-authored-by: Darja Fokina <daria.fokina@deepset.ai>
2024-08-01 12:51:17 +02:00
Stefano Fiorucci
3d1ad10385
fix html test ( #8127 )
2024-07-31 10:59:53 +02:00
Daria Fokina
bc153c233c
docs: clean up docstrings of TransformersSimilarityRanker ( #8124 )
...
* update transformerssimilarityranker docstrings
* Apply suggestions from code review
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
* upd device param
---------
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
2024-07-31 09:54:32 +02:00
Daria Fokina
ac51885fe8
docs: clean up docstrings of OpenAITextEmbedder ( #8120 )
...
* update docstrings
* update capitalization
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
---------
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
2024-07-31 09:53:25 +02:00
Daria Fokina
28141ec6b9
docs: clean up docstrings of OpenAIChatGenerator ( #8125 )
...
* openaichatgen-docstrings
* link update
2024-07-31 09:45:14 +02:00
Silvano Cerza
c7e29a83c1
fix: Fix infinite loop when running Pipeline ( #8123 )
...
* Fix infinite loop when running Pipeline
* Simplify if
2024-07-30 15:00:12 +02:00
Agnieszka Marzec
1d4883f178
update docstrings ( #8117 )
2024-07-30 11:10:36 +02:00
Agnieszka Marzec
42f59fc022
update docstrings ( #8115 )
2024-07-30 11:08:45 +02:00
Daria Fokina
21de1f87d4
docs: clean up docstrings of AnswerBuilder ( #8094 )
...
* answerbuilder docstrings
* update the `replies`
* Apply suggestions from code review
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
* Update answer_builder.py
---------
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
2024-07-30 11:06:39 +02:00
Agnieszka Marzec
e8598befb6
Docs: Update OpenAIGen docstrings and add missing headers ( #8105 )
...
* update docstrings
* Update haystack/components/generators/openai.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-07-30 11:06:17 +02:00
Daria Fokina
92e2377eff
docs: clean up docstrings of FileTypeRouter ( #8098 )
...
* upd filetyperouter docstrings
* Suggestions from code review
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
* aga's suggestions
---------
Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
2024-07-30 08:39:08 +02:00
Agnieszka Marzec
8ce7bedf25
Docs: Update DocSplitter docstrings ( #8081 )
...
* update docstrings
* Update haystack/components/preprocessors/document_splitter.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/preprocessors/document_splitter.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* fix article
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-07-29 15:11:12 +02:00
Agnieszka Marzec
abb24c61c2
Docs: Update DocumentEmbedder docstrings ( #8112 )
...
* update docstrings
* Update haystack/components/embedders/sentence_transformers_document_embedder.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/embedders/sentence_transformers_document_embedder.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/embedders/sentence_transformers_document_embedder.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* fix casing
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-07-29 15:10:49 +02:00
Agnieszka Marzec
950c632009
Docs: Update DocumentCleaner docstrings ( #8106 )
...
* update docstrings
* Update haystack/components/preprocessors/document_cleaner.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* fix article
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-07-29 14:45:15 +02:00
Agnieszka Marzec
da81d10060
Docs: Update DocumentJoiner docstrings ( #8109 )
...
* update docstrings
* Update haystack/components/joiners/document_joiner.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/joiners/document_joiner.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/joiners/document_joiner.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/joiners/document_joiner.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/joiners/document_joiner.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* fix typo
* fix typo
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-07-29 14:39:44 +02:00
Corentin Meyer
1c53aae8f0
fix: Tika converter not yielding page break tags (\f) ( #8082 )
...
* Fix TikaConverter not having \f page tag by using HTML mode of parsing and then parsing the HTML to text using the old Haystack 1.X integration as template.
* Add Reno
* Fix test by making Mock Tika return XML (before parsing)
* refinements and test
---------
Co-authored-by: anakin87 <stefanofiorucci@gmail.com>
2024-07-26 20:13:47 +02:00
Amna Mubashar
e0de423ee0
Rename SentenceWindowRetrieval to SentenceWindowRetriever
2024-07-26 17:46:44 +02:00
Silvano Cerza
3fed1366c4
fix: Fix issue that could lead to RCE if using unsecure Jinja templates ( #8095 )
...
* Fix issue that could lead to RCE if using unsecure Jinja templates
* Add comment explaining exception suppression
* Update release note
* Update release note
2024-07-26 14:02:09 +00:00
Nicola Procopio
47f4db8698
added truncate_dim to sentence transformers embedder ( #8077 )
...
* added truncate_dim to sentence transformers embedder
* Update haystack/components/embedders/sentence_transformers_document_embedder.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* Update releasenotes/notes/release-note-2b603a123cd36214.yaml
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* fixed parameter description
* added test for truncation to text embedder
* fix format
---------
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
2024-07-26 10:39:48 +02:00
Madeesh Kannan
b2aef217da
chore: Remove deprecated DynamicPromptBuilder and DynamicChatPromptBuilder components ( #8085 )
2024-07-26 10:00:59 +02:00
Daria Fokina
f372ca443c
bm25 retriever docstrings ( #8087 )
2024-07-25 17:28:21 +02:00
Agnieszka Marzec
1f58ec20a8
Docs: Standardize and improve SentenceTransformersTextEmbedder docstrings ( #8060 )
...
* Update docstrings
* format
* add Daria's comments
* Update haystack/components/embedders/sentence_transformers_text_embedder.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
* Update haystack/components/embedders/sentence_transformers_text_embedder.py
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
---------
Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2024-07-25 13:56:51 +02:00
Agnieszka Marzec
de728b4877
Docs: Simplify lg + standardize docstrings ( #8057 )
...
* Simplify lg + standardize
* Format
* Update formatting
* Fix formatting again
* Fix empty line
* Change formatting
* Format with black
---------
Co-authored-by: Vladimir Blagojevic <dovlex@gmail.com>
2024-07-25 13:24:42 +02:00
Agnieszka Marzec
855f8e61f3
Docs: Update InMemoryEmbeddingRetriever docstrings ( #8068 )
...
* update docstrings
* Update documents to lowercase
2024-07-25 13:24:00 +02:00
Madeesh Kannan
f9e4d5dc58
chore: Deprecate the debug parameter in Pipeline.run ( #8075 )
2024-07-25 09:58:57 +00:00
Tobias Wochinger
4dde6fbaec
build: unpin structlog ( #8071 )
2024-07-24 20:58:34 +02:00
Amna Mubashar
b374c528b2
Assign streaming_callback to OpenAIGenerator and OpenAIChatGenerator in run() method ( #8054 )
...
* Add optional parameter for streaming_callback in run() method
2024-07-24 15:49:19 +02:00
Sebastian Husch Lee
baed478f23
fix: Fix split_start_idx and _split_overlap information in DocumentSplitter ( #8046 )
...
* Fix bug in DocumentSplitter and expand tests to catch said bug
* Fix split overlap information calc and actually test it
* Add release notes
* Remove comments
* Same fix in SentenceWindowRetrieval
---------
Co-authored-by: Vladimir Blagojevic <dovlex@gmail.com>
2024-07-24 15:15:36 +02:00
Stefano Fiorucci
b36ec0a38c
fix release note ( #8070 )
2024-07-24 15:03:01 +02:00
Tobias Wochinger
38d38678c7
fix: fix PPTX import ( #8069 )
...
* fix: fix PPTX import
* docs: add release notes
2024-07-24 14:50:47 +02:00
Agnieszka Marzec
a022af02bc
Update docstrings ( #8066 )
2024-07-24 13:54:39 +02:00
Madeesh Kannan
4650263bc3
chore: Remove deprecated init paramters from HTMLToDocument ( #8056 )
...
* chore: Remove deprecated init paramters from `HTMLToDocument`
* Fix reno
2024-07-24 13:16:47 +02:00
David S. Batista
0c9dc008f0
fix: improve context relevancy metric ( #7964 )
...
* fixing tests
* fixing tests
* updating tests
* updating tests
* updating docstring
* adding release notes
* making the insufficient information more robust
* updating docstring and release notes
* empty list instead of informative string
* Update haystack/components/evaluators/context_relevance.py
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
* Update haystack/components/evaluators/context_relevance.py
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
* fixing tests
* Update haystack/components/evaluators/context_relevance.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* reverting commit
* reverting again commit
* fixing docstrings
* removing deprecation warning
* removing warning import
---------
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
2024-07-22 15:13:46 +02:00
Vladimir Blagojevic
a59de1d7b3
chore: Combined main unblock ( #8045 )
...
* Pin structlog to 24.2.0 due to unit test failures
* Remove object init parameter in huggingface_hub unit tests
* Use less restrictive structlog pin
* Add release note
2024-07-19 10:39:10 +02:00
Daria Fokina
913078dfaa
docs: add sentence window retrieval to api reference ( #8032 )
...
* docs: add sentence window retrieval to api reference
* deprecating multiplexer
2024-07-17 11:16:58 +02:00
Amna Mubashar
3fa6c253c3
fix: Prevent Pipeline.from_dict from modifying the dictionary parameter passed to it ( #8030 )
...
* Updated the pipeline deserialization
2024-07-17 10:28:29 +02:00
David S. Batista
431aa4a406
updating sentence window retriever tests ( #8034 )
...
* updating sentence window retriever tests
* fix
2024-07-16 22:10:55 +02:00
David S. Batista
3ed69c4aab
docs: adding example to docstring to SentenceWindowRetrieval ( #8031 )
...
* adding example to docstring
* small fix
* Update haystack/components/retrievers/sentence_window_retrieval.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* Update haystack/components/retrievers/sentence_window_retrieval.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* PR comments
* Update haystack/components/retrievers/sentence_window_retrieval.py
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
* PR comments
* PR comments
---------
Co-authored-by: Stefano Fiorucci <stefanofiorucci@gmail.com>
2024-07-16 16:22:26 +02:00
Amna Mubashar
499fbcc59f
Remove Multiplexer and related tests ( #8020 )
2024-07-16 15:39:40 +02:00
Silvano Cerza
0411cd938a
Fix bug in Pipeline.run() executing Components in a wrong and unexpected order ( #8021 )
...
* Fix bug in Pipeline.run() executing Components in a wrong and unexpected order
* Update haystack/core/pipeline/base.py
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
---------
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
2024-07-12 15:30:10 +00:00
Madeesh Kannan
94b806815c
refactor: Improve error messages shown during pipeline deserialization ( #8016 )
...
* refactor: Improve error messages shown during pipeline deserialization
* Add link to release notes
* Update release notes link
2024-07-12 14:47:00 +00:00
Anushree Bannadabhavi
1f05e633a9
refactor: refactor DocumentJoiner to follow enum pattern for join_mode parameter ( #8010 )
...
* refactor document joiner to follow enum pattern for join mode
* Added to_dict and from_dict
2024-07-12 11:29:44 +02:00
Silvano Cerza
0cec82e55e
refactor: Pipeline.run() ( #8019 )
...
* Move utility functions from _enqueue_next_runnable_component (#7895 )
* Isolate logic to check if we're stuck in a loop
* Simplify for else
* Add missing return in docstring
* Emit warning when stuck in a loop
* Fix docstring
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
* Add utility function to move Components in queues
* Add function to find next Component to run
* Comment update
* Add missing break in loop
* Make _add_missing_input_defaults less error prone and add tests
* Fix tests
* Update docstring
* Simplify enqueue logic
* Remove unused _enqueue_next_runnable_component function
* Add method to find Component with lazy variadic input or all inputs with defaults
* Simplify _find_next_runnable_lazy_variadic_or_default_component
* Remove unnecessary type ignore
* Split _dequeue_components_that_received_no_input into separate functions
* Fix linting
* Simplify variadic check when running Component
* Simplify code
* Reorganize functions used by Pipeline.run
* Rename variables used in Pipeline.run() for clarity
* Add comment clarifying last_waiting_queue and before_last_waiting_queue
* Add functions to easily update waiting_queue
---------
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
2024-07-12 08:35:23 +00:00
David S. Batista
d02356fe7a
chore: normalise the use of importlib in getting an object from a qualified name string across the codebase ( #8012 )
...
* initial import
* cleaning up
* removing unused imports
2024-07-11 16:14:00 +02:00