33 Commits

Author SHA1 Message Date
Massimiliano Pippi
10c675d534
chore: add license header to all modules (#7675)
* add license header to modules
* check license header at linting time
2024-05-09 13:40:36 +00:00
David S. Batista
1b10a8304f
proposal: rag evaluation results presentation (#7462)
* adding files

* adding proposal in md

* renaming proposal number

* removing stuff

* cleaning up

* adding PR number and issue

* updating proposal

* updating proposal

* Update proposals/text/7462-rag-evaluation.md

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>

* changing name

* PR comments

* changing output to table format

* adding user stories

* Update proposals/text/7462-rag-evaluation.md

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>

* adding user stories

---------

Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
2024-04-08 18:37:48 +02:00
Madeesh Kannan
5c8feeac6a
proposal: Integration of 3rd party evaluation frameworks (#6784)
* proposal: Integration of 3rd party evaluation frameworks

* Add note about previous eval proposal
2024-01-22 12:35:27 +01:00
Silvano Cerza
e6637f5ec2 Fix all tests 2023-11-24 14:48:43 +01:00
SebastjanPrachovskij
28c2b09d90
Add SearchApi integration for websearch (#6400) 2023-11-24 11:18:43 +01:00
Silvano Cerza
7287657f0e
refactor: Rename Document's text field to content (#6181)
* Rework Document serialisation

Make Document backward compatible

Fix InMemoryDocumentStore filters

Fix InMemoryDocumentStore.bm25_retrieval

Add release notes

Fix pylint failures

Enhance Document kwargs handling and docstrings

Rename Document's text field to content

Fix e2e tests

Fix SimilarityRanker tests

Fix typo in release notes

Rename Document's metadata field to meta (#6183)

* fix bugs

* make linters happy

* fix

* more fix

* match regex

---------

Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
2023-10-31 12:44:04 +01:00
Domenico
6bddc5c78a
fix: missing closing quotation marks (#6195)
The proposal was missing closing quotation marks so it was formatted badly
2023-10-30 10:34:55 +01:00
Domenico
4196102a56
proposal: meta field ranker (#6141)
* proposal: meta field ranker

* Apply suggestions from code review

Co-authored-by: ZanSara <sarazanzo94@gmail.com>

* update proposal filename

* feat: add metafield ranker

* Revert "feat: add metafield ranker"

This reverts commit be760d8b037a3e1a37539c8002edde9d322c874a.

---------

Co-authored-by: ZanSara <sarazanzo94@gmail.com>
2023-10-30 09:24:23 +01:00
Silvano Cerza
92ae169bdf
Proposal: Document Stores filter specification for Haystack 2.x (#6001)
* Filters rework proposal

* Update proposal with received feedback
2023-10-16 09:26:23 +02:00
Silvano Cerza
0cb9abb1c2
Rename proposal to respect specifications (#6002) 2023-10-09 11:24:19 +02:00
Silvano Cerza
35ec8cc8fb
Rework evaluation and metrics calculation for Haystack 2.x (#5794)
* draft requirements from discussion

* Add some more information

* Update proposal given new feedback

* More drawbacks

* Decision drivers

* Nitpick

* Summary

* PR number

* Mark code snippets

Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>

* Link correct issue

* Add missing word

* More context on blind evaluation

* Rephrase confusing sentence

* Add a more detailed code example

* Ignore mypy and pylint in example file

---------

Co-authored-by: Julian Risch <julian.risch@deepset.ai>
Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>
2023-09-28 00:51:51 +02:00
Christian Clauss
6dd52d91b2
ci: Fix typos discovered by codespell (#5778)
* Fix typos discovered by codespell

* pylint: max-args = 38
2023-09-13 16:14:45 +02:00
Christian Clauss
23f7308bec
ci: pre-commit autoupdate (#5777) 2023-09-12 14:34:41 +02:00
ZanSara
6e70d403f8
feat: Improve Document for Haystack 2.0 (#5738)
* initial draft

* tests

* add proposal

* proposal number

* reno

* fix tests and usage of content and content_type

* update branch & fix more tests

* mypy

* add docstring

* fix more tests

* review feedback

* improve __str__

* Apply suggestions from code review

Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>

* Update haystack/preview/dataclasses/document.py

Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>

* improve __str__

* fix tests

* fix more tests

* Update haystack/preview/document_stores/memory/document_store.py

---------

Co-authored-by: Daria Fokina <daria.fokina@deepset.ai>
2023-09-11 17:40:00 +02:00
Massimiliano Pippi
24b8cfb1c7
Update 3558-embedding_retriever.md (#5705) 2023-09-04 11:28:51 +02:00
ZanSara
4dda25d67c
proposal: LLM support in Haystack 2.0 (#5540)
* Add proposal

* add pr number

* file name

* clarify input of LLM component

* promptbuilder is tokenizer-aware

* typo

* feedback

* streaming

* Chat API
2023-08-28 10:33:07 +02:00
Stefano Fiorucci
52133d3a81
proposal: Embedders design (#5390)
* first draft

* rename

* refinements

* added clarifications

* improvements

* improvements

* improvements

* further improvements

* fix typo

* Apply suggestions from code review

Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>

* adapt to new Canals I/O

* fix links to previous proposals

* fix

* add migration example: update_embeddings

* rename EmbeddingService to EmbeddingBackend

---------

Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
2023-08-09 17:09:30 +02:00
elundaeva
e0cf1421c6
proposal: Add RecentnessRanker component (#5289)
proposal for adding Recentness Ranker to Haystack
2023-07-17 16:33:47 +02:00
ZanSara
651be37afc
proposal: DocumentStores and Retrievers (#4370)
* add proposal

* add proposal

* pr number

* pr number

* start second draft

* second draft

* node examples

* phrasing

* get_documents -> filter_documents
2023-03-28 16:31:42 +02:00
ZanSara
8ea7ba3a94
proposal: drop BaseComponent and re-implement Pipeline (#4284)
* draft proposal

* pr number

* reminder for an agent pipeline example

* proposal number

* add real query pipeline

* add paragraph on validation

* wording

* add_store

* decorator

* add rollout process and parameter's hierarchy examples

* rename project into application

* feedback from the meeting

* defer evaluation to another proposal

* smaller changes

* remove applications for now

* u-turn on pipeline.connect()

* typo

* connect_from/to

* update with Malte's feedback
2023-03-13 10:05:59 +01:00
tstadel
d096f03230
proposal: Shapers in Prompt Templates (#4172)
* add proposal

* Update 0000-shaper-in-prompt-template.md

* rename proposal file

* update proposal according to feedback

* add clarification about the number of prompts generated

* add section about parsing logic

* Revert "add section about parsing logic"

This reverts commit 904713558706206637eefe1579420d89663f58b8.

* add section about parsing logic

* fix typo

* improved the detailed design section

* fix code section

* chore formatting

* chore formatting

* updated adoption strategy

* final typo and expression changes
2023-03-07 09:52:18 +01:00
Vladimir Blagojevic
4b189c0b40
proposal: Implement Agent demo (#4085)
* Agent demo proposal

* Replace on-the-fly module with WebRetriever

* Update proposal with ideas from discussion with Julian

* Replace SerpAPI references with SearchEngine

* Add Agent memory

* Update Agent memory
2023-02-23 19:56:38 +01:00
Bijay Gurung
79f57d8460
Proposal: Add a JsonConverter node (#3959)
* Add Proposal: JsonConverter

* Add jsonl support + schema to JsonConverter Proposal

* Remove format option from JsonConverter Proposal

---------

Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>
2023-02-09 09:57:00 +01:00
Julian Risch
d819d6badf
proposal: Add Agents for extended LLM support (#3925)
* draft proposal

* add link to colab notebook (api keys required)

* Add alternative name ideas for MRKLAgent

* Breakdown of agent steps

* Added more sections

* Add even more sections

* simplify tool/action mentions, shorten

* agents as new abstraction instead of BaseComponent

* agent tools can be pipelines or nodes

---------

Co-authored-by: Vladimir Blagojevic <dovlex@gmail.com>
2023-02-06 09:47:10 +01:00
Sebastian
96706e9e7b
proposal: TableCell (#3875)
* Initial commit for TableSpan proposal

* Updating the proposal

* More updates to the proposal

* More changes

* Rename of file per Proposal instructions

* Update link

* Adding drawbacks

* Fixing typos

* Changed TableSpan to TableCell and updated proposal based on discussions.

* Adding discussion on identified bug.

* Rename proposal to reflect name change made during discussion. Added point to make it clear that we will be able to return a List of TableCells

* Update proposal with discussion about storing table as a list of lists

* Adding some additional code change descriptions.
2023-02-01 09:08:12 +01:00
ZanSara
3ffdb0a9a3
chore: fix all EOF (#3852)
* fix all eof

* fix test

* fix test

* fix test

* typo

* fix sample

* fix sample

* add logs

* fix page_dynamic_result.txt
2023-01-16 12:34:50 +01:00
Benjamin BERNARD
15203d864b
docs: Proposal - CSV FAQ indexing feature (#3638)
* docs(proposal): Add new proposal about CSV FAQ indexing feature

* docs(proposal): Add new proposal about CSV FAQ indexing feature

Introduce PR number.

* Review feedback

* Mixed up the PR numbers

Co-authored-by: ZanSara <sara.zanzottera@deepset.ai>
2023-01-12 11:07:26 +01:00
Vladimir Blagojevic
ccda51fb43
proposal: Shaper pipeline component (#3784)
* Add InputOutputShaper proposal

* Add security section

* Rename to Shaper, small additions

* Rewording, rename contract_docs to concat
2023-01-11 18:50:12 +05:30
Massimiliano Pippi
2904587d4f
proposal: Create a dedicated Github repository for Haystack demos (#3695)
* first draft

* add PR number and motivations

* mention HSH

* review feedback

* Update 3695-demo-repository.md
2022-12-22 10:09:46 +01:00
Massimiliano Pippi
8edfd8978e
Update the proposals process (#3718)
* update the proposals process

* add stalebot to manage proposals lifecycle

* typo

* Update 0000-template.md

* clarify PR labelling staying away from implementation details
2022-12-19 14:35:07 +01:00
Vladimir Blagojevic
c69222faf4
Add PromptNode proposal (#3665) 2022-12-16 10:27:58 +01:00
Vladimir Blagojevic
c28f6688f5
proposal: New EmbeddingRetriever for Haystack 2.0 (#3558)
* Add EmbeddingRetriever proposal

* Update with Sara's feedback

* Consistent naming
2022-12-12 10:06:35 +01:00
Massimiliano Pippi
da6b0dc66f
feat: introduce proposal design process (#3333)
* add RFC process

* migrate old ADR to the new process

* typo

* review comments

* Apply suggestions from code review

Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>

* [skip ci] review feedback

* Apply suggestions from code review

Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>

* [skip ci] leftover

* rename to proposals

* Adjust naming

* Update 2170-pydantic-dataclasses.md

Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>
Co-authored-by: Malte Pietsch <malte.pietsch@deepset.ai>
2022-11-11 12:49:23 +01:00