Madeesh Kannan
|
fe60eedee9
|
fix: Fix deserialization of pipelines that contain LLMEvaluator subclasses (#7891)
|
2024-06-19 13:47:38 +02:00 |
|
David S. Batista
|
38747ff7a3
|
fix: failsafe for non-valid json and failed LLM calls (#7723)
* wip
* initial import
* adding tests
* adding params
* adding safeguards for nan in evaluators
* adding docstrings
* fixing tests
* removing unused imports
* adding tests to context and faithfullness evaluators
* fixing docstrings
* nit
* removing unused imports
* adding release notes
* attending PR comments
* fixing tests
* fixing tests
* adding types
* removing unused imports
* Update haystack/components/evaluators/context_relevance.py
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
* Update haystack/components/evaluators/faithfulness.py
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
* attending PR comments
---------
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
|
2024-05-23 15:41:29 +00:00 |
|
David S. Batista
|
798dc4a4a5
|
fix: avoid FaithfulnessEvaluator and ContextRelevanceEvaluator return Nan (#7685)
* initial import
* fixing tests
* relaxing condition
* adding safeguard for ContextRelevanceEvaluator as well
* adding release notes
|
2024-05-14 17:08:51 +02:00 |
|
Massimiliano Pippi
|
10c675d534
|
chore: add license header to all modules (#7675)
* add license header to modules
* check license header at linting time
|
2024-05-09 13:40:36 +00:00 |
|
Julian Risch
|
2509eeea7e
|
refactor: Rename FaithfulnessEvaluator input responses to predicted_answers (#7621)
|
2024-04-30 16:30:57 +02:00 |
|
Julian Risch
|
07307709ee
|
test: Make FaithfulnessEvaluator integration test more robust (#7582)
|
2024-04-23 15:44:00 +00:00 |
|
Julian Risch
|
d7638cfd4b
|
refactor: FaithfulnessEvaluator specifies inputs explicitly (#7548)
* specify inputs explicitly. move out examples
* Update haystack/components/evaluators/faithfulness.py
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
---------
Co-authored-by: Madeesh Kannan <shadeMe@users.noreply.github.com>
|
2024-04-22 12:52:10 +00:00 |
|
Julian Risch
|
9d02dc607a
|
feat: Add FaithfulnessEvaluator component (#7424)
* draft FaithfulnessEvaluator
* reno
* calculate score per statement and aggregate
* Update release note
* update default values in tests and fix import path
* remove instructions, inputs, outputs params
* remove unused imports
* add expected format example to docstring
* remove name 'llm' from tests and docstring
|
2024-04-04 16:33:59 +00:00 |
|