haystack

mirror of https://github.com/deepset-ai/haystack.git synced 2025-07-18 22:42:24 +00:00

Author	SHA1	Message	Date
Daniel Bichuetti	d715d0202d	fix: update ChromeDriver options on restricted environments and add ChromeDriver options as function parameter (#3043 ) * Fix when env does not exist * Fix missed line * Set conservative chromedriver options * Set default options based on environment * Fix removed line * Updated documentation * Generate new schemas manually * Add arguments via iterator and helper function * Pre-push doc format * Use imported Option vs full namespace access * Manually update schema * Manually add documentation and schema * Fix language and documentation * Fix typo * Auto generated docs * Updated documentation	2022-08-22 12:59:33 +02:00
Daniel Bichuetti	d5e36ce6b4	fix(translator): write translated text to output documents, while keeping input untouched (#3077 ) * Set translated text on a copy of original document * Return new translated list * Manually generated docs TODO: check pre-commit * Hook generated file * Rename variables for better maintenance * fix(translator): prevent inputs from being changed * fix: manual update translator docs * style(translator): explicit type declaration on List * docs(translator): re-run pre-commit hook * style(translator): ignore mypy wrong type check * docs(translator): re-run pre-commit hook	2022-08-22 04:07:05 -04:00
Julian Risch	bc6f71b5ba	chore: increase version to next release candidate (#3067 ) * increase version to next release candidate * generate schema files	2022-08-19 14:49:50 +02:00
Julian Risch	eb0f0da0fd	Prepare 1.7.1 release (#3061 ) * prepare 1.7.1 release * Fix schemas * Update haystack/json-schemas/haystack-pipeline-1.7.1.schema.json Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * change back main to master * remove newline at end of file * generate schema file with no newline Co-authored-by: ZanSara <sarazanzo94@gmail.com> Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai>	2022-08-19 13:24:40 +02:00
tstadel	1027ab3624	Bump Version to 1.7.1rc (#3041 ) * bump version to 1.7.1rc * update openapi	2022-08-18 10:31:57 +02:00
tstadel	baefd32b6f	Upgrade to v1.7.0 and copy docs folder (#3014 ) * update version to 1.7.0 * copy docs * update openapi * generate schemas * make update_json_schema() idempotent * update docs, schema and openapi	2022-08-15 14:20:30 +02:00
Julian Risch	d61755322f	chore: fix typo in API docs (#3023 ) * chore: fix typo in API docs * fix openapi Co-authored-by: Thomas Stadelmann <thomas.stadelmann@deepset.ai>	2022-08-15 13:25:20 +02:00
tstadel	0aa0c68785	Fix broken `MultiLabel` serialization (#3037 ) * Fix MultiLabel serialization * update docs * better comment * remove unused imports * remove unused imports (2)	2022-08-15 13:09:18 +02:00
Branden Chan	ff38a20863	docs: update File Classifier Docstring (#3018 ) * Update docstring * Trigger pre-commit hook * Trigger pre-commit hook * Incorporate reviewer feedback * Incorporate reviewer feedback	2022-08-15 12:37:28 +02:00
Branden Chan	7312f99584	Update Summarizer Docs (#3032 ) * Change text to content * Change text to content	2022-08-15 12:35:41 +02:00
bogdankostic	3a849d6c07	bug: Make `TranslationWrapperPipeline` work with `QuestionAnswerGenerationPipeline` (#3034 ) * Overwrite output_translator's run method with run_batch * Fix mypy * Revert change * Overwrite run method only with QuestionAnswerGenerationPipeline	2022-08-15 10:05:34 +02:00
Dmitry Goryunov	da7836a931	feat: Support embedding dimensions on DeepsetCloudDocumentStore (#2995 ) * Add embedding_dim to dc store * Remove similarity from query params, it is not used * Remove unused `return_embedding` parameter * Remove unused param * Update the documentation * Update schemas * Revert openapi changes * Revert openapi changes * Fix openapi * Fix json schema * Improve docstrings Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com> * Improve logs Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com> * Update the docs * Fix similarity Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>	2022-08-12 11:46:52 +02:00
James Briggs	26c938a8e6	test: add meta fields for meta_config to be used during testing (#3021 ) * added meta fields for meta_config to be used during realtime testing of PineconeDocumentStore * Add documentation on metadata filtering in docstring * docs Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai>	2022-08-12 10:27:56 +02:00
Sebastian	44e2b1beed	Resolving issue 2853: no answer logic in FARMReader (#2856 ) * Update FARMReader.eval_on_file to be consistent with FARMReader.eval * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-08-11 16:45:03 +02:00
Zoltan Fedor	408d8e6ff5	Enable the `JoinDocuments` node to work with documents with `score=None` (#2984 ) * Enable the `JoinDocuments` node to work with documents with `score=None` This fixes #2983 As of now, the `JoinDocuments` node will error out if any of the documents has `score=None` - which is possible, as some retrievers are not able to provide a score, like the `TfidfRetriever` on Elasticsearch or the `BM25Retriever` on Weaviate. The reason for the error is that the `JoinDocuments` always sorts the documents by score and cannot sort when `score=None`. There was a very similar issue for `JoinAnswers` too, which was addressed by this PR: https://github.com/deepset-ai/haystack/pull/2436 This solution applies the same solution to `JoinDocuments` - so both the `JoinAnswers` and `JoinDocuments` now will have the same additional argument to disable sorting when that is required. The solution is to add an argument to `JoinDocuments` called `sort_by_score: bool`, which allows the user to turn off the sorting of documents by score, but keeps the current functionality of sorting being performed as the default. * Fixing test bug * Addressing PR review comments - Extending unit tests - Simplifying logic * Making the sorting work even with no scores By making the no score being sorted as -Inf * Forgot to commit the change in `join_docs.py` * [EMPTY] Re-trigger CI * Added an INFO log if the `JoinDocuments` is sorting while some of the docs have `score=None` * Adjusting the arguments of `any()` * [EMPTY] Re-trigger CI	2022-08-11 10:43:25 +02:00
Massimiliano Pippi	2cd65e99b8	revert Remove pipes (#3006 )	2022-08-11 10:42:22 +02:00
Zoltan Fedor	f4128d3581	Adding support for additional distance/similarity metrics for Weaviate (#3001 ) * Adding support for additional distance metrics for Weaviate Fixes #3000 * Updating the docs * Fixing error texts * Fixing issues raised by the review * Addressing the last issue from the reviews - removing test `test_weaviate.py::test_similarity` * [EMPTY] Re-trigger CI * Fixing things based on review * [EMPTY] Re-trigger CI	2022-08-11 09:48:21 +02:00
bogdankostic	5c3bfad078	feat: Add page number to Documents coming from PDFConverters and PreProcessor (#2932 ) * Add page number to Documents coming from PDFConverters and PreProcessor * Fix mypy * Update API Docs * Update API Docs * Remove unused imports * Generate JSON schema * Generate JSON schema * Make test variable shorter * Make regex a separate function * Move counting of page breaks to a function * Generate JSON schema * Apply suggestions from code review Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com> * Update API Documentation * Don't create instance for testing staticmethod * Update haystack/nodes/preprocessor/preprocessor.py Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com> Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>	2022-08-09 15:55:27 +02:00
Branden Chan	dfeb171686	Add API page for util functions (#2863 ) * Clean OpenAIAnswerGenerator docstrings * Incorporate reviewer feedback * Update Documentation & Code Style * Improve id_hash_keys description * Simplify id_hash_keys description * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-08-09 14:53:45 +02:00
Stefano Fiorucci	4a63484916	feat: Extend `TransformersQueryClassifier`: clean version (#2965 ) * extend query classifier in one commit * variable number of outgoing edges * improve tests * fix unused import * lightweight approach * fix _calculate_outgoing_edges * remove duplicate label validation * Remove print	2022-08-09 09:43:33 +02:00
MichelBartels	c91316e862	feat: add gradient accumulation in FARMReader (#2925 ) * expose gradient accumulation to train function of FARMReader * add documentation for gradient accumulation * Update Documentation & Code Style * doc string improvements Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com> * doc string improvements Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com> * doc string improvements Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com> * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Julian Risch <julian.risch@deepset.ai> Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>	2022-08-08 18:42:21 +02:00
Vladimir Blagojevic	d1f8b7118c	Add progress bar to batch run component ops (#2864 ) * Add progress bar to batch run component ops * Update docs * Update schema * PR review: thanks Bogdan	2022-08-08 09:32:44 -04:00
Sara Zan	1a0a4c8836	Remove pipes from code block (#2973 ) * Remove pipes * Generate md	2022-08-05 19:18:57 +02:00
Vladimir Blagojevic	4f8d11c591	Update Seq2SeqGenerator API documentation (#2970 ) * Seq2SeqGenerator - update API docs	2022-08-05 17:39:23 +02:00
Vladimir Blagojevic	762a12fcb1	Print eval reports improvements (#2941 )	2022-08-04 11:21:27 -04:00
Bilge Yücel	489699bd98	Fix docs code format for sentence transformers (#2957 ) Co-authored-by: bilge4 <bilge@techwolf.ai>	2022-08-04 12:31:42 +02:00
Vladimir Blagojevic	368828fd4a	Component batch_size should be defined rather than Optional (#2958 ) * Ensure batch_size for components is defined rather than Optional * PR review - update schema	2022-08-04 12:20:28 +02:00
Francesco Castelli	1b238c880b	Generalize <sep>, <pad> and </s> tokens of QuestionGenerator node (#2769 ) * fixed tokens in question generation * simplified assignment * same behavior also for pad and eos * use skip_special_tokens in batch_decode * fixed black error and update docs * fixed schemas ci error * JSON schemas * Add git diff to debug schema issues * opensearch schema was missing * Add missing instruction in the workflow error message * typo	2022-08-03 18:51:34 +02:00
Zoltan Fedor	1e20818328	Ability to run Ray Serve detached (#2945 ) * Ability to run Ray Serve detached Fixes #2944 Ability to run Ray Serve detached - to allow running multiple instances of the app (HA). See https://docs.ray.io/en/latest/serve/package-ref.html#core-apis * Generating the docs * Re-trigger the CI pipeline * Retrigger the CI Pipeline * Typo in docstrings * Fixing docstring and typing issues * Regenerating docs * [EMPTY] Re-trigger CI * [EMPTY] Re-trigger CI * Refactoring to allow any number of args for the `serve.start()` method There seems to be additional arguments of the `serve.start()` method, so we should probably cover all of them at once, instead of only the `detached` option. * [EMPTY] Re-trigger CI * Test whether the ServeControllerClient in fact has the supplied `detached` parameter	2022-08-03 18:49:03 +02:00
Zoltan Fedor	7b97bbbff0	Extending the Ray Serve integration to allow attributes for Serve deployments (#2918 ) * Extending the Ray Serve integration to allow attributes for Serve deployments This closes #2917 We should be able to set Ray Serve attributes for the nodes of pipelines, like amount of GPU to use, max_concurrent_queries, etc. Now this is possible from the pipeline yaml file for each node of the pipeline. * Ran black and regenerated the json schemas * Fixing the JSON Schema generation * Trying to fix the schema CI test issue * Fixing the test and the schemas Python 3.8 was generating a different schema than Python 3.7 is creating in the CI. You MUST use Python 3.7 to generate the schemas, otherwise the CIs will fail. * Merge the two Ray pipeline test cases * Generate the JSON schemas again after `$ pip install .[all]` * Removing `haystack/json-schemas/haystack-pipeline-1.16.schema.json` This was generated by the JSON generator, but based on @ZanSara's instructions, I am removing it. * Making changes based on @ZanSara's request - the newly requested test is failing * Fixing the JSON schema generation again * Renaming `replicas` and moving it under `serve_deployment_kwargs` * add extras validation, untested * Documentation update * Black * [EMPTY] Re-trigger CI Co-authored-by: Sara Zan <sarazanzo94@gmail.com>	2022-08-03 16:38:22 +02:00
tstadel	2c56305ed3	Fix serialization of numpy arrays and pandas dataframes in REST API (#2838 ) * correct serialization of numpy arrays and pandas dataframes * Update Documentation & Code Style * set additional json_encoders globally * Update Documentation & Code Style * add tests for non primitive return types Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-08-02 09:49:32 +02:00
Massimiliano Pippi	e7627c3f8b	Use opensearch-py in OpenSearchDocumentStore (#2691 ) * add Opensearch extras * let OpenSearchDocumentStore use opensearch-py * Update Documentation & Code Style * fix a bug found after adding tests Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai>	2022-07-28 10:04:49 +02:00
Zoltan Fedor	adb2b2c312	Add support for BM25 with the Weaviate document store (#2860 ) * Upgrading Weaviate used for testing to 1.14.1 from 1.11.0 This has also brought up an issue with one of the test filtering for value "a". This test has started to fail, as "a" is a default stopword in Weaviate, so I have changed this test to look for value "c" instead of value "a" to get around the stopword issue. * Weaviate client upgrade From v3.3.3 to v3.6.0 * Adding BM25 Retrieval to Weaviate Weaviate now supports BM25 retrieval in experiment mode and with some limitations (like it cannot be combined with filters). This commit adds support for inverted index (BM25) querying against Weaviate. * Running Black on the recent code changes * Update Documentation & Code Style * Fixing linting issues after code changes by black * The BM25 query needs to be in all lowercase for now The BM25 query needs to be provided all lowercase while the functionality is in experimental mode in Weaviate. See https://app.slack.com/client/T0181DYT9KN/C017EG2SL3H/thread/C017EG2SL3H-1658790227.208119 * Fixing method parameter docstring to highlight that they are not supported in Weaviate * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-27 10:07:13 +02:00
Sara Zan	4e45062a00	Simplify `language_modeling.py` and `tokenization.py` (#2703 ) * Simplification of language_model.py and tokenization.py to remove code duplication Co-authored-by: vblagoje <dovlex@gmail.com>	2022-07-22 16:29:30 +02:00
tstadel	11c46006df	Fix corrupted csv from `EvaluationResult.save()` (#2854 ) * fix corrupted csv if text contains \r chars; make csv serialization configurable * Update Documentation & Code Style * incorporate feedback * Update Documentation & Code Style * adjust columns to be converted during loading Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-21 16:31:07 +02:00
Daniel Bichuetti	3948b997b2	Add support for custom trained PunktTokenizer in PreProcessor (#2783 ) * Add support for model folder into BasePreProcessor * First draft of custom model on PreProcessor * Update Documentation & Code Style * Update tests to support custom models * Update Documentation & Code Style * Test for wrong models in custom folder * Default to ISO names on custom model folder Use long names only when needed * Update Documentation & Code Style * Refactoring language names usage * Update fallback logic * Check unpickling error * Updated tests using parametrize Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * Refactored common logic * Add format control to NLTK load * Tests improvements Add a sample for specialized model * Update Documentation & Code Style * Minor log text update * Log model format exception details * Change pickle protocol version to 4 for 3.7 compat * Removed unnecessary model folder parameter Changed logic comparisons Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * Update Documentation & Code Style * Removed unused import * Change errors with warnings * Change to absolute path * Rename sentence tokenizer method Co-authored-by: tstadel * Check document content is a string before process * Change to log errors and not warnings * Update Documentation & Code Style * Improve split sentences method Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> * Update Documentation & Code Style * Empty commit - trigger workflow * Remove superfluous parameters Co-authored-by: tstadel * Explicit None checking Co-authored-by: tstadel Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai>	2022-07-21 09:50:45 +02:00
Julian Risch	f599ce9458	Change "text" to "content" as dict key (#2800 ) * change "text" to "content" as dict key * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-13 16:36:06 +02:00
Daniel Augustus Bichuetti Silva	77a513fe49	Fix crawler long file names (#2723 ) * Changing the name that crawled page is saved to avoid long file names error on some file systems * Custom naming function for saving crawled files * Update Documentation & Code Style * Remove bad characters on file name and prefix * Add test for naming function * Update Documentation & Code Style * Fix expensive regex recalculation and linter warns * Check for exceptions on file dump * Remove param_naming variable * Fix file paths on Windows, Linux and Mac * Update Documentation & Code Style * Test using one of the docstrings examples * Change default naming function Update docstrings * Applying formatting rules * Update Documentation & Code Style * Fix mypy incompatible assignment error * Remove unused type declaration * Fix typo * Update tests for naming function * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-11 12:16:32 +02:00
bogdankostic	195aed942f	Add `update_document_meta` to `InMemoryDocumentStore` (#2689 ) * Add update_document_meta to InMemoryDocumentStore * Fix typo * Update Documentation & Code Style * Add update_document_meta to BaseDocumentStore * Update Documentation & Code Style * Fix mypy * Update Documentation & Code Style * Add update_document_meta to MockDocumentStore Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-07 15:44:07 +02:00
Vladimir Blagojevic	a2905d05f7	Bump version to next release candidate (#2765 ) * Bump version to next release candidate * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-06 11:26:42 +02:00
Vladimir Blagojevic	c80336c424	Upgrade to v1.6.0 and copy docs folder (#2764 ) * Upgrade to v1.6.0 and copy docs folder * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-06 10:25:15 +02:00
Patrick Deutschmann	1db3fd0942	Add support for Multi-Hop Dense Retrieval (#2571 ) * Implement MDR * Adapt conftest to new MDR signature * Update Documentation & Code Style * Change signature of queries param in batch methods of MDR like in #2575 * Update Documentation & Code Style * Rename MultihopDenseRetriever to MultihopEmbeddingRetriever * Fix filters in retrieve_batch * Add docstring for MultihopEmbeddingRetriever.__init__ * Update Documentation & Code Style * Revert forward signature of TextSimilarityHead Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-05 11:31:11 +02:00
Tuana Celik	2a8b129bae	first version of save_to_remote for HF from FarmReader (#2618 ) * first version of save_to_remote for HF from FarmReader * Update Documentation & Code Style * Changes based on comments * Update Documentation & Code Style * imports order * making small changes to pydoc * indent fix * Update Documentation & Code Style * keyword arguments instead of positional * Changing to repo_id huggingface-hub package would have to be v0.5 or higher - checking how to handle with Thomas * Update Documentation & Code Style * adding huggingface-hub dependency 0.5 or above Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Sara Zan <sarazanzo94@gmail.com>	2022-07-04 15:39:56 +02:00
Julian Risch	1c1faa4742	Make check of document & embedding count optional in FAISS and Pinecone (#2677 ) * make validation optional & add method call in pinecone init * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-04 10:12:31 +02:00
Daniel Augustus Bichuetti Silva	e3b2ee956a	Improved crawler support for dynamically loaded pages (#2710 ) * Improved crawler support for dynamically loaded pages * Reduced scope of StaleElementReferenceException and removed deprecated code from WebDriver initialization * Improvements on crawler testing code * Code format and style applied on f028331948c170448613e86dfdfa222f7c2043fd * Update Documentation & Code Style * Remove unused imports/parameters Co-authored-by: Sara Zan <sara.zanzottera@deepset.ai> Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-07-01 10:47:33 +02:00
mathislucka	8d65bc5f9b	Update document scores based on ranker node (#2048 ) * ranker should return scores for later usage * fix wrong tuple order * adjust ranker scores; add tests * Update Documentation & Code Style * fix mypy * Update Documentation & Code Style * fix mypy * Update Documentation & Code Style * relax ranker test tolerance * update ranker test score Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Julian Risch <julian.risch@deepset.ai>	2022-06-27 12:17:18 +02:00
tstadel	1168f6365d	Fix using id_hash_keys as pipeline params (#2717 ) * Fix using id_hash_keys as pipeline params * Update Documentation & Code Style * add tests Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-06-24 09:55:09 +02:00
Massimiliano Pippi	79b287b568	Extract common code for ES and OS into a base class (#2664 ) * extract common code for ES and OS into a base class * Update Documentation & Code Style * give the base class a more obvious name * Update Documentation & Code Style Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-06-20 09:47:44 +02:00
MichelBartels	964e6cdafb	Fix JoinAnswer/JoinNode (#2612 ) * fix join nodes * Update Documentation & Code Style * fix unused import * change arg order * Update Documentation & Code Style * fix kwargs check * add warning when there is only one input node * Update Documentation & Code Style * fix type hint * fix wrong import order * Update Documentation & Code Style * undo kwargs * add accidentally deleted newline# * fix type hint * fix type hint Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>	2022-06-17 16:29:15 +02:00
Sara Zan	584e046642	`AnswerToSpeech` (#2584 ) * Add new audio answer primitives * Add AnswerToSpeech * Add dependency group * Update Documentation & Code Style * Extract TextToSpeech in a helper class, create DocumentToSpeech and primitives * Add tests * Update Documentation & Code Style * Add ability to compress audio and more tests * Add audio group to test, all and all-gpu * fix pylint * Update Documentation & Code Style * Accidental git tag * Try pleasing mypy * Update Documentation & Code Style * fix pylint * Add warning for missing OS library and support in CI * Try fixing mypy * Update Documentation & Code Style * Add docs, simplify args for audio nodes and add tutorials * Fix mypy * Fix run_batch * Feedback on tutorials * fix mypy and pylint * Fix mypy again * Fix mypy yet again * Fix the ci * Fix dicts merge and install ffmpeg on CI * Make the audio nodes import safe * Trying to increase tolerance in audio test * Fix import paths * fix linter * Update Documentation & Code Style * Add audio libs in unit tests * Update _text_to_speech.py * Update answer_to_speech.py * Use dedicated dataset & update telemetry * Remove and use distilled roberta * Revert special primitives so that the nodes run in indexing * Improve tutorials and fix smaller bugs * Update Documentation & Code Style * Fix serialization issue * Update Documentation & Code Style * Improve tutorial * Update Documentation & Code Style * Update _text_to_speech.py * Minor lg updates * Minor lg updates to tutorial * Making indexing work in tutorials * Update Documentation & Code Style * Improve docstrings * Try to use GPU when available * Update Documentation & Code Style * Fix mypy and pylint * Try to pass the device correctly * Update Documentation & Code Style * Use type of device * use .cpu() * Improve .ipynb * update apt index to be able to download libsndfile1 * Fix SpeechDocument.from_dict() * Change pip URL Co-authored-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: Agnieszka Marzec <97166305+agnieszka-m@users.noreply.github.com>	2022-06-15 10:13:18 +02:00
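
Note on commit 408d8e6ff5 (#2984): the message describes sorting joined documents by score while treating a missing score as -Inf, with a `sort_by_score` argument to switch sorting off. The snippet below is only a minimal sketch of that idea, not the code from `join_docs.py`. The `Doc` class and the `join_and_sort` function are hypothetical stand-ins invented for illustration; only the `sort_by_score` name and the -Inf behaviour come from the commit message.

```python
import math
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Doc:
    """Hypothetical stand-in for a document; not Haystack's Document class."""
    content: str
    score: Optional[float] = None


def join_and_sort(docs: List[Doc], sort_by_score: bool = True) -> List[Doc]:
    """Return the documents, optionally sorted by descending score.

    Documents with score=None are ranked as if their score were -inf,
    so they land at the end instead of raising a TypeError on comparison.
    """
    if not sort_by_score:
        # Keep the incoming order untouched when sorting is disabled.
        return list(docs)
    return sorted(
        docs,
        key=lambda d: d.score if d.score is not None else -math.inf,
        reverse=True,
    )


docs = [Doc("a", 0.2), Doc("b"), Doc("c", 0.9)]
ranked = join_and_sort(docs)
unsorted = join_and_sort(docs, sort_by_score=False)
```

Mapping `None` to `-math.inf` in the sort key keeps every comparison between floats, which is why the sort no longer fails when some retrievers return no score.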

282 Commits