haystack/test/test_requirements.txt
Madeesh Kannan e6d6ce1c73
feat: Add NamedEntityExtractor component (#6689)
* feat: Add `NamedEntityExtractor` component

This component accepts a list of `Document`s which it annotates with named entities. The annotations are stored in the `meta` dictionary of each `Document` under a specific key.

The component currently supports two backends for the annotation models: Hugging Face `transformers` and spaCy.

* Address comments

* Expand release note

* Add the `[torch]` extra package specifier to the lazy import

* Remove dead code
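
The annotation pattern described for `NamedEntityExtractor` (entities written into each `Document`'s `meta` dictionary under a fixed key) can be illustrated with a minimal, library-free sketch. This is not the component's implementation: the `Doc` class, the `annotate` function, the key name `named_entities`, and the substring lookup are all hypothetical stand-ins, and a real backend would run a `transformers` or spaCy model instead.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List

# Hypothetical key name; the commit message does not state the real one.
ANNOTATION_KEY = "named_entities"


@dataclass
class Doc:
    """Stand-in for a Document: text content plus a free-form meta dictionary."""
    content: str
    meta: Dict[str, Any] = field(default_factory=dict)


def annotate(docs: List[Doc], known_entities: Dict[str, str]) -> List[Doc]:
    """Toy annotator that finds known names by exact substring match.

    A real backend would run a named-entity model here instead.
    """
    for doc in docs:
        found = []
        for name, label in known_entities.items():
            start = doc.content.find(name)
            if start != -1:
                found.append({"entity": label, "start": start, "end": start + len(name)})
        # Store the annotations in the document's meta under the fixed key.
        doc.meta[ANNOTATION_KEY] = found
    return docs


docs = annotate([Doc("Berlin is in Germany.")], {"Berlin": "LOC", "Germany": "LOC"})
for annotation in docs[0].meta[ANNOTATION_KEY]:
    print(annotation)
```

Keeping the annotations in `meta` leaves the document's content untouched, so downstream components that only read the text are unaffected.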

---------

Co-authored-by: Massimiliano Pippi <mpippi@gmail.com>
2024-01-09 17:56:20 +01:00


.[dev]
# Package Components
transformers[torch,sentencepiece]==4.36.2 # ExtractiveReader, TransformersSimilarityRanker, LocalWhisperTranscriber, HFGenerators...
spacy>=3.7,<3.8 # NamedEntityExtractor
spacy-curated-transformers>=0.2,<=0.3 # NamedEntityExtractor
https://github.com/explosion/spacy-models/releases/download/en_core_web_trf-3.7.3/en_core_web_trf-3.7.3-py3-none-any.whl # NamedEntityExtractor
# Converters
pypdf # PyPDFConverter
markdown-it-py # MarkdownToDocument
mdit_plain # MarkdownToDocument
tika # TikaDocumentConverter
azure-ai-formrecognizer>=3.2.0b2 # AzureOCRDocumentConverter
langdetect # TextLanguageRouter and DocumentLanguageClassifier
sentence-transformers>=2.2.0 # SentenceTransformersTextEmbedder and SentenceTransformersDocumentEmbedder
openai-whisper>=20231106 # LocalWhisperTranscriber
# OpenAPI
jsonref # OpenAPIServiceConnector, OpenAPIServiceToFunctions
openapi3