Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines
deep-learning
document-image-analysis
document-image-processing
document-parser
document-parsing
docx
donut
information-retrieval
langchain
llm
machine-learning
ml
natural-language-processing
nlp
ocr
pdf
pdf-to-json
pdf-to-text
preprocessing
Updated 2025-06-26 22:27:05 +00:00
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
ai
bert
chatgpt
generative-ai
gpt-3
information-retrieval
language-model
large-language-models
llm
machine-learning
nlp
python
pytorch
question-answering
rag
retrieval-augmented-generation
semantic-search
squad
summarization
transformers
Updated 2025-06-26 14:35:17 +00:00