Get your documents ready for gen AI
ai
convert
document-parser
document-parsing
documents
docx
html
markdown
pdf
pdf-converter
pdf-to-json
pdf-to-text
pptx
tables
xlsx
Updated 2025-06-26 17:52:43 +00:00
Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Updated 2025-06-26 12:36:33 +00:00
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
agent
agents
ai-search
chatbot
chatgpt
deep-learning
deepseek
deepseek-r1
document-parser
document-understanding
genai
graphrag
llm
nlp
ollama
pdf-to-text
rag
retrieval-augmented-generation
table-structure-recognition
text2sql
Updated 2025-06-26 11:28:46 +00:00
Python tool for converting files and office documents to Markdown.
Updated 2025-06-04 04:09:25 +00:00