yujunjun
  • Joined on 2025-03-09
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Updated 2025-06-27 05:22:29 +00:00
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Updated 2025-06-27 04:14:25 +00:00
Toolkit for linearizing PDFs for LLM datasets/training
Updated 2025-06-27 02:57:26 +00:00
The Metadata Platform for your Data and AI Stack
Updated 2025-06-26 22:58:15 +00:00
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Updated 2025-06-26 22:27:05 +00:00
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
Updated 2025-06-26 20:59:01 +00:00
Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.
Updated 2025-06-26 20:54:04 +00:00
Build Real-Time Knowledge Graphs for AI Agents
Updated 2025-06-26 19:45:13 +00:00
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Updated 2025-06-26 18:33:20 +00:00
🚀 Strapi is the leading open-source headless CMS. It’s 100% JavaScript/TypeScript, fully customizable, and developer-first.
Updated 2025-06-26 16:54:20 +00:00
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Updated 2025-06-26 14:35:17 +00:00
Awesome multilingual OCR and Document Parsing toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Updated 2025-06-26 12:36:33 +00:00
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Updated 2025-06-26 11:28:46 +00:00
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge bases. It can effectively overcome the shortcomings of the traditional RAG vector similarity calculation model.
Updated 2025-06-26 08:51:26 +00:00