fabigr8's Stars
rob-med/awesome-TS-anomaly-detection
List of tools & datasets for anomaly detection on time-series data.
unclecode/crawl4ai
🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper
patched-codes/patchwork
Automate code reviews, patching and documentation with self-hosted LLM workflows.
leondz/garak
LLM vulnerability scanner
roboflow/supervision
We write your reusable computer vision tools. 💜
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
AppFlowy-IO/AppFlowy-Cloud
AppFlowy is an open-source alternative to Notion. You are in charge of your data and customizations. Built with Flutter and Rust.
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT/ Claude application.
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
nocodb/nocodb
🔥 🔥 🔥 Open Source Airtable Alternative
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
Lightning-AI/LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
deepset-ai/haystack
:mag: AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
encord-team/encord-active
The toolkit to test, validate, and evaluate your models and surface, curate, and prioritize the most valuable data for labeling.
RUC-NLPIR/FlashRAG
⚡FlashRAG: A Python Toolkit for Efficient RAG Research
ComposioHQ/composio
Composio equip's your AI agents & LLMs with 100+ high-quality integrations via function calling
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
adithya-s-k/marker-api
Easily deployable 🚀 API to convert PDF to markdown quickly with high accuracy.
zhaoxin94/awesome-domain-adaptation
A collection of AWESOME things about domian adaptation
refstudio/refstudio
Ref Studio is an open source integrated writing environment for technical writing
JefMari/awesome-wysiwyg-editors
A curated list of awesome WYSIWYG Editors.
pykeen/pykeen
🤖 A Python library for learning and evaluating knowledge graph embeddings
Doriandarko/maestro
A framework for Claude Opus to intelligently orchestrate subagents.
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
dataprofessor/langchain-ask-the-doc
Ask the Doc app built using Langchain and Streamlit.
argilla-io/distilabel
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
argilla-io/argilla
Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets