linan-github's Stars
adithya-s-k/omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
microsoft/graphrag
A modular graph-based Retrieval-Augmented Generation (RAG) system
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Layout-Parser/layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
BobLd/DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
ruecat/ollama-telegram
🦙 Ollama Telegram bot, with advanced configuration
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
Colin-b/pyxelrest
Query REST APIs using Microsoft Excel User Defined Functions, VBA or Python functions
VBA-tools/VBA-Web
VBA-Web: Connect VBA, Excel, Access, and Office for Windows and Mac to web services and the web
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
Filimoa/open-parse
Improved file parsing for LLM’s
paul-gauthier/aider
aider is AI pair programming in your terminal
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
deepset-ai/haystack
:mag: LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conversational agent chatbots.
pawl/awesome-etl
A curated list of awesome ETL frameworks, libraries, and software.
python-bonobo/bonobo
Extract Transform Load for Python 3.5+
PiotrDabkowski/Js2Py
JavaScript to Python Translator & JavaScript interpreter written in 100% pure Python🚀 Try it online:
clemfromspace/scrapy-selenium
Scrapy middleware to handle javascript pages using selenium
ultrafunkamsterdam/undetected-chromedriver
Custom Selenium Chromedriver | Zero-Config | Passes ALL bot mitigation systems (like Distil / Imperva/ Datadadome / CloudFlare IUAM)
lorenzodifuccia/safaribooks
Download and generate EPUB of your favorite books from O'Reilly Learning (aka Safari Books Online) library.
ga-group/bsym
Bloomberg open symbology datasets
pynag/pynag
Python modules and utilities for Nagios plugins and configuration
awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
facebook/prophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.