wanglg007's Stars
ConnectAI-E/Feishu-Vector-Knowledge-Management
🎒 A private knowledge-base bot built on a vector database and OpenAI.
WeiYe-Jing/datax-web
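A visual web interface for DataX: select data sources to generate data synchronization tasks in one click. Supports RDBMS, Hive, HBase, ClickHouse, MongoDB, and other data sources, batch creation of RDBMS synchronization tasks, an integrated open-source scheduling system, distributed execution, incremental data synchronization, real-time viewing of run logs, executor resource monitoring, killing running processes, and encryption of data source information.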
CrawlScript/WebCollector
WebCollector is an open source web crawler framework based on Java. It provides some simple interfaces for crawling the Web; you can set up a multi-threaded web crawler in less than 5 minutes.
QuivrHQ/quivr
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: PGVector, Faiss. Any Files. Any way you want.
yeungchenwa/OCR-SAM
Combining MMOCR with Segment Anything & Stable Diffusion. Automatically detect, recognize and segment text instances, with several downstream tasks, e.g., Text Removal and Text Inpainting
open-mmlab/playground
A central hub for gathering and showcasing amazing projects that extend OpenMMLab with SAM and other exciting features.
Evezerest/PaddleOCR
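Awesome OCR toolkits based on PaddlePaddle (an OCR toolkit built on PaddlePaddle, including an ultra-lightweight Chinese OCR system whose models total only 8.6M, with support for training multiple text detection and text recognition algorithms, service deployment, and on-device deployment)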
Evezerest/PPOCRLabel
PPOCRLabel is a semi-automatic graphical annotation tool for the OCR field, with a built-in PP-OCR model to automatically detect and re-recognize data. It is written in Python 3 and PyQt5 and supports rectangular box annotation and four-point annotation modes. Annotations can be used directly for training PP-OCR detection and recognition models.
chineseocr/trocr-chinese
Transformers OCR for Chinese
chineseocr/chineseocr
yolo3+ocr
mem0ai/mem0
The Memory layer for your AI apps
breezedeus/CnSTD
CnSTD: a Python 3 package based on PyTorch/MXNet for Chinese/English Scene Text Detection, Mathematical Formula Detection (MFD), and Layout Analysis
WenmuZhou/PytorchOCR
A PyTorch-based OCR toolkit supporting common text detection and recognition algorithms
hetianyi/digger
Digger is a powerful and flexible web crawler implemented in pure Go
wistbean/learn_python3_spider
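A Python web-scraping tutorial series for learning crawlers from zero to one, covering browser packet capture; mobile app packet capture with tools such as fiddler and mitmproxy; the modules crawlers rely on, such as requests, beautifulSoup, selenium, appium, and scrapy; IP proxies; CAPTCHA recognition; using MySQL and MongoDB from Python; multi-threaded and multi-process crawlers; reverse engineering CSS-based anti-scraping obfuscation; reverse engineering JS for crawlers; distributed crawlers; and hands-on crawler projects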
crawlab-team/crawlab
Distributed web crawler admin platform for spider management, supporting any language and framework.
taoshihan1991/go-fly
GO-FLY, an open-source online customer service system written in Go; a free online customer service system / open source self-hosted private cloud customer support live chat software by golang
Sirwenhao/Deep-Learning-Notes
Image processing study notes. Tutorial: https://github.com/WZMIAOMIAO/deep-learning-for-image-processing Corresponding videos: https://space.bilibili.com/18161609
hpcaitech/PaLM-colossalai
Scalable PaLM implementation in PyTorch
mistralai/FastChat-release
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
mistralai/mistral-inference
Official inference library for Mistral models
DA-southampton/TRM_tutorial
A from-scratch walkthrough of Transformer variants in CV and NLP: Transformer; VIT; Swin Transformer
WZMIAOMIAO/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
Mintplex-Labs/anything-llm
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, and more.
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
ncbi/GeneGPT
Code and data for GeneGPT.
mlrun/mlrun
MLRun is an open source MLOps platform for quickly building and managing continuous ML applications across their lifecycle. MLRun integrates into your development and CI/CD environment and automates the delivery of production data, ML pipelines, and online applications.
redoop/redoop-bigtop
Apache BigTop For Redoop