abdshomad's Stars
TheAlgorithms/Python
All Algorithms implemented in Python
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
Z4nzu/hackingtool
ALL IN ONE Hacking Tool For Hackers
nocodb/nocodb
🔥 🔥 🔥 Open Source Airtable Alternative
dokku/dokku
A docker-powered PaaS that helps you build and manage the lifecycle of applications
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
gitroomhq/postiz-app
📨 The ultimate social media scheduling tool, with a bunch of AI 🤖
anthropics/courses
Anthropic's educational courses
QwenLM/Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
facebookresearch/deit
Official DeiT repository
Belval/TextRecognitionDataGenerator
A synthetic data generator for text recognition
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
midday-ai/v1
An open-source starter kit based on Midday.
microsoft/Phi-3CookBook
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SLMs) available, outperforming models of the same size and next size up across a variety of language, reasoning, coding, and math benchmarks.
Lightning-AI/LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
srush/MiniChain
A tiny library for coding with large language models.
cosmicoptima/loom
A Loom implementation in Obsidian
Charles-Xie/awesome-described-object-detection
A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.
csxmli2016/MARCONet
Learning Generative Structure Prior for Blind Text Image Super-resolution [CVPR 2023]
madisonmay/docai
Structured information extraction from documents
yfaqh/Awesome-Scene-Text-Image-Super-Resolution
A collection of papers and resources on scene text image super-resolution.
otriscon/llm-structured-output
zhangzjn/OCR-GAN
[TIP 2023] Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection
bhimrazy/chat-with-phi-3-vision
Chat with Phi 3.5/3 Vision LLMs. Phi-3.5-vision is a lightweight, state-of-the-art open multimodal model built upon datasets which include - synthetic data and filtered publicly available websites - with a focus on very high-quality, reasoning dense data both on text and vision.
CQU-EIE-Data-simulation-Lab/LLGS
LLGS: Illuminating Gaussian Splatting via absorptance Modulation
bhimrazy/chat-with-qwen2-vl
Chat with Qwen2-VL. Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
daily-demos/daily-bots-react-native-demo
chunchet-ng/paddleocr_lpr
A demo for License Plate Recognition using PaddleOCR.
papermerge/ocr-worker
OCR Worker