Omar280x

Omar280x's Stars

tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
Language:C++186k 7.6k 39.8k74.2k
roboflow/supervision
We write your reusable computer vision tools. 💜
Language:Python22.6k 156 4141.7k
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Language:Python18.3k 109 1.2k1.9k
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Language:Python13.6k 115 1k1.2k
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
Language:Python9.9k 85 131646
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language:Jupyter Notebook9.1k 136 4431.4k
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
Language:Python8.9k 56 5231.4k
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
Language:Python8.8k 63 213561
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
Language:Python6.1k 50 1k611
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Language:Python5.2k 60 2k496
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Language:Python3.4k 29 154277
deepdoctection/deepdoctection
A Repo For Document AI
Language:Python2.5k 18 180135
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
Language:Python2.5k 55 739446
onnx/tensorflow-onnx
Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX
Language:Jupyter Notebook2.3k 59 1k432
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Language:Python2.2k 41 95124
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
Language:Python2.2k 38 141247
xuebinqin/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
Language:Jupyter Notebook2.2k 91 124258
BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
Language:Python1.7k 22 31153
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Language:Jupyter Notebook1.4k 64 33128
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
Language:Python629 25 3423
poloclub/unitable
UniTable: Towards a Unified Table Foundation Model
Language:Jupyter Notebook349 9 2924
ymy-k/Hi-SAM
[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
Language:Python192 12 1810
z-mahmud22/Dlib_Windows_Python3.x
Dlib compiled binary (.whl) for Python 3.7-3.12 and Windows x64
146 4 330
gastruc/osv5m
Language:Python125 8 29
VicenteVivan/geo-clip
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
Language:Python123 2 1521
fcakyon/ultralyticsplus
Huggingface utilities for Ultralytics/YOLOv8
Language:Python77 2 011
scabini/RADAM
We propose a new method named Random encoding of Aggregated Deep Activation Maps (RADAM) for feature extraction from pre-trained Deep CNNs. The technique consists of encoding the output at different depths of the CNN using a Randomized Autoencoder, producing a single image descriptor
Language:Python32 4 11
deep-diver/segformer-tf-transformers
This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package.
Language:Jupyter Notebook30 4 44
ahmedramadan96/EALPR
A New Benchmark Dataset for Egyptian License Plate Detection and Recognition
13 2 02
SAP-samples/clustertabnet
Implementation of the table detection and table structure recognition deep learning model described in the paper "ClusterTabNet: Supervised clustering method for table detection and table structure recognition".
Language:Jupyter Notebook7 6 31