Omar280x's Stars
tensorflow/tensorflow
An Open Source Machine Learning Framework for Everyone
roboflow/supervision
We write your reusable computer vision tools. 💜
infiniflow/ragflow
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
facebookresearch/nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
bitsandbytes-foundation/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
pymupdf/PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
deepdoctection/deepdoctection
A Repo For Document AI
huggingface/optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy to use hardware optimization tools
onnx/tensorflow-onnx
Convert TensorFlow, Keras, TensorFlow.js and TFLite models to ONNX
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
microsoft/table-transformer
Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS evaluation metric.
xuebinqin/DIS
This is the repo for our new project Highly Accurate Dichotomous Image Segmentation
BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvement, and skill curation, in a standardized general environment with minimal requirements.
X-LANCE/AniTalker
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
NVlabs/RADIO
Official repository for "AM-RADIO: Reduce All Domains Into One"
poloclub/unitable
UniTable: Towards a Unified Table Foundation Model
ymy-k/Hi-SAM
[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
z-mahmud22/Dlib_Windows_Python3.x
Dlib compiled binary (.whl) for Python 3.7-3.12 and Windows x64
gastruc/osv5m
VicenteVivan/geo-clip
This is an official PyTorch implementation of our NeurIPS 2023 paper "GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization"
fcakyon/ultralyticsplus
Hugging Face utilities for Ultralytics/YOLOv8
scabini/RADAM
We propose a new method named Random encoding of Aggregated Deep Activation Maps (RADAM) for feature extraction from pre-trained deep CNNs. The technique consists of encoding the output at different depths of the CNN using a Randomized Autoencoder, producing a single image descriptor.
deep-diver/segformer-tf-transformers
This repository demonstrates how to use TensorFlow based SegFormer model in 🤗 transformers package.
ahmedramadan96/EALPR
A New Benchmark Dataset for Egyptian License Plate Detection and Recognition
SAP-samples/clustertabnet
Implementation of the table detection and table structure recognition deep learning model described in the paper "ClusterTabNet: Supervised clustering method for table detection and table structure recognition".