m7mdhka's Stars
junkunyuan/HAP
[NeurIPS 2023] HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
GWxuan/ReID3D
layumi/Person_reID_baseline_pytorch
:bouncing_ball_person: Pytorch ReID: A tiny, friendly, strong pytorch implement of person re-id / vehicle re-id baseline. Tutorial 👉https://github.com/layumi/Person_reID_baseline_pytorch/tree/master/tutorial
facebookresearch/xformers
Hackable and optimized Transformers building blocks, supporting a composable construction.
pidahbus/deep-image-orientation-angle-detection
chengzhag/PanFusion
🍳 [CVPR'24 Highlight] Pytorch implementation of "Taming Stable Diffusion for Text to 360° Panorama Image Generation"
ShineChen1024/MagicClothing
Official implementation of Magic Clothing: Controllable Garment-Driven Image Synthesis
lyc0929/OOTDiffusion-train
levihsu/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
intel/intel-extension-for-transformers
⚡ Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platforms⚡
rotem-shalev/motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
stitionai/devika
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.
sign-language-processing/pose-to-video
Render pose sequences as photorealistic videos.
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Unstructured-IO/unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
PrAsAnNaRePo/TableOCR
A streamlit application to extract the table contents from the pdf page.
salim-benhamadi/TabularOCR
TabularOCR is a Python library that provides an easy-to-use Optical Character Recognition (OCR) solution for extracting tables from images and PDFs. It offers flexible output options, allowing you to export the extracted data in CSV, XLSX, or other spreadsheet formats.
ashishpatel26/LLM-Engineering-Crash-Course
LLM Engineering CrashCourse
nlmatics/llmsherpa
Developer APIs to Accelerate LLM Projects
BenSaunders27/ProgressiveTransformersSLP
Source code for "Progressive Transformers for End-to-End Sign Language Production" (ECCV 2020)
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
shoebham/text_to_isl
A Summer project that converts text to Indian Sign language through animation
rotem-shalev/Ham2Pose
Official implementation for "Ham2Pose: Animating Sign Language Notation into Pose Sequences" [CVPR 2023]
sign-language-processing/spoken-to-signed-translation
a text-to-gloss-to-pose-to-video pipeline for spoken to signed language translation
WongKinYiu/yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information
PaddlePaddle/Paddle-Lite-Demo
lib, demo, model, data
ultralytics/ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
clovaai/donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
deepdoctection/deepdoctection
A Repo For Document AI
VikParuchuri/surya
OCR, layout analysis, reading order, line detection in 90+ languages