GuanRainy

GuanRainy's Stars

mindee/doctr
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
Language: Python, 3.3k stars, 394 forks
echo1118/Live_Detection
Liveness detection: blink detection, mouth-opening detection, head-shake detection, and nod detection
Language:Python206
codeniko/shape_predictor_81_face_landmarks
Custom shape predictor model trained to find 81 facial feature landmarks given any image
Language:Python456126
davisking/dlib-models
Trained model files for dlib example programs.
Language: C++, 1.4k stars, 367 forks
abnercloud/Facial_106_Landmarks
Facial_106_Landmarks
Language:Python206
hpc203/yoloface-landmark106
Face detection plus 106-keypoint detection using purely YOLO-series models
Language:Python2912
biubug6/Pytorch_Retinaface
RetinaFace reaches 80.99% on the WIDER FACE hard validation set using MobileNet-0.25.
Language: Python, 2.5k stars, 750 forks
jhb86253817/PIPNet
Efficient facial landmark detector
Language:Python39682
midasklr/facelandmarks
Ultra-lightweight 98-point face landmark detection model
Language:Python5312
HumanSignal/awesome-data-labeling
A curated list of awesome data labeling tools
3.6k stars, 421 forks
Evezerest/PPOCRLabel
PPOCRLabel is a semi-automatic graphical annotation tool for the OCR field, with a built-in PP-OCR model that automatically detects and re-recognizes data. It is written in Python 3 and PyQt5 and supports rectangular-box and four-point annotation modes. Annotations can be used directly to train PP-OCR detection and recognition models.
Language:Python17041
itmorn/robot-mouse-track
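As internet technology develops, demand for mouse-trajectory recognition algorithms in human-computer interaction products keeps growing. For example, some websites add slider CAPTCHAs to deter crawlers, yet some software can already simulate human behavior to defeat them. This project analyzes features of mouse trajectories to determine whether the behavior comes from a human or a machine. Common use cases: website anti-crawling and detecting scripted answering in online exam systems. Documentation: https://robot-mouse-track.readthedocs.io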
Language:Python4713
kwuking/TimeMixer
[ICLR 2024] Official implementation of "TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting"
Language: Python, 1k stars, 136 forks
mitchellh/mapstructure
Go library for decoding generic map values into native Go structures and vice versa.
Language: Go, 7.8k stars, 668 forks
google-research/timesfm
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
Language: Python, 3k stars, 223 forks
fundamentalvision/Deformable-DETR
Deformable DETR: Deformable Transformers for End-to-End Object Detection.
Language: Python, 3k stars, 503 forks
minio/minio
The Object Store for AI Data Infrastructure
Language: Go, 45.3k stars, 5.3k forks
google-ai-edge/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
Language: C++, 26.1k stars, 5k forks
DefTruth/torchlm
💎 A high-level pipeline for face landmark detection. It supports training, evaluation, export, inference (Python/C++), and 100+ data augmentations, and can be installed easily via pip.
Language:Python23323
reqable/reqable-app
Reqable issue-tracking repo
2.5k stars, 85 forks
leon-thomm/Ryven
Flow-based visual scripting for Python
Language: Python, 3.7k stars, 431 forks
haochenheheda/segment-anything-annotator
We developed a Python UI based on labelme and segment-anything for pixel-level annotation. It supports generating multiple masks with SAM (box/point prompts), efficient polygon modification, and category recording. We will add more features (such as incorporating CLIP-based methods for category proposal and VOS methods for video datasets
Language:Python33052
LLaVA-VL/LLaVA-NeXT
Language:Python93551
yformer/EfficientSAM
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
Language: Jupyter Notebook, 2k stars, 142 forks
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python, 14.8k stars, 1.4k forks
salesforce/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language: Jupyter Notebook, 9.1k stars, 903 forks
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language: Python, 17.8k stars, 1.9k forks
dvlab-research/LISA
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
Language: Python, 1.6k stars, 107 forks
facebookresearch/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
Language: Python, 6.9k stars, 1.2k forks
OpenBMB/MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
Language: Python, 7.6k stars, 528 forks