ake020675
A computer vision engineer and researcher with 13+ year's experiences. Have 3 academic papers published in areas of eye tracking and visual saliency model.
American
ake020675's Stars
WZMIAOMIAO/deep-learning-for-image-processing
deep learning for image processing including classification and object-detection etc.
blakeblackshear/frigate
NVR with realtime local object detection for IP cameras
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Const-me/Whisper
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
roboflow/notebooks
This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM 2, Florence-2, PaliGemma 2, and Qwen2.5VL.
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
shouxieai/tensorRT_Pro
C++ library based on tensorrt integration
frgfm/torch-cam
Class activation maps for your PyTorch models (CAM, Grad-CAM, Grad-CAM++, Smooth Grad-CAM++, Score-CAM, SS-CAM, IS-CAM, XGrad-CAM, Layer-CAM)
microsoft/SoM
Set-of-Mark Prompting for GPT-4V and LMMs
ayushidalmia/awesome-fashion-ai
A repository to curate and summarise research papers related to fashion and e-commerce
IDEA-Research/DINO-X-API
DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding
kekmodel/FixMatch-pytorch
Unofficial PyTorch implementation of "FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence"
fkryan/gazelle
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025)
HuKai97/YOLOv5-LPRNet-Licence-Recognition
使用YOLOv5和LPRNet进行车牌检测+识别(CCPD数据集)
Liuyuxinict/prenet
tg-bomze/Style-Transfer-Collection
Colabs Collection of style transfer in photo and video
ChiShengChen/ResVMamba
The official repository implement of Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning.
1694439208/GOT-OCR-Inference
研究GOT-OCR-项目落地加速,不限语言
loopvoid/mls
moving least-squares for surface fitting
jianzhang96/fdsnet
[ICASSP 2022] FDSNet: An Accurate Real-Time Surface Defect Segmentation Network
jingdao/point_cloud_scene_completion
Point Cloud Scene Completion of Obstructed Building Facades with Generative Adversarial Inpainting
DHW-Master/NEU_Seg
muyouhang/3DFR
A pipeline for 3D face recognition(FR), including data preprocessing, feature extraction and face recognition. Suit for consumer RGB-D cameras, for example, Kinect V2.
KG-TSI-Civil/CrackSAM
Fine-tuning Segment Anything for crack segmentation
jswati31/stage
PyTorch implementation of STAGE model
AlenUbuntu/Awesome-Fashion-AI
A curated list of research papers, datasets, open-source codes, conferences, workshops related to AI for fashion and e-commerce.
nsssayom/OpenGaze
Web service for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation based on OpenFace 2.0
simonryu328/Golf-Swing-Extractor
Golf swing detection/extraction by computer vision and machine learning techniques. Using Roboflow's object detection model and RNNs in PyTorch
husky-helen/ObyGaze12
saadmdsabah/Skin-Cancer-Detection
Developed an AI-powered Skin Cancer detection system utilizing Convolutional Neural Networks in Python, enhancing early diagnosis and treatment through machine learning techniques