Pinned Repositories
3DNBF
Official code base for the ICCV 2023 paper "3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation"
accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
ailia-models_bopw
The collection of pre-trained, state-of-the-art AI models for ailia SDK
AltFreezing
[CVPR 2023 Highlight] Official implementation of the paper: "AltFreezing for More General Video Face Forgery Detection"
crnn-ctc-loss-digit
OCR Handwriting Number
Detect_corner_label_product
Face-Anti-Spoofing-APK
Face-Anti-Spoofing-APK
Face_anti_spoofing_multimodels
Face Anti Spoofing
Tracking_Anything
Tracking any thing based on text prompt
web_demo_pill_retrieval
songuyenerza's Repositories
songuyenerza/3DNBF
Official code base for the ICCV 2023 paper "3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation"
songuyenerza/accelerated_features
Implementation of XFeat (CVPR 2024). Do you need robust and fast local feature extraction? You are in the right place!
songuyenerza/crnn-ctc-loss-digit
OCR Handwriting Number
songuyenerza/awesome-SOTA-FER
A curated list of facial expression recognition in both 7-emotion classification and affect estimation.
songuyenerza/CDFSOD-benchmark
A benchmark for cross-domain few-shot object detection (ECCV24 paper: Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector)
songuyenerza/DiffSplat
[ICLR 2025] Official implementation of "DiffSplat: Repurposing Image Diffusion Models for Scalable 3D Gaussian Splat Generation".
songuyenerza/DocAligner
Predictions of the four corners of documents.
songuyenerza/EscherNet
[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis
songuyenerza/Face-Analysis
Face-Analysis: Age, Race, Masked, Skintone, Emotion, Gender
songuyenerza/FAS_training
songuyenerza/flux
Official inference repo for FLUX.1 models
songuyenerza/GeoCalib
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
songuyenerza/GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
songuyenerza/handwriting-synthesis
Handwriting Synthesis with RNNs ✏️
songuyenerza/Image_retrieval_solar
songuyenerza/LLaVA-pp
🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)
songuyenerza/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
songuyenerza/LLMs-from-scratch
Implementing a ChatGPT-like LLM from scratch, step by step
songuyenerza/ml-direct2.5
songuyenerza/NLP_Research
songuyenerza/omniglue
Code release for CVPR'24 submission 'OmniGlue'
songuyenerza/One-DM
Official Code for ECCV 2024 paper — One-Shot Diffusion Mimicker for Handwritten Text Generation
songuyenerza/Paddleocr_dev
songuyenerza/Recommendations-Document-Image-Processing
This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.
songuyenerza/RemoteCLIP
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing"
songuyenerza/RT-DETR
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
songuyenerza/Train_Face_Recognition
songuyenerza/tutorials_triton
This repository contains tutorials and examples for Triton Inference Server
songuyenerza/uni_fas
5th Chalearn Face Anti-spoofing Workshop and Challenge@CVPR2024
songuyenerza/VLM-R1
Solve Visual Understanding with Reinforced VLMs