Pinned Repositories
2019-CCF-BDCI-OCR-MCZJ-OCR-IdentificationIDElement
2019CCF-BDCI大赛 最佳创新探索奖获得者 基于OCR身份证要素提取赛题冠军 天晨破晓团队 赛题源码
2D_detection
TensorFlow implementation of SqueezeDet, trained on the KITTI dataset.
CodeFun
📚 这绝对是一份好看的深度学习笔记了😏
Dense-Head-Pose-Estimation
[ECCV 2020] Reimplementation of "Towards Fast, Accurate and Stable 3D Dense Face Alignment", face mesh, head pose, landmarks, and more.
drone_landing
SJTU Innovation Competition of AR.Drone
GrabCut_CUDA
GrabCut based video segmentation boosted with CUDA
GroupNorm-reproduce
A collection of code in different frameworks that reproduces experiments in "Group Normalization"
nnvm
Bring deep learning to bare metal
SCS-PRJ
基于caffe的视频监控系统
velo2cam_calibration
Automatic Calibration algorithm for Lidar-Stereo camera. ROS Package.
hajungong007's Repositories
hajungong007/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
hajungong007/ControlNet
Let us control diffusion models
hajungong007/DigiHuman
Automatic 3D Character animation using Pose Estimation and Landmark Generation techniques
hajungong007/distill-sd
Segmind Distilled diffusion
hajungong007/DualStyleGAN
[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
hajungong007/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
hajungong007/flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
hajungong007/flowty-realtime-lcm-canvas
A realtime sketch to image demo using LCM and the gradio library.
hajungong007/GFPGAN-1024
GFPGAN 1024
hajungong007/gpupixel
Cross-Platform AI Beauty Effects Library, Achieving Commercial-Grade Beauty Effects. Written in C++11, Based on OpenGL/ES and VNN.
hajungong007/Hi-SAM
[arXiv preprint] Hi-SAM: Marrying Segment Anything Model for Hierarchical Text Segmentation
hajungong007/humor
Code for ICCV 2021 paper "HuMoR: 3D Human Motion Model for Robust Pose Estimation"
hajungong007/img2img-turbo
One-Step Image-to-Image with SD-Turbo
hajungong007/mediapipe
Cross-platform, customizable ML solutions for live and streaming media.
hajungong007/Medical-Image-Segmentation
MedSeg: Medical Image Segmentation GUI Toolbox 可视化医学图像分割工具箱
hajungong007/MedicalGPT-zh
MedicalGPT-zh:一个基于ChatGLM的在高质量指令数据集微调的中文医疗对话语言模型
hajungong007/MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
hajungong007/NeRF-Factory
An awesome PyTorch NeRF library
hajungong007/OmniQuant
OmniQuant is a simple and powerful quantization technique for LLMs.
hajungong007/oms-Diffusion
hajungong007/OpenGlass
Turn any glasses into AI-powered smart glasses
hajungong007/sd-scripts
hajungong007/smirk
Official Pytorch Implementation of SMIRK: 3D Facial Expressions through Analysis-by-Neural-Synthesis (CVPR 2024)
hajungong007/SonarSAM
Segment Anything Model, SAM, Sonar images
hajungong007/Stable-Diffusion-Inpaint
Stable diffusion for inpainting
hajungong007/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
hajungong007/VToonify
[SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer
hajungong007/Windrecorder
MacOS App Rewind's alternative on Windows platform, your personal memorize search engine. It can continuously recording your screen locally in small file size, and OCR the content so you can backtrack and query memory any time.
hajungong007/wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
hajungong007/xrslam
OpenXRLab Visual-inertial SLAM Toolbox and Benchmark