Pinned Repositories
DINOv
CVPR 2024
Efficient-AI-Backbones
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
LLaVA-v1_55
lvlm git gud
MobileResNet-50
A simplified, accuracy-preserving variant of ResNet-50 built with the inverted residual construction
MoE-LLaVA-but-Vision-Experts-as-well
Realtime_Multi-Person_Pose_Estimation
Code repo for realtime multi-person pose estimation, without using any person detector.
RSA_pycaffe
pycaffe version of RSA 'Recurrent Scale Approximation for Object Detection in CNN'
power0341's Repositories
power0341/RSA_pycaffe
pycaffe version of RSA 'Recurrent Scale Approximation for Object Detection in CNN'
power0341/Realtime_Multi-Person_Pose_Estimation
Code repo for realtime multi-person pose estimation, without using any person detector.
power0341/MobileResNet-50
A simplified, accuracy-preserving variant of ResNet-50 built with the inverted residual construction
power0341/DINOv
CVPR 2024
power0341/Efficient-AI-Backbones
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
power0341/Live2Diff
Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.
power0341/LLaVA-UHD
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images
power0341/LLaVA-v1_55
lvlm git gud
power0341/MoE-LLaVA-but-Vision-Experts-as-well
power0341/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.