Pinned Repositories
AlphaPose
Real-Time and Accurate Full-Body Multi-Person Pose Estimation&Tracking System
CELNet
COMP5213_Project
CVProject_FaceRecognition
face recognition based on the eigenface paper.
DeepSpatialFusionCNN
Improving High Resolution Histology Image Classification with Deep Spatial Fusion Network
EV_GCN
Edge-variational Graph Convolutional Networks for Uncertainty-aware Disease Prediction
mindocr
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
MLProject_CNN_CAE
MLProject_LR_FNN_SVM
comp5212 course project
MLProject_ProbMatrixFactorization
SamitHuang's Repositories
SamitHuang/mindcv
A toolbox of vision models and algorithms based on MindSpore
SamitHuang/mindocr
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
SamitHuang/AnimateDiff
Official implementation of AnimateDiff.
SamitHuang/awesome-huge-models
A collection of AWESOME things about HUGE AI models.
SamitHuang/CogVideo
Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
SamitHuang/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image (uncensored)
SamitHuang/deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods.
SamitHuang/diff_wk_sd2
SamitHuang/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
SamitHuang/DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
SamitHuang/generative-models
Generative Models by Stability AI
SamitHuang/HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
SamitHuang/HunyuanVideo-I2V
HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo
SamitHuang/InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
SamitHuang/Janus
Janus-Series: Unified Multimodal Understanding and Generation Models
SamitHuang/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
SamitHuang/magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
SamitHuang/mindnlp
Easy-to-use and high-performance NLP and LLM framework based on MindSpore, compatible with models and datasets of 🤗Huggingface.
SamitHuang/mindocr-1
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
SamitHuang/mindocr_test
SamitHuang/mindone
one for all, Optimal generator with No Exception
SamitHuang/mmocr
OpenMMLab Text Detection, Recognition and Understanding Toolbox
SamitHuang/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
SamitHuang/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
SamitHuang/open_clip
An open source implementation of CLIP.
SamitHuang/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
SamitHuang/stable-diffusion
A latent text-to-image diffusion model
SamitHuang/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
SamitHuang/video_recaption
SamitHuang/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability