Wang-Haoxiao

Wang-Haoxiao's Stars

labmlai/annotated_deep_learning_paper_implementations
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Language:Python58.6k 465 1356k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python40.6k 231 1.5k4.5k
state-spaces/mamba
Mamba SSM architecture
Language:Python14k 103 6131.2k
ShiArthur03/ShiArthur03
Language:MATLAB10.3k 32 1.4k1.9k
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Language:Python9.7k 94 5281.3k
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Jupyter Notebook6.6k 100 129431
timothybrooks/instruct-pix2pix
Language:Python6.5k 69 130544
isl-org/MiDaS
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
Language:Python4.7k 74 245655
nerfies/nerfies.github.io
Language:JavaScript2.9k 38 51.1k
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
Language:Python2.5k 42 111149
isl-org/ZoeDepth
Metric depth estimation from a single image
Language:Jupyter Notebook2.5k 35 120224
CS-BAOYAN/CSSummerCamp2024
2024年计算机保研夏令营&冬令营通知
1.6k 97 2114
atomiechen/THU-PPT-Theme
清华主题PPT模板
1.2k 8 085
CS-BAOYAN/CS-BAOYAN-2024
2024年保研经验贴和相关物料
1k 39 099
zju3dv/street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
Language:Python988 70 7861
CS-BAOYAN/CSYuTuiMian2024
2024年计算机保研预推免通知
732 42 146
cvg/NoPoSplat
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
Language:Python639 19 6426
duanyiqun/DiffusionDepth
PyTorch Implementation of introducing diffusion approach to 3D depth perception ECCV 2024
Language:Python317 7 4519
935963004/LaBraM
[ICLR 2024 spotlight] Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI
Language:Python299 2 5053
CS-BAOYAN/CS-BAOYAN-Wiki
Language:MDX198 3 1172
facebookresearch/HRViT
HRViT ("Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation"), CVPR 2022.
Language:Python190 10 716
Jiyao06/GenPose
[NeurIPS 2023] GenPose: Generative Category-Level Object Pose Estimation via Diffusion Models
Language:Python162 10 355
datawhalechina/faster-git
a chinese tutorial of git
149 6 758
XiongxiaoXu/SST
The official implementation of the paper: "SST: Multi-Scale Hybrid Mamba-Transformer Experts for Long-Short Range Time Series Forecasting"
Language:Python146 4 39
LYX0501/InstructNav
Language:Python98 1 136
EnVision-Research/DriveRecon
Language:Python75 6 31
fangzhou2000/DrivingForward
[AAAI 2025] Offical implementation of "DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input"
Language:Python71 9 13
kaichen-z/DynPoint
[Neurips 2023] dynpoint: dynamic neural point for view synthesis
Language:Python52 4 412
YangLing0818/SemanticSDS-3D
Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
Language:Python40 3 20
LJY-XCX/RFTrans
Language:C6 1 10