Wang-Haoxiao's Stars
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
state-spaces/mamba
Mamba SSM architecture
ShiArthur03/ShiArthur03
SWivid/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
timothybrooks/instruct-pix2pix
isl-org/MiDaS
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
nerfies/nerfies.github.io
prs-eth/Marigold
[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
isl-org/ZoeDepth
Metric depth estimation from a single image
CS-BAOYAN/CSSummerCamp2024
2024年计算机保研夏令营&冬令营通知
atomiechen/THU-PPT-Theme
清华主题PPT模板
CS-BAOYAN/CS-BAOYAN-2024
2024年保研经验贴和相关物料
zju3dv/street_gaussians
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
CS-BAOYAN/CSYuTuiMian2024
2024年计算机保研预推免通知
cvg/NoPoSplat
No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images
duanyiqun/DiffusionDepth
PyTorch Implementation of introducing diffusion approach to 3D depth perception ECCV 2024
935963004/LaBraM
[ICLR 2024 spotlight] Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI
CS-BAOYAN/CS-BAOYAN-Wiki
facebookresearch/HRViT
HRViT ("Multi-Scale High-Resolution Vision Transformer for Semantic Segmentation"), CVPR 2022.
Jiyao06/GenPose
[NeurIPS 2023] GenPose: Generative Category-Level Object Pose Estimation via Diffusion Models
datawhalechina/faster-git
a chinese tutorial of git
XiongxiaoXu/SST
The official implementation of the paper: "SST: Multi-Scale Hybrid Mamba-Transformer Experts for Long-Short Range Time Series Forecasting"
LYX0501/InstructNav
EnVision-Research/DriveRecon
fangzhou2000/DrivingForward
[AAAI 2025] Offical implementation of "DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input"
kaichen-z/DynPoint
[Neurips 2023] dynpoint: dynamic neural point for view synthesis
YangLing0818/SemanticSDS-3D
Semantic Score Distillation Sampling for Compositional Text-to-3D Generation
LJY-XCX/RFTrans