Pinned Repositories
3D_Attention
Inhibition-aware (-regularized) 3D attention for robust visual recognition
CelebBasis
Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'
CodingEveryday
还是要多练
DesktopDuplication-MSDN-
A desktop duplication sample from MSDN.
FaceSwappingAllInOne
Put all face swapping method into a single repo
MSML
MSML: Enhancing Occlusion-Robustness by Multi-Scale Segmentation-Based Mask Learning for Face Recognition
MyDDA-BBT
DDA and BBT ways to capture the screen
NUSOpenSora
Open-Sora: Democratizing Efficient Video Production for All
ReliableSwap
Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'
XTryOn
ygtxr1997's Repositories
ygtxr1997/CelebBasis
Official Implementation of 'Inserting Anybody in Diffusion Models via Celeb Basis'
ygtxr1997/ReliableSwap
Official Implementation of 'ReliableSwap: Boosting General Face Swapping Via Reliable Supervision'
ygtxr1997/FaceSwappingAllInOne
Put all face swapping method into a single repo
ygtxr1997/MSML
MSML: Enhancing Occlusion-Robustness by Multi-Scale Segmentation-Based Mask Learning for Face Recognition
ygtxr1997/XTryOn
ygtxr1997/NUSOpenSora
Open-Sora: Democratizing Efficient Video Production for All
ygtxr1997/calvin_env
ygtxr1997/COMP7404A_Marking
ygtxr1997/ControlNet
Let us control diffusion models!
ygtxr1997/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
ygtxr1997/dlimp
dataloading is my passion
ygtxr1997/dp23rss_fork
[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
ygtxr1997/DualPathTransformer
Dual Path Transformer and Angular-Adaptive Margin Loss
ygtxr1997/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
ygtxr1997/gr1_24iclr
Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"
ygtxr1997/LIBERO
Benchmarking Knowledge Transfer in Lifelong Robot Learning
ygtxr1997/mdt24rss_fork
[RSS 2024] Code for "Multimodal Diffusion Transformer: Learning Versatile Behavior from Multimodal Goals" for CALVIN experiments with pre-trained weights
ygtxr1997/mode25rss
Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"
ygtxr1997/nerfstudio
A collaboration friendly studio for NeRFs
ygtxr1997/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
ygtxr1997/oxe_torch_dataloader
Minimal adaption of the dataloader proposed in Octo for loading over 25 embodiments.
ygtxr1997/PKU-Open-Sora-Forked
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
ygtxr1997/PTCG.AI
ygtxr1997/PTCGTeamBP
A Ban&Pick (BP) Tool for Pokemon Trading Card Game
ygtxr1997/rdt24tsinghua
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation (Fork version)
ygtxr1997/ReliableSwap.github.io
The academic page of the paper: ReliableSwap: Boosting General Face Swapping Via Reliable Supervision
ygtxr1997/rlds_dataset_builder
An example RLDS dataset builder for X-embodiment dataset conversion.
ygtxr1997/SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge) (CoRL 2024)
ygtxr1997/STIT
forked version
ygtxr1997/ygtxr1997.github.io
My academic homepage