yumingj

MMLab@NTU, Ph.D. Student

Nanyang Technological University

yumingj's Stars

Significant-Gravitas/AutoGPT
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Language:Python167k 1.6k 2.7k44.2k
xai-org/grok-1
Grok open release
Language:Python49.5k 562 2098.3k
NanmiCoder/MediaCrawler
小红书笔记 | 评论爬虫、抖音视频 | 评论爬虫、快手视频 | 评论爬虫、B 站视频｜评论爬虫、微博帖子｜评论爬虫、百度贴吧帖子｜百度贴吧评论回复爬虫 | 知乎问答文章｜评论爬虫
Language:Python16.7k 104 2995.3k
danielgatis/rembg
Rembg is a tool to remove images background
Language:Python16.4k 149 4981.8k
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Language:Python11.3k 160 3001k
xuebinqin/U-2-Net
The code for our newly accepted paper in Pattern Recognition 2020: "U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection."
Language:Python8.5k 142 3421.5k
LargeWorldModel/LWM
Language:Python7.1k 66 71549
LiheYoung/Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
Language:Python6.8k 49 211522
FoundationVision/VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Language:Python4k 116 81301
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Language:Python3.1k 37 150240
ai-forever/Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
Language:Jupyter Notebook2.7k 48 87306
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Language:Python2.7k 46 0171
emeryberger/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
Language:Python2.7k 41 1.3k3.2k
Doubiiu/DynamiCrafter
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Language:Python2.4k 32 127196
TMElyralab/MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Language:Python2.4k 33 116253
layerdiffusion/LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
1.9k 112 3022
TencentARC/BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
Language:Python1.4k 41 67113
mhamilton723/FeatUp
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
Language:Jupyter Notebook1.3k 18 6379
openai/Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Language:Python1.3k 27 31142
chflame163/ComfyUI_LayerStyle
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
Language:Python1.2k 11 29967
Computer-Vision-in-the-Wild/CVinW_Readings
A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''
1.2k 39 657
stylegan-human/StyleGAN-Human
StyleGAN-Human: A Data-Centric Odyssey of Human Generation
Language:Python1.1k 36 49142
williamyang1991/FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Language:Jupyter Notebook712 12 4170
3DTopia/3DTopia
Text-to-3D Generation within 5 Minutes
Language:Python612 12 1241
microsoft/XPretrain
Multi-modality pre-training
Language:Python468 14 3836
Meituan-AutoML/VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
Language:Python357 23 610
mtli/HTML4Vision
A simple HTML visualization tool for computer vision research :hammer_and_wrench:
Language:Python232 7 1014
NIRVANALAN/LN3Diff
[ECCV-2024] LN3Diff creates high-quality 3D object mesh from text within 8 V100-SECONDS.
Language:Python149 11 29
wang-chen/thesis_template_ntu
Thesis Latex Template for Nanyang Technological University (NTU)
Language:TeX144 3 747
NannanLi999/UniHuman
Official code for CVPR 2024 paper UniHuman: A Unified Model For Editing Human Images in the Wild
Language:Python8 5 11

yumingj

yumingj's Stars

Significant-Gravitas/AutoGPT

xai-org/grok-1

NanmiCoder/MediaCrawler

danielgatis/rembg

PKU-YuanGroup/Open-Sora-Plan

xuebinqin/U-2-Net

LargeWorldModel/LWM

LiheYoung/Depth-Anything

FoundationVision/VAR

MooreThreads/Moore-AnimateAnyone

ai-forever/Kandinsky-2

PixArt-alpha/PixArt-alpha

emeryberger/CSrankings

Doubiiu/DynamiCrafter

TMElyralab/MuseV

layerdiffusion/LayerDiffuse

TencentARC/BrushNet

mhamilton723/FeatUp

openai/Video-Pre-Training

chflame163/ComfyUI_LayerStyle

Computer-Vision-in-the-Wild/CVinW_Readings

stylegan-human/StyleGAN-Human

williamyang1991/FRESCO

3DTopia/3DTopia

microsoft/XPretrain

Meituan-AutoML/VisionLLaMA

mtli/HTML4Vision

NIRVANALAN/LN3Diff

wang-chen/thesis_template_ntu

NannanLi999/UniHuman