hejingwenhejingwen
Currently a research intern at the Shenzhen Institutes of Advanced Technology.
Shenzhen Institutes of Advanced Technology
hejingwenhejingwen's Stars
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
AILab-CVC/VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
XPixelGroup/DiffBIR
Official codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
williamyang1991/Rerender_A_Video
[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
PixArt-alpha/PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
NVlabs/edm
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
sczhou/Upscale-A-Video
[CVPR 2024] Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution
ali-vilab/videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
OpenGVLab/SAM-Med2D
Official implementation of SAM-Med2D
lucidrains/muse-maskgit-pytorch
Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in PyTorch
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce neural network parameter diffusion (p-diff), a novel approach to parameter generation that employs a standard latent diffusion model to synthesize new sets of network parameters.
Vchitect/Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
Vchitect/VBench
[CVPR 2024 Highlight] VBench: a comprehensive benchmark suite for evaluating video generation
lucidrains/magvit2-pytorch
Implementation of the MagViT2 tokenizer in PyTorch
YingqingHe/ScaleCrafter
[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.
TianxingWu/FreeInit
[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models
Zj-BinXia/DiffIR
Official implementation of "DiffIR: Efficient Diffusion Model for Image Restoration" (ICCV 2023)
Vchitect/VEnhancer
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation
VINHYU/CoSeR
[CVPR 2024] CoSeR: Bridging Image and Language for Cognitive Super-Resolution
city-super/MatrixCity
bornfly-detachment/asymmetric_magvitv2
An open-source implementation (2024) of asymmetric MAGVIT-v2 that provides inference code but excludes the VQVAE. It supports joint encoding of images and videos at arbitrary lengths and resolutions, surpasses other open-source models in FID and FVD, and offers 4z and 16z models on Hugging Face.
Shuweis/ResMaster
XPixelGroup/SEAL
ICLR 2024 (Spotlight) - SEAL: A Framework for Systematic Evaluation of Real-World Super-Resolution
Vchitect/LiteGen
A lightweight and highly efficient training framework for accelerating diffusion tasks.