sysuyy

Sun Yat-sen University, Guangzhou, China

sysuyy's Stars

eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Language: Python · 1.7k · 115
etched-ai/open-oasis
Inference script for Oasis 500M
Language: Python · 1.7k · 145
dqxiu/ICL_PaperList
Paper List for In-context Learning 🌷
82759
ali-vilab/VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
Language: Python · 3k · 267
PKU-YuanGroup/ChronoMagic-Bench
[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Language: Python · 19014
mseitzer/pytorch-fid
Compute FID scores with PyTorch.
Language: Python · 3.5k · 518
genmoai/mochi
The best OSS video generation models
Language: Python · 2.6k · 266
microsoft/LongRoPE
LongRoPE is a novel method that can extend the context window of pre-trained LLMs to an impressive 2048k tokens.
Language: Python · 11813
rhymes-ai/Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple text input.
Language: Python · 1k · 51
inFaaa/Autoregressive-Models-in-Vision-Survey
The paper collections for the autoregressive visual models.
3
Leminhbinh0209/FinetuneVAE-SD
Fine-tune VAE of Stable Diffusion model
Language: Python · 264
google/break-a-scene
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
Language: Python · 50624
minerllabs/basalt-2022-behavioural-cloning-baseline
Simple behavioural cloning baseline solution for BASALT 2022
Language: Python · 2920
nahyeonkaty/textboost
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder
Language: Python · 484
360CVGroup/FancyVideo
This is the official reproduction of FancyVideo.
Language: Python · 83272
MineDojo/MineDojo
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
Language: Java · 1.9k · 165
facebookresearch/Ego4d
Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset
Language: Jupyter Notebook · 37852
BolinLai/LEGO
[ECCV 2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation via Visual Instruction Tuning".
Language: Python · 34
doubleZ0108/Digital-Media-Technology-PKU
Fundamentals of Digital Media Technology (04713901) | Peking University ECE Course Materials
Language: C · 191
OpenGVLab/EgoExoLearn
[CVPR 2024] Data and benchmark code for the EgoExoLearn dataset
Language: Python · 48
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Language: Python · 4.1k · 248
TencentARC/SEED-Voken
SEED-Voken: A Series of Powerful Visual Tokenizers
Language: Python · 80431
baaivision/Emu3
Next-Token Prediction is All You Need
Language: Python · 2k · 77
huangb23/VTimeLLM
[CVPR'2024 Highlight] Official PyTorch implementation of the paper "VTimeLLM: Empower LLM to Grasp Video Moments".
Language: Python · 24012
NVIDIA/aistore
AIStore: scalable storage for AI applications
Language: Go · 1.3k · 183
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
Language: Python · 1.4k · 118
Stability-AI/StableCascade
Official Code for Stable Cascade
Language: Jupyter Notebook · 6.6k · 534
pytorch/torchdynamo
A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
Language: Python · 1k · 124
HyeonHo99/Video-Motion-Customization
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)
Language: Python · 1848
SkalskiP/top-cvpr-2024-papers
This repository is a curated collection of the most exciting and influential CVPR 2024 papers. 🔥 [Paper + Code + Demo]
Language: Python · 67959
