thatname's Stars
gpt-engineer-org/gpt-engineer
Platform to experiment with the AI Software Engineer. Terminal based. NOTE: Very different from https://gptengineer.app
geekan/MetaGPT
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
lllyasviel/Omost
Your image is almost there!
HVision-NKU/StoryDiffusion
Accepted as [NeurIPS 2024] Spotlight Presentation Paper
Doubiiu/ToonCrafter
[SIGGRAPH Asia 2024, Journal Track] ToonCrafter: Generative Cartoon Interpolation
wjakob/instant-meshes
Interactive field-aligned mesh generator
ufal/whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
eloialonso/diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
Bklieger/infinite-bookshelf
Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3
THUDM/AutoWebGLM
An LLM-based Web Navigating Agent (KDD'24)
ShuhongChen/panic3d-anime-reconstruction
CVPR 2023: PAniC-3D Stylized Single-view 3D Reconstruction from Portraits of Anime Characters
Scthe/nanite-webgpu
UE5's Nanite implementation using WebGPU. Includes the meshlet LOD hierarchy, software rasterizer and billboard impostors. Culling on both per-instance and per-meshlet basis.
hzxie/CityDreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)
madebyollin/taesd
Tiny AutoEncoder for Stable Diffusion
open-mmlab/FoleyCrafter
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
donahowe/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
Vchitect/Vlogger
[CVPR2024] Make Your Dream A Vlog
AlonzoLeeeooo/awesome-video-generation
A collection of awesome video generation studies.
SamurAIGPT/Text-To-Video-AI
Generate video from text using AI
SarahWeiii/diso
Differentiable Iso-Surface Extraction Package (DISO)
eliphatfs/zerorf
ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining
pjlab-songcomposer/songcomposer
nath1295/LLMFlex
A python package for developing AI applications with local LLMs.
MiuLab/PersonaLLM-Survey
thu-nics/ViDiT-Q
ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
parlance-zz/dualdiffusion
Fourier Dual Diffusion
BB31420/loveListLace
Python GUI using OpenAI to make video stories from real-time Craigslist data
desaixie/carve3d
Code for Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
YichenZW/Pacing
This repository includes the code implementation of the paper Improving Pacing in Long-Form Story Planning by Yichen Wang, Kevin Yang, Xiaoming Liu, and Dan Klein.