ZHANG1023's Stars
ziyc/drivestudio
A 3DGS framework for omni urban scene reconstruction and simulation.
tatp22/multidim-positional-encoding
An implementation of 1D, 2D, and 3D positional encoding in Pytorch and TensorFlow
RunpeiDong/DreamLLM
[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
dingo-actual/infini-transformer
PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)
BillyXYB/FaceDNeRF
[NeurIPS 2023] FaceDNeRF: Semantics-Driven Face Reconstruction, Prompt Editing and Relighting with Diffusion Models
reworkd/AgentGPT
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
MooreThreads/Moore-AnimateAnyone
Character Animation (AnimateAnyone, Face Reenactment)
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
modelscope/lite-sora
An initiative to replicate Sora
SysCV/sam-pt
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
ZHANG1023/FED-NeRF
xxlong0/Wonder3D
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
yangcaoai/CoDA_NeurIPS2023
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
rotemtzaban/STIT
weihaox/GAN-Inversion
[TPAMI 2022] GAN Inversion: A Survey
hongyixu37/omniavatar-proj
LeslieZhoa/HeSer.Pytorch
unofficial implementation of Few-Shot Head Swapping in the Wild
ZHANG1023/FLNeRF
FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
openai/point-e
Point cloud diffusion for 3D model synthesis
CompVis/stable-diffusion
A latent text-to-image diffusion model
NationalSecurityAgency/ghidra
Ghidra is a software reverse engineering (SRE) framework