lilin-hitcrt
Harbin Institute of Technology, Zhejiang University, 866 Yuhangtang Rd, Hangzhou 310058, P.R. China
lilin-hitcrt's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
Stability-AI/generative-models
Generative Models by Stability AI
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
state-spaces/mamba
Mamba SSM architecture
huggingface/trl
Train transformer language models with reinforcement learning.
g-truc/glm
OpenGL Mathematics (GLM)
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
cloneofsimo/lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
openai/guided-diffusion
THUDM/CogVLM
A state-of-the-art open visual language model | multimodal pre-trained model
google/prompt-to-prompt
rinongal/textual_inversion
Yujun-Shi/DragDiffusion
[CVPR2024, Highlight] Official code for DragDiffusion
ActiveVisionLab/Awesome-LLM-3D
Awesome-LLM-3D: a curated list of resources on multi-modal large language models in the 3D world
THUDM/P-tuning
A novel method to tune language models. Code and datasets for the paper "GPT Understands, Too".
PRIV-Creation/Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
google/dreambooth
NUS-HPC-AI-Lab/Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters.
donydchen/mvsplat
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
TencentARC/MasaCtrl
[ICCV 2023] Consistent Image Synthesis and Editing
bytedance/ibot
iBOT 🤖: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)
chungmin99/garfield
[CVPR'24] Group Anything with Radiance Fields
kxhit/EscherNet
[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis
OpenDriveLab/LaneSegNet
[ICLR 2024] Map Learning with Lane Segment for Autonomous Driving
PJLab-ADG/DiLu
[ICLR 2024] DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models
thu-ml/Bridge-TTS
Official codebase for "Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis" (https://arxiv.org/abs/2312.03491).