zhujiagang's Stars
commaai/openpilot
openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
guoyww/AnimateDiff
Official implementation of AnimateDiff.
acheong08/EdgeGPT
Reverse engineered API of Microsoft's Bing Chat AI
openai/consistency_models
Official repo for consistency models.
CompVis/taming-transformers
Taming Transformers for High-Resolution Image Synthesis
zhayujie/bot-on-anything
Connect AI models (like ChatGPT-3.5/4.0, Baidu Yiyan, New Bing, Bard) to apps (like Wechat, public account, DingTalk, Telegram, QQ). 将 ChatGPT、必应、文心一言、谷歌Bard 等对话模型连接各类应用,如微信、公众号、QQ、Telegram、Gmail、Slack、Web、企业微信、飞书、钉钉等。
facebookresearch/ijepa
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."
MaybeShewill-CV/lanenet-lane-detection
Unofficial implemention of lanenet model for real time lane detection
mit-han-lab/bevfusion
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
X-PLUG/MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
open-mmlab/mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
ytongbai/LVM
Vchitect/Latte
Latte: Latent Diffusion Transformer for Video Generation.
protectai/llm-guard
The Security Toolkit for LLM Interactions
danielroich/PTI
Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744
google-research/magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
dvlab-research/LLaMA-VID
Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models
lucasjinreal/DCNv2_latest
DCNv2 supports decent pytorch such as torch 1.5+ (now 1.8+)
cure-lab/MagicDrive
[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
jxbbb/ADAPT
This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.
wayveai/Driving-with-LLMs
PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"
wl-zhao/UniPC
[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models
wzzheng/OccWorld
3D World Model for Autonomous Driving
huang-yh/SelfOcc
[CVPR 2024] SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction
f1yfisher/DriveDreamer2
pixeli99/TrackDiffusion
Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)
KaiChen1998/GeoDiffusion
Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)