vaesl's Stars
ChatGPTNextWeb/ChatGPT-Next-Web
A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). Have your own cross-platform ChatGPT/Gemini application with one click.
chenfei-wu/TaskMatrix
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
THUDM/ChatGLM3
ChatGLM3 series: open-source bilingual chat LLMs
openai/shap-e
Generate 3D objects conditioned on text or images
gaomingqi/Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
geekyutao/Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
princeton-vl/infinigen
Infinite Photorealistic Worlds using Procedural Generation
QwenLM/Qwen-VL
The official repository of Qwen-VL (通义千问-VL), the chat and pretrained large vision-language model proposed by Alibaba Cloud.
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! Also supports many more LMs, such as MiniGPT-4, StableLM, and MOSS.
z-x-yang/Segment-and-Track-Anything
An open-source project for tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms are the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation.
Farama-Foundation/HighwayEnv
A minimalist environment for decision-making in autonomous driving
ttengwang/Caption-Anything
Caption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating captions tailored to diverse user preferences. https://huggingface.co/spaces/TencentARC/Caption-Anything https://huggingface.co/spaces/VIPLab/Caption-Anything
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
wilson1yan/VideoGPT
Thinklab-SJTU/Awesome-LLM4AD
A curated list of awesome LLM for Autonomous Driving resources (continually updated)
weiyithu/SurroundOcc
[ICCV 2023] SurroundOcc: Multi-camera 3D Occupancy Prediction for Autonomous Driving
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
PJLab-ADG/neuralsim
neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering.
exiawsh/StreamPETR
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
PJLab-ADG/DriveLikeAHuman
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models
danijar/daydreamer
DayDreamer: World Models for Physical Robot Learning
haoningwu3639/StoryGen
[CVPR 2024] Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models
wenyuqing/panacea
[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"
wudongming97/TopoMLP
[ICLR2024] TopoMLP: A Simple yet Strong Pipeline for Driving Topology Reasoning
megvii-research/Far3D
[AAAI2024] Far3D: Expanding the Horizon for Surround-view 3D Object Detection
wudongming97/Prompt4Driving
wudongming97/OnlineRefer
[ICCV 2023] OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation
WayneMao/PillarNeSt
The Official Implementation of PillarNeSt
zyayoung/Awesome-Video-LLMs
VLM-Eval, a framework for evaluating Video Large Language Models.