qimizhang's Stars
dongyh20/Insight-V
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
InhwanBae/LMTrajectory
Official Code for "Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction (CVPR 2024)"
FoundationVision/VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
SunzeY/AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
1Konny/gradcam_plus_plus-pytorch
A Simple pytorch implementation of GradCAM and GradCAM++
pranavgupta2603/CLIP-ViL-GradCAM
An implemention of CLIP-ViL Gradcam for VQA tasks
xmed-lab/NuInstruct
bronyayang/Law_of_Vision_Representation_in_MLLMs
Official implementation of the Law of Vision Representation in MLLMs
NVlabs/BEV-Planner
Robot-K/Hint-AD
CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving
zhengli97/Awesome-Prompt-Adapter-Learning-for-VLMs
A curated list of awesome prompt/adapter learning methods for vision-language models like CLIP.
hustvl/VAD
[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving
nutonomy/nuscenes-devkit
The devkit of the nuScenes dataset.
AutoSurveys/AutoSurvey
tsinghua-fib-lab/EmbodiedCity
HKUST-DRIVE-AI-LAB/FollowNet
Source code for paper "FollowNet: A Comprehensive Benchmark for Car-Following Behavior Modeling"
HKUDS/EasyRec
"EasyRec: Simple yet Effective Language Model for Recommendation"
remyxai/VQASynth
Compose multimodal datasets 🎹
samwit/langchain-tutorials
A set of LangChain Tutorials from my youtube channel
BradyFU/Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
TempleX98/MoVA
[NeurIPS 2024] MoVA: Adapting Mixture of Vision Experts to Multimodal Context
meta-llama/llama3
The official Meta Llama 3 GitHub site
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
LLM-Red-Team/kimi-free-api
🚀 KIMI AI 长文本大模型逆向API【特长:长文本解读整理】,支持高速流式输出、智能体对话、联网搜索、长文档解读、图像解析、多轮对话,零配置部署,多路token支持,自动清理会话痕迹,仅供测试,如需商用请前往官方开放平台。
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
curiousily/Get-Things-Done-with-Prompt-Engineering-and-LangChain
LangChain & Prompt Engineering tutorials on Large Language Models (LLMs) such as ChatGPT with custom data. Jupyter notebooks on loading and indexing data, creating prompt templates, CSV agents, and using retrieval QA chains to query the custom data. Projects for using a private LLM (Llama 2) for chat with PDF files, tweets sentiment analysis.
huggingface/text-generation-inference
Large Language Model Text Generation Inference
dvlab-research/MGM
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
google-deepmind/opro
official code for "Large Language Models as Optimizers"