Wu-Zongyu's Stars
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
open-compass/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
showlab/Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
lucidrains/transfusion-pytorch
PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
XuyangBai/TransFusion
[PyTorch] Official implementation of CVPR2022 paper "TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers". https://arxiv.org/abs/2203.11496
OpenGVLab/Multi-Modality-Arena
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
mlcommons/croissant
Croissant is a high-level format for machine learning datasets that brings together four rich layers.
seyonechithrananda/bert-loves-chemistry
bert-loves-chemistry: a repository of HuggingFace models applied on chemical SMILES data for drug design, chemical modelling, etc.
jun0wanan/awesome-large-multimodal-agents
DSPsleeporg/smiles-transformer
Original implementation of the paper "SMILES Transformer: Pre-trained Molecular Fingerprint for Low Data Drug Discovery" by Shion Honda et al.
yuweihao/MM-Vet
MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)
Unispac/Visual-Adversarial-Examples-Jailbreak-Large-Language-Models
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
open-compass/MMBench
Official Repo of "MMBench: Is Your Multi-modal Model an All-around Player?"
MMStar-Benchmark/MMStar
[NeurIPS 2024] This repo contains evaluation code for the paper "Are We on the Right Way for Evaluating Large Vision-Language Models"
nupurkmr9/concept-ablation
Ablating Concepts in Text-to-Image Diffusion Models (ICCV 2023)
liudaizong/Awesome-LVLM-Attack
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
YanjieZe/Improved-3D-Diffusion-Policy
[arXiv 2024] Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies. Part 1: Train & Deploy of iDP3
junxia97/Mole-BERT
[ICLR 2023] "Mole-BERT: Rethinking Pre-training Graph Neural Networks for Molecules"
FairyFali/SLMs-Survey
Survey of Small Language Models from Penn State, ...
YanjieZe/Humanoid-Teleoperation
[arXiv 2024] Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies. Part 2: Humanoid Teleoperation
jinzhuoran/RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
ChnQ/LLM4Mol
Code implementation for paper "Can Large Language Models Empower Molecular Property Prediction?"
dmis-lab/ReSimNet
Implementation of ReSimNet for drug response similarity prediction
Haochen-Luo/CroPA
agiresearch/TrustAgent
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
sail-sg/Meta-Unlearning
zzwjames/FailureLLMUnlearning
Wu-Zongyu/LanPHal