hxhcreate's Stars
allenai/ai2thor
An open-source platform for Visual AI.
shengyin1224/SafeAgentBench
Codes for paper "SafeAgentBench: A Benchmark for Safe Task Planning of \\ Embodied LLM Agents"
GAIR-NLP/PC-Agent
PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World
sail-sg/Agent-Smith
[ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
SALT-NLP/PopupAttack
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
wz0919/VLN-SRDF
Official implementation of: Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
showlab/ShowUI
Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent
showlab/Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
xlang-ai/OSWorld
[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
EvolvingLMMs-Lab/multimodal-sae
Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
luka-group/mDPO
[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.
stone-zeng/fduthesis
LaTeX thesis template for Fudan University
yzbrlan/fudan-thesis-latex-template
复旦论文latex模版,包括毕业论文模版,普通课程论文模版(带封皮)
ZHZisZZ/weak-to-strong-search
[NeurIPS'24] Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
hxhcreate/VLSBench
Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
UNITES-Lab/MoE-RBench
[ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"
OpenSparseLLMs/Skip-DiT
✈️ Accelerating Vision Diffusion Transformers with Skip Branches.
DripNowhy/ETA
PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"
eric-ai-lab/MSSBench
Official codebase for the paper "Multimodal Situational Safety"
Qinyu-Allen-Zhao/LVLM-LP
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
microsoft/bing-search-sdk-for-python
Bing Search APIs SDK for python
xiaobai1217/Awesome-Video-Datasets
Video datasets
JusticeFighterDance/JusticeFighter110
田柯宇 (Tian Keyu)恶意攻击集群事件的证据揭露
NiuTrans/Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
erfanshayegani/Jailbreak-In-Pieces
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal Language Models
umd-huang-lab/VLM-Poisoning
Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
FDUCSLG/mirror-issues
请求新镜像、报告 bug
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
RobustNLP/DeRTa
A novel approach to improve the safety of large language models, enabling them to transition effectively from unsafe to safe state.
JailbreakBench/jailbreakbench
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]