emigmo's Stars
HowieHwong/TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
austrian-code-wizard/c3po
sail-sg/Agent-Smith
[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
GAIR-NLP/scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
XuandongZhao/weak-to-strong
Weak-to-Strong Jailbreaking on Large Language Models
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
OpenRobotLab/EmbodiedScan
[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
lapisrocks/rpo
Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks"
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
GAIR-NLP/OPO
facebookresearch/EgoObjects
[ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
netease-youdao/QAnything
Question and Answer based on Anything.
Mihaiii/llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors
TaskingAI/TaskingAI
The open source platform for AI-native application development.
cohere-ai/human-feedback-paper
Code and data from the paper 'Human Feedback is not Gold Standard'
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
OFA-Sys/Ditto
A self-alignment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment".
vlf-silkie/VLFeedback
minghanqin/LangSplat
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
ajyl/dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
dauparas/LigandMPNN
penghao-wu/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
GXimingLu/IPA
Codebase for Inference-Time Policy Adapters
HXYfighter/MolRL-MGPT
NeurIPS 2023 paper: De novo Drug Design using Reinforcement Learning with Multiple GPT Agents
abhika-m/FAVA
Hritikbansal/sparse_feedback
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
anthropics/sleeper-agents-paper
Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".