emigmo

Tsinghua UniversityBeijing

emigmo's Stars

HowieHwong/TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
Language:Python43340
austrian-code-wizard/c3po
Language:Python275
sail-sg/Agent-Smith
[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
Language:Python8011
uclaml/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
Language:Python97588
dreamgaussian/dreamgaussian
[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation
Language:Python3.9k346
GAIR-NLP/scaleeval
Scalable Meta-Evaluation of LLMs as Evaluators
Language:Python403
XuandongZhao/weak-to-strong
Weak-to-Strong Jailbreaking on Large Language Models
Language:Python628
SalesforceAIResearch/DiffusionDPO
Code for "Diffusion Model Alignment Using Direct Preference Optimization"
Language:Python23221
OpenRobotLab/EmbodiedScan
[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
Language:Python45434
lapisrocks/rpo
Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks"
Language:Python385
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
Language:Python1.9k121
GAIR-NLP/OPO
Language:Python496
facebookresearch/EgoObjects
[ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding
Language:Python754
netease-youdao/QAnything
Question and Answer based on Anything.
Language:Python11.4k1.1k
Mihaiii/llm_steer
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vectors
Language:Python19210
TaskingAI/TaskingAI
The open source platform for AI-native application development.
Language:Python6k298
cohere-ai/human-feedback-paper
Code and data from the paper 'Human Feedback is not Gold Standard'
Language:Jupyter Notebook181
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
Language:Ruby39229
OFA-Sys/Ditto
A self-ailgnment method for role-play. Benchmark for role-play. Resources for "Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment".
Language:Jupyter Notebook15415
vlf-silkie/VLFeedback
Language:Python742
minghanqin/LangSplat
Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]
Language:Python63164
ajyl/dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
Language:Jupyter Notebook458
dauparas/LigandMPNN
Language:Python20742
penghao-wu/vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"
Language:Python50433
GXimingLu/IPA
Codebase for Inference-Time Policy Adapters
Language:Python192
HXYfighter/MolRL-MGPT
NeurIPS 2023 paper: De novo Drug Design using Reinforcement Learning with Multiple GPT Agents
Language:Python201
abhika-m/FAVA
Language:Python531
Hritikbansal/sparse_feedback
Language:Python291
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
9.8k710
anthropics/sleeper-agents-paper
Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".
819

emigmo

emigmo's Stars

HowieHwong/TrustLLM

austrian-code-wizard/c3po

sail-sg/Agent-Smith

uclaml/SPIN

dreamgaussian/dreamgaussian

GAIR-NLP/scaleeval

XuandongZhao/weak-to-strong

SalesforceAIResearch/DiffusionDPO

OpenRobotLab/EmbodiedScan

lapisrocks/rpo

PKU-YuanGroup/MoE-LLaVA

GAIR-NLP/OPO

facebookresearch/EgoObjects

netease-youdao/QAnything

Mihaiii/llm_steer

TaskingAI/TaskingAI

cohere-ai/human-feedback-paper

fe1ixxu/ALMA

OFA-Sys/Ditto

vlf-silkie/VLFeedback

minghanqin/LangSplat

ajyl/dpo_toxic

dauparas/LigandMPNN

penghao-wu/vstar

GXimingLu/IPA

HXYfighter/MolRL-MGPT

abhika-m/FAVA

Hritikbansal/sparse_feedback

e2b-dev/awesome-ai-agents

anthropics/sleeper-agents-paper