lili0710432's Stars
deepfakes/faceswap
Deepfakes Software For All
triton-lang/triton
Development repository for the Triton language and compiler
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
wfh45678/radar
实时风控引擎(Risk Engine),自定义规则引擎(Rule Script),完美支持中文,适用于反欺诈(Anti-fraud)应用场景,开箱即用!!!移动互联网时代的风险管理利器,你 Get 到了吗?
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
deepseek-ai/awesome-deepseek-integration
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
citizenlab/chat-censorship
Data related to the investigation of realtime censorship
volcengine/verl
veRL: Volcano Engine Reinforcement Learning for LLM
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
centerforaisafety/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
SheltonLiu-N/AutoDAN
The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
Ablustrund/LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
GraySwanAI/circuit-breakers
Improving Alignment and Robustness with Circuit Breakers
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization(TDPO)
thu-ml/Attack-Bard
DAMO-NLP-SG/multilingual-safety-for-LLMs
[ICLR 2024]Data for "Multilingual Jailbreak Challenges in Large Language Models"
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
foundation-multimodal-models/CAL
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
NJUNLP/x-LLM
GeWu-Lab/Certifiable-Robust-Multi-modal-Training
A python implement for Certifiable Robust Multi-modal Training
elsevier-AI-Lab/BioBLP
A Modular Framework for Learning on Multimodal Biomedical Knowledge Graphs
alenai97/PEFT-MLLM
Official Code and data for ACL 2024 finding, "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"
ZNLP/Language-Imbalance-Driven-Rewarding
Language Imbalance Driven Rewarding for Multilingual Self-improving
lili0710432/ALBEF
Code for ALBEF: a new vision-language pre-training method
lili0710432/chat-censorship
Data related to the investigation of realtime censorship
lili0710432/radar
实时风控引擎(Risk Engine),自定义规则引擎(Rule Script),完美支持中文,适用于反欺诈(Anti-fraud)应用场景,开箱即用!!!移动互联网时代的风险管理利器,你 Get 到了吗?
lili0710432/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback