lili0710432

Fortune favors the dream with attention.

Beijing, China

lili0710432's Stars

deepfakes/faceswap
Deepfakes Software For All
Language: Python 52.8k 1.5k 86813.3k
triton-lang/triton
Development repository for the Triton language and compiler
Language: C++ 13.9k 198 1.6k1.7k
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language: Python 4.9k 111 137420
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
Language: Python 3.5k 33 98484
salesforce/ALBEF
Code for ALBEF: a new vision-language pre-training method
Language: Python 1.6k 13 141200
wfh45678/radar
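Real-time risk control engine (Risk Engine) with a custom rule engine (Rule Script). Fully supports Chinese, suits anti-fraud application scenarios, and works out of the box! A powerful risk-management tool for the mobile internet era. Have you got it yet?
Language: Java 1.5k 56 16487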
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language: Python 1.4k 18 88119
deepseek-ai/awesome-deepseek-integration
1.1k 15 29103
VITA-MLLM/VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM
Language: Python 1.1k 38 6768
citizenlab/chat-censorship
Data related to the investigation of realtime censorship
Language: Lua 655 55 3102
volcengine/verl
veRL: Volcano Engine Reinforcement Learning for LLM
Language: Python 573 10 2942
THUDM/LongCite
LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Language: Python 444 11 1533
centerforaisafety/HarmBench
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Language: Jupyter Notebook 384 6 5061
SheltonLiu-N/AutoDAN
The official implementation of our ICLR 2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language Models".
Language: Python 267 5 1745
Ablustrund/LoRAMoE
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment
Language: Python 263 2 1819
GraySwanAI/circuit-breakers
Improving Alignment and Robustness with Circuit Breakers
Language: Jupyter Notebook 170 15 1124
Vance0124/Token-level-Direct-Preference-Optimization
Reference implementation for Token-level Direct Preference Optimization (TDPO)
Language: Python 122 1 713
thu-ml/Attack-Bard
Language: Python 90 6 86
DAMO-NLP-SG/multilingual-safety-for-LLMs
[ICLR 2024] Data for "Multilingual Jailbreak Challenges in Large Language Models"
64 7 06
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
Language: Python 61 2 34
foundation-multimodal-models/CAL
[NeurIPS'24] Official PyTorch Implementation of Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment
Language: Python 57 0 52
NJUNLP/x-LLM
27 3 52
GeWu-Lab/Certifiable-Robust-Multi-modal-Training
A Python implementation of Certifiable Robust Multi-modal Training
Language: Python 16 1 20
elsevier-AI-Lab/BioBLP
A Modular Framework for Learning on Multimodal Biomedical Knowledge Graphs
Language: Jupyter Notebook 13 4 53
alenai97/PEFT-MLLM
Official code and data for the ACL 2024 Findings paper "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"
Language: Python 120
ZNLP/Language-Imbalance-Driven-Rewarding
Language Imbalance Driven Rewarding for Multilingual Self-improving
Language: Python 12 1 20
lili0710432/ALBEF
Code for ALBEF: a new vision-language pre-training method
1
lili0710432/chat-censorship
Data related to the investigation of realtime censorship
1
lili0710432/radar
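Real-time risk control engine (Risk Engine) with a custom rule engine (Rule Script). Fully supports Chinese, suits anti-fraud application scenarios, and works out of the box! A powerful risk-management tool for the mobile internet era. Have you got it yet?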
1
lili0710432/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
1
