HUJA9

HUJA9's Stars

lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.4k 348 1.8k4.5k
jindongwang/transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-迁移学习
Language:Python13.3k 340 3383.8k
naklecha/llama3-from-scratch
llama3 implementation one matrix multiplication at a time
Language:Jupyter Notebook13k 93 161k
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama3 for WhatsApp & Messenger.
Language:Jupyter Notebook11.6k 92 3261.6k
marcotcr/lime
Lime: Explaining the predictions of any machine learning classifier
Language:JavaScript11.5k 263 6341.8k
py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Language:Python7k 136 472924
uber/causalml
Uplift modeling and causal inference with machine learning algorithms
Language:Python5k 84 393767
OpenBMB/ToolBench
[ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
Language:Python4.7k 49 285401
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
Language:Python4.5k 107 133385
llm-attacks/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
Language:Python3.3k 34 93457
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.3k 17 83120
yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
1.1k 16 860
allenai/natural-instructions
Expanding natural instructions
Language:Python941 21 161187
causaltext/causal-text-papers
Curated research at the intersection of causal inference and natural language processing.
772 39 295
zhijing-jin/Causality4NLP_Papers
A reading list for papers on causality for natural language processing (NLP)
484 23 056
wjmaddox/swa_gaussian
Code repo for "A Simple Baseline for Bayesian Uncertainty in Deep Learning"
Language:Jupyter Notebook437 13 2181
HowieHwong/TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
Language:Python427 8 2738
voidism/DoLa
Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"
Language:Python404 3 1548
SCLBD/BackdoorBench
Language:Jupyter Notebook379 5 4060
AI-secure/DecodingTrust
A Comprehensive Assessment of Trustworthiness in GPT Models
Language:Python249 6 2252
P2333/Bag-of-Tricks-for-AT
Empirical tricks for training robust models (ICLR 2021)
Language:Python249 4 725
LLM-Tuning-Safety/LLMs-Finetuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
Language:Python217 4 622
thunlp/OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
Language:Python148 10 3023
cooperleong00/Awesome-LLM-Interpretability
A curated list of LLM Interpretability related material - Tutorial, Library, Survey, Paper, Blog, etc..
105 2 14
Aligner2024/aligner
Achieving Efficient Alignment through Learned Correction
Language:Python102 1 75
causalNLP/corr2cause
Data and code for the Corr2Cause paper (ICLR 2024)
Language:Python79 2 113
nrimsky/LM-exp
LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces
Language:Jupyter Notebook72 1 021
niconi19/LLM-Conversation-Safety
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
63 3 16
GodXuxilie/PromptAttack
An LLM can Fool Itself: A Prompt-Based Adversarial Attack (ICLR 2024)
Language:Python40 2 17
wang2226/Trojan-Activation-Attack
Language:Python121