hanqi-qi's Stars
hanqi-qi/Revisit_monosemanticity
Hzfinfdu/Diffusion-BERT
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
pydantic/pydantic
Data validation using Python type hints
zyxnlp/ICL-Interpretation-Analysis-Resources
Links to publications that focus on the interpretation and analysis of in-context learning
oyarsa/event_extraction
Extract events from text: cause, effect and relation
cece00/SAPGraph
code and data for paper: SAPGraph: Structure-aware Extractive Summarization for Scientific Papers with Heterogeneous Graph
ChicagoHAI/hypothesis-generation
This is the official repository for HypoGeniC (Hypothesis Generation in Context), which is an automated, data-driven tool that leverages large language models to generate hypothesis for open-domain research. For more details, please see the original paper using the link below.
Yu-Fangxu/COLD-Attack
[ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
demelin/moral_stories
Data and code for the "Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences" (Emelin et al., 2021) paper.
UKPLab/EMNLP2023_jiu_jitsu_argumentation_for_rebuttals
This repository contains the code and data for the EMNLP 2023 paper "Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals"
openai/automated-interpretability
mlfoundations/open_clip
An open source implementation of CLIP.
OAfzal/nlp-for-peer-review
xyzCS/InfoAC
Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
wesg52/sparse-probing-paper
Sparse probing paper full code.
YuejiangLIU/csl
[Preprint] Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
hanqi-qi/Matte
nrimsky/LM-exp
LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces
ajyl/dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
PKU-Aligner/aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
hanqi-qi/Mirror
rohitgandikota/sliders
Concept Sliders for Precise Control of Diffusion Models
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
joeljang/RLPHF
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
facebookresearch/cascade
Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).
mutschcr/C-MCTS
huchenxucs/ChatDB
The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory".