hanqi-qi

King's College LondonUnited Kingdom

hanqi-qi's Stars

hanqi-qi/Revisit_monosemanticity
Language:Jupyter Notebook2
Hzfinfdu/Diffusion-BERT
ACL'2023: DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models
Language:Python29225
pydantic/pydantic
Data validation using Python type hints
Language:Python21k1.9k
zyxnlp/ICL-Interpretation-Analysis-Resources
Links to publications that focus on the interpretation and analysis of in-context learning
2
oyarsa/event_extraction
Extract events from text: cause, effect and relation
Language:Python51
cece00/SAPGraph
code and data for paper: SAPGraph: Structure-aware Extractive Summarization for Scientific Papers with Heterogeneous Graph
Language:Python5
ChicagoHAI/hypothesis-generation
This is the official repository for HypoGeniC (Hypothesis Generation in Context), which is an automated, data-driven tool that leverages large language models to generate hypothesis for open-domain research. For more details, please see the original paper using the link below.
Language:Python271
Yu-Fangxu/COLD-Attack
[ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
Language:Python10118
demelin/moral_stories
Data and code for the "Moral Stories: Situated Reasoning about Norms, Intents, Actions, and their Consequences" (Emelin et al., 2021) paper.
Language:Python517
UKPLab/EMNLP2023_jiu_jitsu_argumentation_for_rebuttals
This repository contains the code and data for the EMNLP 2023 paper "Exploring Jiu-Jitsu Argumentation for Writing Peer Review Rebuttals"
Language:Python4
openai/automated-interpretability
Language:Python958113
mlfoundations/open_clip
An open source implementation of CLIP.
Language:Python10.2k975
OAfzal/nlp-for-peer-review
292
xyzCS/InfoAC
Addressing Order Sensitivity of In-Context Demonstration Examples in Causal Language Models
Language:Python3
ZHZisZZ/modpo
[ACL'24] Beyond One-Preference-Fits-All Alignment: Multi-Objective Direct Preference Optimization
Language:Python523
wesg52/sparse-probing-paper
Sparse probing paper full code.
Language:Jupyter Notebook4910
YuejiangLIU/csl
[Preprint] Co-Supervised Learning: Improving Weak-to-Strong Generalization with Hierarchical Mixture of Experts
Language:Python15
hanqi-qi/Matte
Language:Python6
nrimsky/LM-exp
LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces
Language:Jupyter Notebook7623
ajyl/dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
Language:Jupyter Notebook518
PKU-Aligner/aligner
[NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct
Language:Python1106
hanqi-qi/Mirror
Language:Python10
rohitgandikota/sliders
Concept Sliders for Precise Control of Diffusion Models
Language:Jupyter Notebook96876
andyzoujm/representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
Language:Jupyter Notebook71286
kmeng01/rome
Locating and editing factual associations in GPT (NeurIPS 2022)
Language:Python570120
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
Language:Python43451
joeljang/RLPHF
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
Language:Python946
facebookresearch/cascade
Implementation of CASCADE in Learning General World Models in a Handful of Reward-Free Deployments (NeurIPS 22).
Language:Python291
mutschcr/C-MCTS
Language:C++5
huchenxucs/ChatDB
The official repository of "ChatDB: Augmenting LLMs with Databases as Their Symbolic Memory".
Language:Python53046

hanqi-qi

hanqi-qi's Stars

hanqi-qi/Revisit_monosemanticity

Hzfinfdu/Diffusion-BERT

pydantic/pydantic

zyxnlp/ICL-Interpretation-Analysis-Resources

oyarsa/event_extraction

cece00/SAPGraph

ChicagoHAI/hypothesis-generation

Yu-Fangxu/COLD-Attack

demelin/moral_stories

UKPLab/EMNLP2023_jiu_jitsu_argumentation_for_rebuttals

openai/automated-interpretability

mlfoundations/open_clip

OAfzal/nlp-for-peer-review

xyzCS/InfoAC

ZHZisZZ/modpo

wesg52/sparse-probing-paper

YuejiangLIU/csl

hanqi-qi/Matte

nrimsky/LM-exp

ajyl/dpo_toxic

PKU-Aligner/aligner

hanqi-qi/Mirror

rohitgandikota/sliders

andyzoujm/representation-engineering

kmeng01/rome

kmeng01/memit

joeljang/RLPHF

facebookresearch/cascade

mutschcr/C-MCTS

huchenxucs/ChatDB