Bakser's Stars
openai/sparse_autoencoder
TransformerLensOrg/CircuitsVis
Mechanistic Interpretability Visualizations using React
ndif-team/nnsight
The nnsight package enables interpreting and manipulating the internals of deep learned models.
facebookresearch/llm-transparency-tool
LLM Transparency Tool (LLM-TT), an open-source interactive toolkit for analyzing internal workings of Transformer-based language models. *Check out demo at* https://huggingface.co/spaces/facebook/llm-transparency-tool-demo
stanfordnlp/pyvene
Stanford NLP Python Library for Understanding and Improving PyTorch Models via Interventions
PlusLabNLP/GENEVA
hollobit/GenAI_LLM_timeline
ChatGPT, GenerativeAI and LLMs Timeline
thunlp/UltraChat
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
guidance-ai/guidance
A guidance language for controlling large language models.
princeton-nlp/MeZO
[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333
nomic-ai/gpt4all
GPT4All: Chat with Local LLMs on Any Device
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
THUDM/GLM-130B
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
MLNLP-World/Paper-Writing-Tips
MLNLP社区用来帮助大家避免论文投稿小错误的整理仓库。 Paper Writing Tips
lixin4ever/Conference-Acceptance-Rate
Acceptance rates for the major AI conferences
THU-KEG/OmniEvent
A comprehensive, unified and modular event extraction toolkit.
THU-KEG/MAVEN-dataset
Source code and dataset for EMNLP 2020 paper "MAVEN: A Massive General Domain Event Detection Dataset".
huggingface/neuralcoref
✨Fast Coreference Resolution in spaCy with Neural Networks
thunlp/Prompt-Transferability
On Transferability of Prompt Tuning for Natural Language Processing
geekan/HowToLiveLonger
程序员延寿指南 | A programmer's guide to live longer
carrie0307/DL_EventExtractionPapers
2015年以来基于深度学习方法的事件抽取论文整理
OpenBMB/BMTrain
Efficient Training (including pre-training and fine-tuning) for Big Models
thunlp/OpenDelta
A plug-and-play library for parameter-efficient-tuning (Delta Tuning)
orionw/RedditHumorDetection
Code and datasets for the paper "Humor Detection: A Transformer Gets the Last Laugh"
acmi-lab/cmu-10717-the-art-of-the-paper
Official repository for CMU Machine Learning Department's 10717: "The Art of the Paper".
awebson/prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
lucidrains/x-transformers
A simple but complete full-attention transformer with a set of promising experimental features from various papers
hunterhector/EvmEval
The event mention detection and corefrene evaluators, and associated utilities (converters, validators)
raspberryice/gen-arg
Code for paper "Document-Level Argument Extraction by Conditional Generation". NAACL 21'