lshowway's Stars
karpathy/LLM101n
LLM101n: Let's build a Storyteller
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
wdndev/llm_interview_note
Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers
GanjinZero/awesome_Chinese_medical_NLP
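A curated collection of public resources for Chinese medical NLP: terminologies/corpora/word embeddings/pre-trained models/knowledge graphs/named entity recognition/QA/information extraction/models/papers/etc.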
jiqizhixin/Artificial-Intelligence-Terminology-Database
A comprehensive mapping database of English to Chinese technical vocabulary in the artificial intelligence domain
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Ruzim/NSFC-application-template-latex
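LaTeX template for the main body of a National Natural Science Foundation of China (NSFC) application (General Program) (unofficial)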
minyoungg/platonic-rep
google-deepmind/AQuA
An algebraic word problem dataset, with multiple choice questions annotated with rationales.
facebookresearch/SpinQuant
Code repo for the paper "SpinQuant: LLM quantization with learned rotations"
locuslab/massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
nrimsky/CAA
Steering Llama 2 with Contrastive Activation Addition
Tribleave/SCAPT-ABSA
Code for EMNLP 2021 paper: "Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training"
DAMO-NLP-SG/LLM-Sentiment
[NAACL 2024] Data and code for our paper "Sentiment Analysis in the Era of Large Language Models: A Reality Check"
mega002/ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.
i-machine-think/diagNNose
diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.
Dakingrai/awesome-mechanistic-interpretability-lm-papers
google-research/heldout-influence-estimation
alonj/Same-Task-More-Tokens
The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
whaleloops/KEPT
Auto ICD coding with prompt
thomasnguyen92/MIMIC-IV-ICD-data-processing
kdu4108/semiring-backprop-exps
VirtuosoResearch/ML4RoadSafety
A dataset for traffic accident analysis in the US
JacksonWuxs/Interpret_Instruction_Tuning_LLMs
Understanding Why and How Instruction Tuning Changes Pre-trained Models
xjjxmu/QSLAW
The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2024]
bheinzerling/numeric-property-repr
Code for the paper: Monotonic Representation of Numeric Properties in Language Models (ACL 2024)
Dakingrai/neuron-analysis-cot-arithmetic-reasoning
lacoco-lab/sensitivity-hardness
Code for the paper
paihengxu/XICL
zijian678/FreeCtrl