lshowway

PhD candidate at BUAA Visiting student at KU

Copenhagen UniversityDenmark

lshowway's Stars

karpathy/LLM101n
LLM101n: Let's build a Storyteller
29.6k 2.3k 01.6k
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬
Language:Jupyter Notebook8k 96 1041.1k
wdndev/llm_interview_note
主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题
Language:HTML3.5k 18 6404
GanjinZero/awesome_Chinese_medical_NLP
中文医学NLP公开资源整理：术语集/语料库/词向量/预训练模型/知识图谱/命名实体识别/QA/信息抽取/模型/论文/etc
2.2k 46 5362
jiqizhixin/Artificial-Intelligence-Terminology-Database
A comprehensive mapping database of English to Chinese technical vocabulary in the artificial intelligence domain
1.9k 84 16330
allenai/dolma
Data and tools for generating and inspecting OLMo pre-training data.
Language:Python974 20 75108
Ruzim/NSFC-application-template-latex
国家自然科学基金申请书正文（面上项目）LaTeX 模板（非官方）
Language:TeX843 11 16205
minyoungg/platonic-rep
Language:Python454 12 929
google-deepmind/AQuA
A algebraic word problem dataset, with multiple choice questions annotated with rationales.
296 22 144
facebookresearch/SpinQuant
Code repo for the paper "SpinQuant LLM quantization with learned rotations"
Language:Python131 6 1213
locuslab/massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
Language:Python120 5 68
nrimsky/CAA
Steering Llama 2 with Contrastive Activation Addition
Language:Jupyter Notebook94 1 630
Tribleave/SCAPT-ABSA
Code for EMNLP 2021 paper: "Learning Implicit Sentiment in Aspect-based Sentiment Analysis with Supervised Contrastive Pre-Training"
Language:Python92 3 1121
DAMO-NLP-SG/LLM-Sentiment
[NAACL 2024] Data and code for our paper "Sentiment Analysis in the Era of Large Language Models: A Reality Check"
Language:Python83 7 114
mega002/ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.
Language:Python83 1 15
i-machine-think/diagNNose
diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.
Language:Python81 8 698
Dakingrai/awesome-mechanistic-interpretability-lm-papers
65 4 05
google-research/heldout-influence-estimation
Language:Python61 7 08
alonj/Same-Task-More-Tokens
The code for the paper: "Same Task, More Tokens: the Impact of Input Length on the Reasoning Performance of Large Language Models"
Language:Jupyter Notebook49 1 13
whaleloops/KEPT
auto icd coding with prompt
Language:Jupyter Notebook47 4 917
thomasnguyen92/MIMIC-IV-ICD-data-processing
Language:Jupyter Notebook27 1 79
kdu4108/semiring-backprop-exps
Language:Jupyter Notebook17 1 02
VirtuosoResearch/ML4RoadSafety
A dataset for traffic accident analysis in the US
Language:Python17 1 71
JacksonWuxs/Interpret_Instruction_Tuning_LLMs
Understanding Why and How Instruction Tuning Changes Pre-trained Models
Language:Python16 3 23
xjjxmu/QSLAW
The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2024]
Language:Python73
bheinzerling/numeric-property-repr
Code for the paper: Monotonic Representation of Numeric Properties in Language Models (ACL 2024)
Language:HTML5 1 00
Dakingrai/neuron-analysis-cot-arithmetic-reasoning
Language:Python5 1 00
lacoco-lab/sensitivity-hardness
Code for the paper
Language:Python51
paihengxu/XICL
Language:HTML2 2 04
zijian678/FreeCtrl
Language:Python11

lshowway

lshowway's Stars

karpathy/LLM101n

SakanaAI/AI-Scientist

wdndev/llm_interview_note

GanjinZero/awesome_Chinese_medical_NLP

jiqizhixin/Artificial-Intelligence-Terminology-Database

allenai/dolma

Ruzim/NSFC-application-template-latex

minyoungg/platonic-rep

google-deepmind/AQuA

facebookresearch/SpinQuant

locuslab/massive-activations

nrimsky/CAA

Tribleave/SCAPT-ABSA

DAMO-NLP-SG/LLM-Sentiment

mega002/ff-layers

i-machine-think/diagNNose

Dakingrai/awesome-mechanistic-interpretability-lm-papers

google-research/heldout-influence-estimation

alonj/Same-Task-More-Tokens

whaleloops/KEPT

thomasnguyen92/MIMIC-IV-ICD-data-processing

kdu4108/semiring-backprop-exps

VirtuosoResearch/ML4RoadSafety

JacksonWuxs/Interpret_Instruction_Tuning_LLMs

xjjxmu/QSLAW

bheinzerling/numeric-property-repr

Dakingrai/neuron-analysis-cot-arithmetic-reasoning

lacoco-lab/sensitivity-hardness

paihengxu/XICL

zijian678/FreeCtrl