joelrorseth
PhD @ UWaterloo. Currently focused on explainability and interpretability for LLMs.
Waterloo, Ontario
joelrorseth's Stars
ggerganov/llama.cpp
LLM inference in C/C++
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
karpathy/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
guidance-ai/guidance
A guidance language for controlling large language models.
BerriAI/litellm
Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
google/gemma_pytorch
The official PyTorch implementation of Google's Gemma models
allenai/OLMo
Modeling, training, eval, and inference code for OLMo
facebookresearch/lingua
Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
openai/transformer-debugger
PAIR-code/lit
The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
spcl/graph-of-thoughts
Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
run-llama/llama_deploy
Deploy your agentic workflows to production
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
facebookresearch/DPR
Dense Passage Retriever: a set of tools and models for open-domain Q&A tasks.
google-research/bleurt
BLEURT is a metric for Natural Language Generation based on transfer learning.
likenneth/honest_llama
Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
AlignmentResearch/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
ndif-team/nnsight
The nnsight package enables interpreting and manipulating the internals of deep learning models.
kmeng01/memit
Mass-editing thousands of facts into a transformer memory (ICLR 2023)
epfml/landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
nelson-liu/lost-in-the-middle
Code and data for "Lost in the Middle: How Language Models Use Long Contexts"
night-chen/ToolQA
ToolQA is a new dataset for evaluating the ability of LLMs to answer challenging questions with external tools. It offers two difficulty levels (easy/hard) across eight real-life scenarios.
davidbau/baukit
ADAPT-uiuc/dias
Dias: Dynamic Rewriting of Pandas Code
google/belief-localization
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Can Be Injected in Language Models."
DigitalHarborFoundation/llm-math-education
Retrieval augmented generation for middle-school math question answering and hint generation.
AI4LIFE-GROUP/LLM_Explainer
Code for paper: Are Large Language Models Post Hoc Explainers?
seilna/CNN-Units-in-NLP
Repository for our ICLR 2019 paper: Discovery of Natural Language Concepts in Individual Units of CNNs
stefan-grafberger/mlwhatif
Data-Centric What-If Analysis for Native Machine Learning Pipelines
DigitalHarborFoundation/rag-for-math-qa
Analysis code for a research paper