soheeyang
PhD student/Intern at UCL/DeepMind. Previously MS student at KAIST AI and research engineer at Naver Clova. NLP & ML. Wherever curiosity leads me.
UCL/DeepMind · London, United Kingdom
soheeyang's Stars
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
meta-llama/llama3
The official Meta Llama 3 GitHub site
koekeishiya/yabai
A tiling window manager for macOS based on binary space partitioning
microsoft/JARVIS
JARVIS, a system to connect LLMs with the ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
openai/chatgpt-retrieval-plugin
The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.
karpathy/llama2.c
Inference Llama 2 in one file of pure C
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
BlinkDL/RWKV-LM
RWKV is an RNN with transformer-level LLM performance. It can be trained directly like a GPT (parallelizable), combining the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
databrickslabs/dolly
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
OptimalScale/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
openai/transformer-debugger
TransformerLensOrg/TransformerLens
A library for mechanistic interpretability of GPT-style language models
Farama-Foundation/chatarena
ChatArena (or Chat Arena) is a multi-agent language game environment for LLMs. The goal is to develop the communication and collaboration capabilities of AIs.
openai/automated-interpretability
allenai/natural-instructions
Expanding natural instructions
google-research-datasets/dstc8-schema-guided-dialogue
The Schema-Guided Dialogue Dataset
reasoning-machines/pal
PaL: Program-Aided Language Models (ICML 2023)
princeton-nlp/LESS
[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning
google-deepmind/synjax
manyoso/haltt4llm
This project is an attempt to create a common metric for testing LLMs' progress in eliminating hallucinations, which is the most serious current obstacle to the widespread adoption of LLMs for many real purposes.
jax-ml/oryx
Oryx is a library for probabilistic programming and deep learning built on top of Jax.
TransformerLensOrg/CircuitsVis
Mechanistic Interpretability Visualizations using React
ArthurConmy/Automatic-Circuit-Discovery
OSU-NLP-Group/GrokkedTransformer
Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'
google-research-datasets/presto
A Multilingual Dataset for Parsing Realistic Task-Oriented Dialogs
seonghyeonye/TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
edenbiran/RippleEdits
Evaluating the Ripple Effects of Knowledge Editing in Language Models
Nix07/finetuning
This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity Tracking".
edenbiran/HoppingTooLate
Exploring the Limitations of Large Language Models on Multi-Hop Queries