Sunkyoung's Stars
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
microsoft/unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
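A minimal sketch of the typical workflow, using the public `imdb` dataset as a stand-in for any Hub dataset:

```python
# Load a public dataset from the Hub, then filter and transform it.
from datasets import load_dataset

# Downloads and caches the data on first use; "imdb" is just an example id.
dataset = load_dataset("imdb", split="train")

# Row-wise operations; results are cached on disk as Arrow files.
short = dataset.filter(lambda ex: len(ex["text"]) < 500)
lower = short.map(lambda ex: {"text": ex["text"].lower()})

print(len(dataset), len(short), lower[0]["text"][:80])
```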
deepset-ai/haystack
🔍 AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search, or conversational agent chatbots.
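A minimal sketch of such a pipeline, with component and socket names following the Haystack 2.x docs; treat them as assumptions if your version differs, and note that `OpenAIGenerator` expects `OPENAI_API_KEY` in the environment:

```python
# Sketch of a minimal Haystack pipeline: a prompt template feeding an LLM.
from haystack import Pipeline
from haystack.components.builders import PromptBuilder
from haystack.components.generators import OpenAIGenerator

pipe = Pipeline()
pipe.add_component("builder", PromptBuilder(template="Answer briefly: {{ question }}"))
pipe.add_component("llm", OpenAIGenerator())

# Wire the builder's rendered prompt into the generator's prompt input.
pipe.connect("builder.prompt", "llm.prompt")

result = pipe.run({"builder": {"question": "What is retrieval-augmented generation?"}})
print(result["llm"]["replies"][0])
```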
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
ivy-llc/ivy
Convert Machine Learning Code Between Frameworks
dsdanielpark/Bard-API
An unofficial Python package that returns Google Bard responses via a browser cookie value.
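Basic usage per the package README: you pass the value of Bard's `__Secure-1PSID` session cookie as the token. `BARD_COOKIE` below is an arbitrary environment-variable name chosen for this sketch:

```python
# Pass the value of Bard's __Secure-1PSID browser cookie as the token.
import os
from bardapi import Bard

bard = Bard(token=os.environ["BARD_COOKIE"])
answer = bard.get_answer("Summarize instruction tuning in one sentence.")
print(answer["content"])
```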
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
thunlp/PromptPapers
Must-read papers on prompt-based tuning for pre-trained language models.
mosaicml/llm-foundry
LLM training code for Databricks foundation models
Beomi/KoAlpaca
KoAlpaca: an open-source language model that understands Korean instructions
EleutherAI/the-pile
The Pile: an 800GB dataset of diverse text for language modeling
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
hollobit/GenAI_LLM_timeline
Timeline of ChatGPT, generative AI, and LLMs
SinclairCoder/Instruction-Tuning-Papers
Reading list on instruction tuning, a trend starting from Natural-Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
allenai/dont-stop-pretraining
Code associated with the Don't Stop Pretraining ACL 2020 paper
bigscience-workshop/xmtf
Crosslingual Generalization through Multitask Finetuning
EleutherAI/polyglot
Polyglot: large language models with well-balanced competence across multiple languages
AlignmentResearch/tuned-lens
Tools for understanding how transformer predictions are built layer-by-layer
JohnGiorgi/DeCLUTR
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
microsoft/Table-Pretraining
[ICLR 2022] TAPEX: Table Pre-training via Learning a Neural SQL Executor, a state-of-the-art table pre-training model
krishnap25/mauve
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
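A quick sketch of the API: `compute_mauve` accepts raw text lists and featurizes them with a GPT-2 model by default. The toy inputs here are illustrative only; MAUVE is meant to compare distributions over many samples per side:

```python
# Compute the MAUVE score between human-written and model-generated text.
import mauve

human_texts = ["The cat sat on the mat.", "It rained all afternoon."]
model_texts = ["A cat is sitting on a mat.", "Rain fell through the afternoon."]

out = mauve.compute_mauve(p_text=human_texts, q_text=model_texts,
                          device_id=-1)  # -1 runs featurization on CPU
print(out.mauve)  # closer to 1.0 means the two text distributions are closer
```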
bigscience-workshop/lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
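A sketch of a zero-shot run through the harness's Python entry point, as in the upstream EleutherAI harness this repo forks; the model and task identifiers below are placeholders and vary between harness versions:

```python
# Evaluate a HuggingFace causal LM on a benchmark task via the harness.
from lm_eval import evaluator

results = evaluator.simple_evaluate(
    model="hf-causal",             # HuggingFace causal-LM backend
    model_args="pretrained=gpt2",  # any HF model id works here
    tasks=["lambada_openai"],      # placeholder task name
    num_fewshot=0,
)
print(results["results"])
```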
joeljang/ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
microsoft/TUTA_table_understanding
TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training
joeljang/continual-knowledge-learning
[ICLR 2022] Towards Continual Knowledge Learning of Language Models
seonghyeonye/TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
boychaboy/KOLD
KOLD: Korean Offensive Language Dataset
seonghyeonye/RoSPr
[EMNLP 2023 Findings] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt
swstarlab-infolab/format_converter
Space-efficient graph data converter