Shawn-Guo-CN
Fourth-year CDT-NLP student at the University of Edinburgh.
School of Informatics, University of Edinburgh, Edinburgh, UK
Shawn-Guo-CN's Stars
codecrafters-io/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
xai-org/grok-1
Grok open release
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Dao-AILab/flash-attention
Fast and memory-efficient exact attention
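For context on what this library exposes, here is a minimal sketch of its core entry point, assuming the `flash_attn_func` call described in the repo README, with fp16 tensors of shape `(batch, seqlen, nheads, headdim)` on a CUDA device:

```python
# Minimal sketch (assumed API: flash_attn.flash_attn_func; shapes and dtypes
# follow the repo README: (batch, seqlen, nheads, headdim), fp16/bf16 on CUDA).
import torch
from flash_attn import flash_attn_func

batch, seqlen, nheads, headdim = 2, 1024, 8, 64
q = torch.randn(batch, seqlen, nheads, headdim, dtype=torch.float16, device="cuda")
k = torch.randn_like(q)
v = torch.randn_like(q)

# Exact (not approximate) attention, computed tile-by-tile so the full
# seqlen x seqlen score matrix is never materialized in GPU memory.
out = flash_attn_func(q, k, v, causal=True)
print(out.shape)  # (batch, seqlen, nheads, headdim)
```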
alshedivat/al-folio
A beautiful, simple, clean, and responsive Jekyll theme for academics
NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
huggingface/trl
Train transformer language models with reinforcement learning.
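As a rough illustration of the training loop this library wraps, a minimal supervised fine-tuning sketch, assuming the `SFTTrainer` entry point; the model and dataset names are placeholders, and keyword arguments vary across trl releases:

```python
# Minimal SFT sketch with TRL (assumed API: trl.SFTTrainer; treat argument
# names as illustrative, since they differ between trl versions).
from datasets import load_dataset
from trl import SFTTrainer

# Placeholder dataset: any instruction-style dataset with a chat/"text" column.
train_dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2-0.5B",      # placeholder: any causal LM checkpoint on the Hub
    train_dataset=train_dataset,
)
trainer.train()
```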
mistralai/mistral-inference
Official inference library for Mistral models
leptonai/search_with_lepton
Building a quick conversation-based search demo with Lepton AI.
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
BAAI-Agents/Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle enables agents to master any computer task through strong reasoning, self-improvement, and skill curation, in a standardized general environment with minimal requirements.
gkamradt/LLMTest_NeedleInAHaystack
Doing simple retrieval from LLMs at various context lengths to measure accuracy
ZhiningLiu1998/awesome-imbalanced-learning
😎 Everything about class-imbalanced/long-tail learning: papers, code, frameworks, and libraries
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward models for RLHF.
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
jzhang38/EasyContext
Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.
bojone/bytepiece
A purer tokenizer with a higher compression ratio
123penny123/Awesome-LLM-RL
A comprehensive list of PAPERS, CODEBASES, and DATASETS on decision making with foundation models, including LLMs and VLMs.
LLaMafia/llamafia.github
sanderwood/bgpt
Beyond Language Models: Byte Models are Digital World Simulators
sangmichaelxie/doremi
PyTorch implementation of DoReMi, a method for optimizing the data mixture weights in language modeling datasets
rwitten/HighPerfLLMs2024
allenai/WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
zorazrw/awesome-tool-llm
Edward-Sun/easy-to-hard
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
kddubey/cappr
Completion After Prompt Probability. Make your LLM make a choice.
facebookresearch/LIGHT
LIGHT is a platform for text-situated dialogue research. We originally hosted LIGHT as a live game with dialogue models in a grounded setting. This repo contains all of the code needed to run the LIGHT game, as well as reproducible code for the research projects conducted along the way.
jiahe7ay/infini-mini-transformer
This is a personal reimplementation of Google's Infini-transformer, using a small 2B model. The project includes both model and training code.
allenai/easy-to-hard-generalization
Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"
LLaMafia/SFT_function_learning
Explore what LLMs are really learning during SFT