ericxian1997's Stars
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
geekan/HowToLiveLonger
A programmer's guide to living longer
meta-llama/llama3
The official Meta Llama 3 GitHub site
facebookresearch/fastText
Library for fast text representation and classification.
spotify/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, etc. It also comes with Hadoop support built in.
Lightning-AI/litgpt
20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.
jzhang38/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
openai/transformer-debugger
atfortes/Awesome-LLM-Reasoning
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
opendilab/PPOxFamily
PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to clarify the algorithm theory, walk through the code logic, and put decision-making AI into practice)
GAIR-NLP/O1-Journey
O1 Replication Journey: A Strategic Progress Report – Part I
jquesnelle/yarn
YaRN: Efficient Context Window Extension of Large Language Models
mlfoundations/dclm
DataComp for Language Models
yaodongC/awesome-instruction-dataset
A collection of open-source datasets for training instruction-following LLMs (ChatGPT, LLaMA, Alpaca)
trotsky1997/MathBlackBox
SimpleBerry/LLaMA-O1
Large Reasoning Models
NVIDIA/NeMo-Curator
Scalable data preprocessing and curation toolkit for LLMs
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
madaan/self-refine
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
FranxYao/Long-Context-Data-Engineering
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
MARIO-Math-Reasoning/Super_MARIO
facebookresearch/MetaICL
An original implementation of "MetaICL: Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer, and Hannaneh Hajishirzi
OFA-Sys/InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
withinmiaov/A-Survey-on-Mixture-of-Experts
The official GitHub page for the survey paper "A Survey on Mixture of Experts".
src-d/minhashcuda
Weighted MinHash implementation on CUDA (multi-GPU).
TIGER-AI-Lab/LongICLBench
Code and Data for "Long-context LLMs Struggle with Long In-context Learning"
allenai/allennlp-reading-comprehension
OpenLMLab/scaling-rope
Code for "Scaling Laws of RoPE-based Extrapolation"
hughbzhang/o1_inference_scaling_laws
Replicating O1 inference-time scaling laws
tml-epfl/icl-alignment
Is In-Context Learning Sufficient for Instruction Following in LLMs?