Heepo
Machine Learning | Large Language Models | NLP | Search | Recommendation
Beijing University of Posts and Telecommunications, Beijing
Heepo's Stars
simoninithomas/Deep_reinforcement_learning_Course
Implementations from the free course Deep Reinforcement Learning with TensorFlow and PyTorch
tencent-ailab/persona-hub
Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
openai/simple-evals
karpathy/minbpe
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
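The idea behind BPE tokenization is simple enough to sketch: repeatedly find the most frequent adjacent pair of token ids and replace it with a new id. A minimal illustration follows; the function name is illustrative, not minbpe's actual API.

```python
from collections import Counter

def bpe_train(text, num_merges):
    """Learn BPE merges over raw UTF-8 bytes (sketch of the core idea)."""
    ids = list(text.encode("utf-8"))
    merges = {}          # (a, b) -> new token id
    next_id = 256        # byte values occupy 0..255
    for _ in range(num_merges):
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        pair = max(pairs, key=pairs.get)   # most frequent adjacent pair
        merges[pair] = next_id
        # replace every occurrence of the pair with the new id
        out, i = [], 0
        while i < len(ids):
            if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
                out.append(next_id)
                i += 2
            else:
                out.append(ids[i])
                i += 1
        ids = out
        next_id += 1
    return merges, ids
```

Decoding is the reverse walk: expand each learned id back into its pair until only bytes remain.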
westlake-repl/Recommendation-Systems-without-Explicit-ID-Features-A-Literature-Review
Paper List of Pre-trained Foundation Recommender Models
LargeWorldModel/LWM
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), GANs (CycleGAN, StyleGAN2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
allenai/fm-cheatsheet
Website for hosting the Open Foundation Models Cheat Sheet.
EleutherAI/cookbook
Deep learning for dummies. All the practical details and useful utilities that go into working with real models.
WLiK/LLM4Rec-Awesome-Papers
A list of awesome papers and resources of recommender system on large language model (LLM).
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
karpathy/llm.c
LLM training in simple, raw C/CUDA
SafeAILab/EAGLE
Official Implementation of EAGLE-1 and EAGLE-2
declare-lab/instruct-eval
This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.
qinyiwei/InfoBench
openai/transformer-debugger
huggingface/alignment-handbook
Robust recipes to align language models with human and AI preferences
dora-rs/dora
DORA (Dataflow-Oriented Robotic Application) is middleware designed to streamline and simplify the creation of AI-based robotic applications. It offers low-latency, composable, and distributed dataflow capabilities. Applications are modeled as directed graphs, also referred to as pipelines.
OFA-Sys/InsTag
InsTag: A Tool for Data Analysis in LLM Supervised Fine-tuning
google/active-learning
hpcaitech/Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
NUS-HPC-AI-Lab/OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
databricks/dbrx
Code examples and resources for DBRX, a large language model developed by Databricks
arcee-ai/mergekit
Tools for merging pretrained large language models.
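The simplest merge mergekit supports is a weighted linear average of parameter tensors. A plain-Python sketch of that idea, with parameters as lists of floats rather than the torch tensors the real tool operates on:

```python
def linear_merge(state_dicts, weights):
    """Weighted average of per-parameter values across models
    (sketch of the 'linear' merge method; not mergekit's API)."""
    assert abs(sum(weights) - 1.0) < 1e-9, "weights should sum to 1"
    merged = {}
    for name in state_dicts[0]:
        merged[name] = [
            sum(w * sd[name][i] for sd, w in zip(state_dicts, weights))
            for i in range(len(state_dicts[0][name]))
        ]
    return merged
```

More sophisticated methods in the toolkit (e.g. task arithmetic or TIES) additionally resolve sign conflicts and sparsify parameter deltas before combining them.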
RLHFlow/RLHF-Reward-Modeling
Recipes to train reward model for RLHF.
PKU-Alignment/beavertails
BeaverTails is a collection of datasets designed to facilitate research on safety alignment in large language models (LLMs).
allenai/reward-bench
RewardBench: the first evaluation tool for reward models.
ContextualAI/HALOs
A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).
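The DPO objective at the heart of such libraries is compact: penalize the model when the implicit reward margin between the chosen and rejected response shrinks. A minimal single-pair sketch (inputs are summed log-probabilities; hyperparameter `beta` scales the margin):

```python
import math

def dpo_loss(pi_chosen, pi_rejected, ref_chosen, ref_rejected, beta=0.1):
    """DPO loss for one preference pair:
    -log sigmoid(beta * [(log pi_w - log ref_w) - (log pi_l - log ref_l)])."""
    margin = (pi_chosen - ref_chosen) - (pi_rejected - ref_rejected)
    logits = beta * margin
    return -math.log(1.0 / (1.0 + math.exp(-logits)))  # -log sigmoid(logits)
```

With a zero margin the loss is log 2; as the policy's preference for the chosen response grows relative to the reference model, the loss falls toward zero. KTO, ORPO, and the other HALOs swap in different functions of this same implicit reward.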
abacusai/smaug
stanfordnlp/dspy
DSPy: The framework for programming—not prompting—foundation models