kk-machine-learning

kk-machine-learning's Stars

lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Language:Python36.6k 348 1.8k4.5k
SerenityOS/serenity
The Serenity Operating System 🐞
Language:C++30.4k 349 4.2k3.2k
apache/skywalking
APM, Application Performance Monitoring System
Language:Java23.8k 836 5.3k6.5k
brendangregg/FlameGraph
Stack trace visualizer
Language:Perl17.1k 481 1492k
pwndbg/pwndbg
Exploit Development and Reverse Engineering with GDB Made Easy
Language:Python7.4k 136 894875
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Language:Python5.4k 55 540389
imoneoi/openchat
OpenChat: Advancing Open-source Language Models with Imperfect Data
Language:Python5.2k 49 187399
deepseek-ai/DeepSeek-V2
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
3.5k 29 84143
hellogcc/100-gdb-tips
A collection of gdb tips. 100 maybe just mean many here.
Language:Go3k 174 7710
microsoft/CodeBERT
CodeBERT
Language:Python2.2k 38 297448
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python2.1k 21 249206
PKU-Alignment/safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Language:Python1.3k 18 84119
GanjinZero/RRHF
[NIPS2023] RRHF & Wombat
Language:Python792 10 4949
naver/splade
SPLADE: sparse neural search (SIGIR21, SIGIR22)
Language:Python750 20 5083
NVIDIA/NeMo-Aligner
Scalable toolkit for efficient model alignment
Language:Python522 16 7058
salesforce/CodeRL
This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).
Language:Python494 18 2762
allenai/OLMoE
OLMoE: Open Mixture-of-Experts Language Models
Language:Jupyter Notebook397 8 530
RLHFlow/Online-RLHF
A recipe for online RLHF and online iterative DPO.
Language:Python383 18 2144
OpenBMB/Eurus
Language:Python275 11 1015
apache/skywalking-rover
Monitor and profiler powered by eBPF to monitor network traffic, and diagnose CPU and network performance.
Language:Go194 41 043
KiraMelody/nemu
抄nemu的同学点个star好嘛
Language:C141 3 020
deepseek-ai/ESFT
Expert Specialized Fine-Tuning
Language:Python135 7 413
juyongjiang/CodeUp
CodeUp: A Multilingual Code Generation Llama2 Model with Parameter-Efficient Instruction-Tuning on a Single RTX 3090
Language:Jupyter Notebook116 9 610
sl1673495/bytedance-apm-group
字节跳动 APM 团队预备招聘社群，来一起聊聊大厂面试经验、简历如何编写、技术……
110 8 04
imbue-ai/carbs
Cost aware hyperparameter tuning algorithm
Language:Jupyter Notebook99 9 56
sail-sg/sdft
[ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".
Language:Shell85 6 124
louieworth/awesome-rlhf
An index of algorithms for reinforcement learning from human feedback (rlhf))
84 8 01
hkust-nlp/dart-math
[NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*
Language:Jupyter Notebook61 1 43
wang2226/FOLK
Language:Python16 1 04
bravikov/parallel-stacks
Language:Python14 1 10