1JZER

1JZER's Stars

LeslieTrue/SFTvsRL
Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Language: Python · 24914
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language: Python · 5.8k stars · 564 forks
huggingface/open-r1
Fully open reproduction of DeepSeek-R1
Language: Python · 23.1k stars · 2.1k forks
istio/istio
Connect, secure, control, and observe services.
Language: Go · 36.6k stars · 7.9k forks
unslothai/unsloth
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Language: Python · 35.5k stars · 2.7k forks
haoel/haoel.github.io
Language: Shell · 12.9k stars · 2k forks
Jiayi-Pan/TinyZero
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Language: Python · 11.3k stars · 1.4k forks
EvolvingLMMs-Lab/open-r1-multimodal
A fork to add multimodal model training to open-r1
Language: Python · 1.1k stars · 59 forks
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language: Python · 1.3k stars · 168 forks
Marker-Inc-Korea/AutoRAG
AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation
Language: Python · 3.7k stars · 288 forks
FreedomIntelligence/Evaluation-of-ChatGPT-on-Information-Extraction
An Evaluation of ChatGPT on Information Extraction task, including Named Entity Recognition (NER), Relation Extraction (RE), Event Extraction (EE) and Aspect-based Sentiment Analysis (ABSA).
Language: Python · 13211
OneSizeFitsQuorum/MIT6.824-2021
4 labs + 2 challenges + 4 docs
Language: Shell · 1.5k stars · 247 forks
wangzhengquan/MIT6.824
Language: Go · 212
avelino/awesome-go
A curated list of awesome Go frameworks, libraries and software
Language: Go · 140k stars · 12.2k forks
gorilla/websocket
Package gorilla/websocket is a fast, well-tested and widely used WebSocket implementation for Go.
Language: Go · 23.3k stars · 3.5k forks
golangci/golangci-lint
Fast linters runner for Go
Language: Go · 16.5k stars · 1.4k forks
istio/community
Istio governance material.
Language: Go · 2.9k stars · 579 forks
romkatv/powerlevel10k
A Zsh theme
Language: Shell · 48.5k stars · 2.3k forks
golang-standards/project-layout
Standard Go Project Layout
Language: Makefile · 51.5k stars · 5.3k forks
meta-llama/llama
Inference code for Llama models
Language: Python · 57.9k stars · 9.7k forks
Tongji-KGLLM/RAG-Survey
2k stars · 129 forks
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
1.1k stars · 95 forks
FreedomIntelligence/OVM
Language: Python · 656
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.8k stars · 237 forks
xtaci/kcptun
A Quantum-Safe Secure Tunnel based on QPP, KCP, FEC, and N:M multiplexing.
Language: Go · 14.1k stars · 2.6k forks
wenge-research/YAYI-UIE
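YAYI large model for information extraction: instruction-tuned on millions of manually constructed, high-quality information extraction examples, developed by the Zhongke Wenge algorithm team. (Repo for YAYI Unified Information Extraction Model)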
29514
fatedier/frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
Language: Go · 91.7k stars · 13.8k forks
datawhalechina/easy-rl
Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at: https://datawhalechina.github.io/easy-rl/
Language: Jupyter Notebook · 10.7k stars · 2k forks
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python · 12.7k stars · 1.7k forks
jasonvanf/llama-trl
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Language: Python · 20523