orangeadegit

orangeadegit's Stars

996icu/996.ICU
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
270k 4.2k 021.1k
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
Language:Python30.8k 425 4.2k6.4k
lib-pku/libpku
贵校课程资料民间整理
Language:TeX30.5k 1.2k 418.3k
meta-llama/llama3
The official Meta Llama 3 GitHub site
Language:Python27.8k 233 2743.2k
karpathy/llm.c
LLM training in simple, raw C/CUDA
Language:Cuda25k 252 1412.8k
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
Language:Jupyter Notebook13.4k 326 3252.6k
liguodongiot/llm-action
本项目旨在分享大模型相关技术原理以及实战经验（大模型工程化、大模型应用落地）
Language:HTML12.6k 104 241.4k
yidao620c/python3-cookbook
《Python Cookbook》 3rd Edition Translation
Language:Jupyter Notebook11.8k 499 1003k
ctgk/PRML
PRML algorithms implemented in Python
Language:Jupyter Notebook11.5k 416 243.3k
huggingface/trl
Train transformer language models with reinforcement learning.
Language:Python10.5k 77 1.3k1.4k
fuck-xuexiqiangguo/Fuck-XueXiQiangGuo
学习强国懒人刷分工具自动学习
8.3k 284 5861.8k
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python3.9k 128 419678
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
3.6k 61 4220
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language:Python3.6k 28 381341
knazeri/edge-connect
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
Language:Python2.5k 70 174532
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python2.3k 19 84189
kingyiusuen/image-to-latex
Convert images of LaTex math equations into LaTex code.
Language:Python2.1k 19 28312
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
383 13 429
fh2019ustc/DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Language:Python363 17 3150
djiajunustc/Voxel-R-CNN
Language:Python265 5 3141
djiajunustc/TransVG
Language:Python171 2 4227
djiajunustc/H-23D_R-CNN
Language:Python65 2 64
teslacool/m-curl
M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning
Language:Python28 2 14
LQNew/Deeper_Larger_Actor-Critic_RL
Pytorch implementation of large network design in continous control RL.
Language:Python18 1 10
teslacool/preprocess_iwslt
data preprocess for fairseq input
Language:Shell8 0 00
teslacool/RL-Algo-Zoo
Language:Python4 1 01
orangeadegit/RL-Algo-Zoo
Language:Python3 1 00