orangeadegit's Stars
996icu/996.ICU
Repo for counting stars and contributing. Press F to pay respect to glorious developers.
facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
lib-pku/libpku
An unofficial, community-compiled collection of your university's course materials
meta-llama/llama3
The official Meta Llama 3 GitHub site
karpathy/llm.c
LLM training in simple, raw C/CUDA
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
liguodongiot/llm-action
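This project shares the technical principles and hands-on experience behind large models (large-model engineering and real-world deployment of large-model applications)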
yidao620c/python3-cookbook
Chinese translation of Python Cookbook, 3rd Edition
ctgk/PRML
PRML algorithms implemented in Python
huggingface/trl
Train transformer language models with reinforcement learning.
fuck-xuexiqiangguo/Fuck-XueXiQiangGuo
Xuexi Qiangguo (学习强国) point-farming tool for the lazy, with automatic study
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
opendilab/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
knazeri/edge-connect
EdgeConnect: Structure Guided Image Inpainting using Edge Prediction, ICCV 2019 https://arxiv.org/abs/1901.00212
eric-mitchell/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
kingyiusuen/image-to-latex
Convert images of LaTeX math equations into LaTeX code.
fh2019ustc/Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
fh2019ustc/DocTr
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
djiajunustc/Voxel-R-CNN
djiajunustc/TransVG
djiajunustc/H-23D_R-CNN
teslacool/m-curl
M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning
LQNew/Deeper_Larger_Actor-Critic_RL
PyTorch implementation of large network design in continuous control RL.
teslacool/preprocess_iwslt
Data preprocessing for fairseq input
teslacool/RL-Algo-Zoo
orangeadegit/RL-Algo-Zoo