LucienJi

Student of TTIC, jijingtian@ttic.edu

TTIC · Chicago

LucienJi's Stars

afshinea/stanford-cs-229-machine-learning
VIP cheatsheets for Stanford's CS 229 Machine Learning
17.7k · 4k
NLP-LOVE/ML-NLP
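This project covers the knowledge points and code implementations commonly tested in machine learning, deep learning, and NLP interviews; it is also the theoretical foundation every algorithm engineer needs.
Language: Jupyter Notebook · 16.2k · 4.6k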
kailashahirwar/cheatsheets-ai
Essential Cheat Sheets for deep learning and machine learning researchers https://medium.com/@kailashahirwar/essential-cheat-sheets-for-machine-learning-and-deep-learning-researchers-efb6a8ebd2e5
15.1k · 3.4k
A-make/awesome-control-theory
Awesome resources for learning control theory
53974
amkatrutsa/optimization_course
A course on Optimization Methods
Language: Jupyter Notebook · 15053
PKU-Alignment/omnisafe
JMLR: OmniSafe is an infrastructural framework for accelerating SafeRL research.
Language: Python · 952132
chauncygu/Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
Language: Jupyter Notebook · 52981
singhaman1750/Legged-Robots
This repository contains papers in the field of legged robots.
344
AtsushiSakai/PythonRobotics
Python sample codes for robotics algorithms.
Language: Python · 23.6k · 6.6k
Skylark0924/Reinforcement-Learning-in-Robotics
This is a private learning repository for reinforcement learning techniques used in robotics.
Language: HTML · 38554
eleurent/phd-bibliography
References on Optimal Control, Reinforcement Learning and Motion Planning
930205
MJianM/ros_best_practices
Best practices, conventions, and tricks for ROS. Do you want to become a robotics master? Then consider graduating or working at the Robotics Systems Lab at ETH in Zürich!
1
opendilab/awesome-diffusion-model-in-rl
A curated list of Diffusion Model in RL resources (continually updated)
87947
Improbable-AI/walk-these-ways
Sim-to-real RL training and deployment tools for the Unitree Go1 robot.
Language: Python · 630157
rvl-lab-utoronto/lab_onboarding_recommended_reading
This repository is a collection of papers and research material that students need to be aware of when they are getting started with research in the lab
644
jackaduma/ChatGLM-LoRA-RLHF-PyTorch
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatGPT but with ChatGLM
Language: Python · 12810
karpathy/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Language: Python · 37.9k · 6.1k
karpathy/ng-video-lecture
Language: Python · 3.6k · 953
lonePatient/awesome-pretrained-chinese-nlp-models
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
Language: Python · 5k · 480
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Language: Python · 40.8k · 5.2k
X-PLUG/CValues
Research on evaluating and aligning the values of Chinese large language models
Language: Python · 48020
jianzhnie/awesome-instruction-datasets
A collection of awesome-prompt-datasets and awesome-instruction-datasets, gathering instruction datasets of all kinds for training ChatLLM models such as ChatGPT.
56230
CarperAI/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language: Python · 4.5k · 471
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language: Python · 35.9k · 4.2k
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Language: Python · 16.7k · 1.7k
OpenLMLab/MOSS-RLHF
Secrets of RLHF in Large Language Models Part I: PPO
Language: Python · 1.3k · 101
KernelA/nds-py
A Python implementation of the non-dominated sorting.
Language: Python · 131
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Language: Python · 136k · 27.3k
kenjihiranabe/The-Art-of-Linear-Algebra
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Language: PostScript · 18.2k · 2.2k
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
Language: Python · 38.9k · 4.4k
