xuemei-ye's Stars
hydecorp/hydejack-starter-kit
A quicker, cleaner way to get started blogging with Hydejack.
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
brianmaierjr/long-haul
A minimal, type-focused Jekyll theme.
cotes2020/chirpy-starter
A website startup template using the Chirpy theme gem.
OuYaMing/Image-classification-and-target-detection-by-pytorch
pytorch入门项目,包括线性回归、垃圾分类、水果目标检测、ssd
lang-du/fruit_detection
水果检测并分类
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
blcuicall/taoli
"桃李“: 国际中文教育大模型
wangshusen/RecommenderSystem
Kulbear/deep-learning-coursera
Deep Learning Specialization by Andrew Ng on Coursera.
bumingbaipod/podcast
此 GitHub 作为《不明白播客》官网的备份站,用于分享文字版播客。 版权所有 ©️ 不明白播客 bumingbai.net
dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
apachecn/ailearning
AiLearning:数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2
chihming/competitive-recsys
A collection of resources for Recommender Systems (RecSys)
mitmath/1806
18.06 course at MIT
bannedbook/fanqiang
翻墙-科学上网
mengf1/DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
jiangyiqun233/PRML_learning
learning fomula
remoteintech/remote-jobs
A list of semi to fully remote-friendly companies (jobs) in tech.
NeuroCSUT/DeepMind-Atari-Deep-Q-Learner-2Player
Multiagent Cooperation and Competition with Deep Reinforcement Learning
sisl/MADRL
Repo containing code for multi-agent deep reinforcement learning (MADRL).
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
F4bwDP6a6W/FLY_US
美国大学备考资料 How to apply US colleges
apexrl/RL-Exploration-Paper-Lists
Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Reinforcement Learning.
junhyukoh/deep-reinforcement-learning-papers
A list of recent papers regarding deep reinforcement learning
brendanator/atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).