xuemei-ye

Beijing

xuemei-ye's Stars

hydecorp/hydejack-starter-kit
A quicker, cleaner way to get started blogging with Hydejack.
Language:HTML151436
boyu-ai/Hands-on-RL
https://hrl.boyuai.com/
Language:Jupyter Notebook2.4k534
Future-House/paper-qa
High accuracy RAG for answering questions from scientific documents with citations
Language:Python6.2k582
opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
68928
brianmaierjr/long-haul
A minimal, type-focused Jekyll theme.
Language:SCSS673773
cotes2020/chirpy-starter
A website startup template using the Chirpy theme gem.
Language:Shell582289
OuYaMing/Image-classification-and-target-detection-by-pytorch
pytorch入门项目，包括线性回归、垃圾分类、水果目标检测、ssd
Language:Python9122
lang-du/fruit_detection
水果检测并分类
Language:Python247
MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning
This is the homepage of a new book entitled "Mathematical Foundations of Reinforcement Learning."
Language:MATLAB3.6k485
HqWu-HITCS/Awesome-Chinese-LLM
整理开源的中文大语言模型，以规模较小、可私有化部署、训练成本较低的模型为主，包括底座模型，垂直领域微调及应用，数据集与教程等。
15.4k1.4k
blcuicall/taoli
"桃李“: 国际中文教育大模型
Language:Python16718
wangshusen/RecommenderSystem
2.3k352
Kulbear/deep-learning-coursera
Deep Learning Specialization by Andrew Ng on Coursera.
Language:Jupyter Notebook7.5k5.5k
dennybritz/reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Language:Jupyter Notebook20.5k6k
apachecn/ailearning
AiLearning：数据分析+机器学习实战+线性代数+PyTorch+NLTK+TF2
Language:Python39.3k11.4k
chihming/competitive-recsys
A collection of resources for Recommender Systems (RecSys)
527115
mitmath/1806
18.06 course at MIT
Language:Jupyter Notebook2.5k682
bannedbook/fanqiang
翻墙-科学上网
Language:Kotlin38.2k7.3k
mengf1/DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
Language:Python667
jiangyiqun233/PRML_learning
learning fomula
Language:Jupyter Notebook27859
remoteintech/remote-jobs
A list of semi to fully remote-friendly companies (jobs) in tech.
Language:JavaScript29.5k3.1k
NeuroCSUT/DeepMind-Atari-Deep-Q-Learner-2Player
Multiagent Cooperation and Competition with Deep Reinforcement Learning
Language:Lua12336
sisl/MADRL
Repo containing code for multi-agent deep reinforcement learning (MADRL).
Language:Python657123
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python34.6k8.6k
F4bwDP6a6W/FLY_US
美国大学备考资料 How to apply US colleges
Language:HTML2.9k781
apexrl/RL-Exploration-Paper-Lists
Paper Collection of Reinforcement Learning Exploration covers Exploration of Muti-Arm-Bandit, Reinforcement Learning and Multi-agent Reinforcement Learning.
344
junhyukoh/deep-reinforcement-learning-papers
A list of recent papers regarding deep reinforcement learning
2.2k555
brendanator/atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
Language:Python13732
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k829

xuemei-ye

xuemei-ye's Stars

hydecorp/hydejack-starter-kit

boyu-ai/Hands-on-RL

Future-House/paper-qa

opendilab/awesome-decision-transformer

brianmaierjr/long-haul

cotes2020/chirpy-starter

OuYaMing/Image-classification-and-target-detection-by-pytorch

lang-du/fruit_detection

MathFoundationRL/Book-Mathematical-Foundation-of-Reinforcement-Learning

HqWu-HITCS/Awesome-Chinese-LLM

blcuicall/taoli

wangshusen/RecommenderSystem

Kulbear/deep-learning-coursera

bumingbaipod/podcast

dennybritz/reinforcement-learning

apachecn/ailearning

chihming/competitive-recsys

mitmath/1806

bannedbook/fanqiang

mengf1/DHER

jiangyiqun233/PRML_learning

remoteintech/remote-jobs

NeuroCSUT/DeepMind-Atari-Deep-Q-Learner-2Player

sisl/MADRL

openai/gym

F4bwDP6a6W/FLY_US

apexrl/RL-Exploration-Paper-Lists

junhyukoh/deep-reinforcement-learning-papers

brendanator/atari-rl

ikostrikov/pytorch-a2c-ppo-acktr-gail