StevenJokess
ASB AI Principal Idiot. Research and code for International Communist. Real AI will give itself the power to judge world, consider human ignorance
FUCKING Money&Job hunting,失业要饭QQ群:171097552
StevenJokess's Stars
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
lm-sys/FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Sinaptik-AI/pandas-ai
Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
waditu/tushare
TuShare is a utility for crawling historical data of China stocks
google-deepmind/mujoco
Multi-Joint dynamics with Contact. A general purpose physics simulator.
huseinzol05/Stock-Prediction-Models
Gathers machine learning and deep learning models for Stock forecasting including trading bots and simulations
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
linyiLYi/street-fighter-ai
This is an AI agent for Street Fighter II Champion Edition.
yihong0618/xiaogpt
Play ChatGPT and other LLM with Xiaomi AI Speaker
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
LantaoYu/MARL-Papers
Paper list of multi-agent reinforcement learning (MARL)
suragnair/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
openai/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Farama-Foundation/PettingZoo
An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
paperswithcode/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
quantumiracle/Popular-RL-Algorithms
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
BenjiKCF/Neural-Net-with-Financial-Time-Series-Data
This solution presents an accessible, non-trivial example of machine learning (Deep learning) with financial time series using TensorFlow
ailabx/ailabx
AI量化实验室,专注将前沿人工智能技术(深度学习/强化学习/知识图谱)应用于金融量化投资。
TikhonJelvis/RL-book
PKU-MARL/Multi-Agent-Transformer
zsc/xiaogpt
play chatgpt with xiaomi ai speaker
qlan3/Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
cryer/D.Silver_RL_Course
Some notes and experience about David Silver's Reinforcement Learning Course
datawhalechina/rl-papers
rl-papers
Ceruleanacg/Quantitative-Trading
💸 Papers and Code Implements for Quantitative-Trading
wwxFromTju/DRL_trick
shenhao-stu/CS224W-Fall2021
🌟🌟CS224W Fall 2021 | Stanford 的个人学习路线🌟🌟
werner-duvaud/openleaf-markdown-pdf
Python based markdown to PDF converter, specially designed for paginated documents
PiggyCh/RL_learning
Classical Reinforcement learning algorithm implement.