wang1946may7
hi, there! I am an AI researcher @ Sony. My research topic is about reinforcement learning and image processing. Plz feel free to contact me!
SonyTokyo
Pinned Repositories
BCORLE
BCORLE( λ ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
carla-offline-rl
Data Generation for Offline RL on CARLA
CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL
CS282_Final
Review on BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
FinGPT
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We release the trained model on HuggingFace.
FinMem-LLM-StockTrading
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
FinRL
FinRL: Financial Reinforcement Learning. 🔥
FinRL-Meta
FinRL-Meta: Dynamic datasets and market environments for FinRL.
ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
wang1946may7's Repositories
wang1946may7/BCORLE
BCORLE( λ ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
wang1946may7/carla-offline-rl
Data Generation for Offline RL on CARLA
wang1946may7/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL
wang1946may7/CS282_Final
Review on BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
wang1946may7/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
wang1946may7/FinGPT
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We release the trained model on HuggingFace.
wang1946may7/FinMem-LLM-StockTrading
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
wang1946may7/FinRL
FinRL: Financial Reinforcement Learning. 🔥
wang1946may7/FinRL-Meta
FinRL-Meta: Dynamic datasets and market environments for FinRL.
wang1946may7/ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
wang1946may7/ai-research-code
wang1946may7/LLMAgentPapers
Must-read Papers on LLM Agents.
wang1946may7/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
wang1946may7/OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
wang1946may7/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
wang1946may7/RL-Carla
Reinforcement Learning and Data Collection with Carla Simulator
wang1946may7/scope-rl
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
wang1946may7/stocknet-dataset
A comprehensive dataset for stock movement prediction from tweets and historical stock prices.
wang1946may7/wang1946may7
Config files for my GitHub profile.