wang1946may7

Hi there! I am an AI researcher at Sony. My research focuses on reinforcement learning and image processing. Please feel free to contact me!

Sony, Tokyo

Pinned Repositories

ai-research-code
Language: Python
BCORLE
BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
Language: Jupyter Notebook
carla-offline-rl
Data Generation for Offline RL on CARLA
Language: Python
CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL
Language: Python
CS282_Final
Review on BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
Language: Jupyter Notebook
ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python
FinGPT
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We release the trained model on HuggingFace.
Language: Jupyter Notebook
FinMem-LLM-StockTrading
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
Language: Python
FinRL
FinRL: Financial Reinforcement Learning. 🔥
Language: Jupyter Notebook
FinRL-Meta
FinRL-Meta: Dynamic datasets and market environments for FinRL.
Language: Python

wang1946may7's Repositories

wang1946may7/ai-research-code
Language: Python
wang1946may7/BCORLE
BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
Language: Jupyter Notebook
wang1946may7/carla-offline-rl
Data Generation for Offline RL on CARLA
Language: Python
wang1946may7/CORL
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL
Language: Python
wang1946may7/CS282_Final
Review on BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market
Language: Jupyter Notebook
wang1946may7/ElegantRL
Massively Parallel Deep Reinforcement Learning. 🔥
Language: Python
wang1946may7/FinGPT
Data-Centric FinGPT. Open-source for open finance! Revolutionize 🔥 We release the trained model on HuggingFace.
Language: Jupyter Notebook
wang1946may7/FinMem-LLM-StockTrading
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
Language: Python
wang1946may7/FinRL
FinRL: Financial Reinforcement Learning. 🔥
Language: Jupyter Notebook
wang1946may7/FinRL-Meta
FinRL-Meta: Dynamic datasets and market environments for FinRL.
Language: Python
wang1946may7/ICML-2020-MSBCB
Code of ICML-2020 paper Dynamic Knapsack Optimization Towards Efficient Multi-Channel Sequential Advertising
Language: Python
wang1946may7/LLMAgentPapers
Must-read Papers on LLM Agents.
wang1946may7/min-decision-transformer
Minimal implementation of Decision Transformer: Reinforcement Learning via Sequence Modeling in PyTorch for mujoco control tasks in OpenAI gym
Language: Python
wang1946may7/OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
wang1946may7/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
wang1946may7/RL-Carla
Reinforcement Learning and Data Collection with Carla Simulator
Language: Python
wang1946may7/scope-rl
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
wang1946may7/stocknet-dataset
A comprehensive dataset for stock movement prediction from tweets and historical stock prices.
wang1946may7/wang1946may7
Config files for my GitHub profile.
