Pinned Repositories
Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
HiCode
Hidden Community Detection in Social Networks
memoire
AIPS
Automatic Intrusion Prevention Systems using SDN
awesome-continual-learning
Resources collection for the hot research topic of Continual Learning, a fundamental step stone to Artificial General Intelligence (AGI).
awesome-quantum-machine-learning-cn
收集量子机器学习的基础、算法、学习、项目等资料的收集。Here you can get all the Quantum Machine learning Basics, Algorithms ,Study Materials ,Projects and the descriptions of the projects around the web
Distributed-Multi-Label-Continual-Learning
This is a distributed training framework for continual and incremental learning for multi-label multi-class image tasks
GPT-HyperAgent
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
HyperAgent
The official code repo for HyperAgent algorithm published in ICML 2024.
Stock-Time-Series-Analysis
Mathematical modeling for finantial time series data
szrlee's Repositories
szrlee/HyperAgent
The official code repo for HyperAgent algorithm published in ICML 2024.
szrlee/GPT-HyperAgent
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
szrlee/muzero-cpp
A C++ pytorch implementation of MuZero
szrlee/Distributed-Multi-Label-Continual-Learning
This is a distributed training framework for continual and incremental learning for multi-label multi-class image tasks
szrlee/Information_Directed_Sampling
Implementation of Russo and Van Roy work on Information Directed Sampling (2017)
szrlee/academic-kickstart
szrlee/academic-website
szrlee/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
szrlee/bror
szrlee/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
szrlee/enn
szrlee/Exploration-in-RL
szrlee/graphbackup
Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824
szrlee/hustthesis
:notebook_with_decorative_cover: An Unofficial Thesis Template in LaTeX for Huazhong University of Science and Technology
szrlee/HyperFQI
szrlee/LangevinDQN
Code for the Langevin DQN agent
szrlee/LMCTS
szrlee/logistic_bandit
Logistic Bandit experiments. Official code for the paper "Jointly Efficient and Optimal Algorithms for Logistic Bandits".
szrlee/model-based-muesli
muesli implementation based on muzero implementation from JimOhman (https://github.com/JimOhman/model-based-rl)
szrlee/MuZero-Tensor-Batch-MCTS
An idea to implement MCTS by tensors. This implementation is able to process a batch of observations on GPU.
szrlee/OB2I
Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"
szrlee/offline-rl-neurips.github.io
szrlee/omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
szrlee/optimistic-init
Accompanying code for "Optimistic Initialization for Exploration in Continuous Control"
szrlee/rlberry
An easy-to-use reinforcement learning library for research and education.
szrlee/sigmazero
Generalizing DeepMind's MuZero algorithm on stochastic environments
szrlee/TabulaRL
szrlee/ts_tutorial
szrlee/ucbmq_code
szrlee/vae-anomaly-detector
Experiments on unsupervised anomaly detection using variational autoencoder. The variational autoencoder is implemented in Pytorch.