szrlee

Ph.D. candidate at The Chinese University of Hong Kong, Shenzhen, China.

Pinned Repositories

Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
5.3k 86 9292
HiCode
Hidden Community Detection in Social Networks
Language: C++ 12 7 25
memoire
Language: C++ 18 4 02
AIPS
Automatic Intrusion Prevention Systems using SDN
Language: Python 5 8 02
awesome-continual-learning
A collection of resources on Continual Learning, a hot research topic and a fundamental stepping stone toward Artificial General Intelligence (AGI).
8 7 03
awesome-quantum-machine-learning-cn
A collection of quantum machine learning resources: basics, algorithms, study materials, projects, and descriptions of projects from around the web.
Language: HTML 9 4 01
Distributed-Multi-Label-Continual-Learning
This is a distributed training framework for continual and incremental learning for multi-label multi-class image tasks
Language: Python 1 6 01
GPT-HyperAgent
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
Language: Python 2 1 00
HyperAgent
The official code repo for HyperAgent algorithm published in ICML 2024.
Language: Python 4 2 10
Stock-Time-Series-Analysis
Mathematical modeling for financial time series data
Language: Jupyter Notebook 40 6 449

szrlee's Repositories

szrlee/HyperAgent
The official code repo for HyperAgent algorithm published in ICML 2024.
Language: Python 4 2 10
szrlee/GPT-HyperAgent
The official code repo for HyperAgent for neural bandits and GPT-HyperAgent for content moderation.
Language: Python 2 1 00
szrlee/muzero-cpp
A C++ PyTorch implementation of MuZero
Language: C++ 2 2 00
szrlee/Distributed-Multi-Label-Continual-Learning
This is a distributed training framework for continual and incremental learning for multi-label multi-class image tasks
Language: Python 1 6 01
szrlee/Information_Directed_Sampling
Implementation of Russo and Van Roy's work on Information Directed Sampling (2017)
Language: Python 0 2 01
szrlee/academic-kickstart
Language: Shell 3 0
szrlee/academic-website
Language: Shell 2 0
szrlee/Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
0 0
szrlee/bror
Language: Python 3 0
szrlee/bsuite
bsuite is a collection of carefully-designed experiments that investigate core capabilities of a reinforcement learning (RL) agent
Language: Python 3 0
szrlee/enn
Language: Python 1 0
szrlee/Exploration-in-RL
Language: Jupyter Notebook 2 0
szrlee/graphbackup
Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824
Language: Python 1 0
szrlee/hustthesis
An Unofficial Thesis Template in LaTeX for Huazhong University of Science and Technology
Language: TeX 2 0
szrlee/HyperFQI
Language: Python 2 0
szrlee/LangevinDQN
Code for the Langevin DQN agent
Language: Jupyter Notebook 1 0
szrlee/LMCTS
Language: Python 1 0
szrlee/logistic_bandit
Logistic Bandit experiments. Official code for the paper "Jointly Efficient and Optimal Algorithms for Logistic Bandits".
Language: Python 1 0
szrlee/model-based-muesli
Muesli implementation based on the MuZero implementation by JimOhman (https://github.com/JimOhman/model-based-rl)
Language: Python 1 0
szrlee/MuZero-Tensor-Batch-MCTS
An idea for implementing MCTS with tensors. This implementation can process a batch of observations on the GPU.
Language: Python 1 0
szrlee/OB2I
Code for "Principled Exploration via Optimistic Bootstrapping and Backward Induction"
Language: Python 2 0
szrlee/offline-rl-neurips.github.io
Language: HTML 2 0
szrlee/omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.
Language: Python 2 0
szrlee/optimistic-init
Accompanying code for "Optimistic Initialization for Exploration in Continuous Control"
Language: Python 1 0
szrlee/rlberry
An easy-to-use reinforcement learning library for research and education.
Language: Python 2 0
szrlee/sigmazero
Generalizing DeepMind's MuZero algorithm to stochastic environments
Language: Python 1 0
szrlee/TabulaRL
Language: Python 2 0
szrlee/ts_tutorial
Language: Jupyter Notebook 2 0
szrlee/ucbmq_code
Language: Python 2 0
szrlee/vae-anomaly-detector
Experiments on unsupervised anomaly detection using a variational autoencoder. The variational autoencoder is implemented in PyTorch.
Language: Python 2 0