Pinned Repositories
AML_bayes_opt
Supporting code for the Advanced Machine Learning module, MPhil Machine Learning and Machine Intelligence
attention-based-credit
Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar
data_collection
inverse-online
Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies (ICLR 2022) by Alex J. Chan, Alicia Curth, and Mihaela van der Schaar.
MCMC-Project
Code for my project comparing theoretical bounds with practical convergence diagnostics in MCMC.
medkit-learn
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.
scalable-birl
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
synthetic-model-combination
Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning (NeurIPS 2022) by Alex J. Chan and Mihaela van der Schaar.
transductive-dropout
Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift (ICML 2020) by Alex J. Chan, Ahmed M. Alaa, Zhaozhi Qian, and Mihaela van der Schaar.
XanderJC.github.io
Personal website
XanderJC's Repositories
XanderJC/scalable-birl
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
XanderJC/medkit-learn
The Medkit-Learn(ing) Environment: Medical Decision Modelling through Simulation (NeurIPS 2021) by Alex J. Chan, Ioana Bica, Alihan Huyuk, Daniel Jarrett, and Mihaela van der Schaar.
XanderJC/attention-based-credit
Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar
XanderJC/transductive-dropout
Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift (ICML 2020) by Alex J. Chan, Ahmed M. Alaa, Zhaozhi Qian, and Mihaela van der Schaar.
XanderJC/synthetic-model-combination
Synthetic Model Combination: An Instance-wise Approach to Unsupervised Ensemble Learning (NeurIPS 2022) by Alex J. Chan and Mihaela van der Schaar.
XanderJC/inverse-online
Inverse Online Learning: Understanding Non-Stationary and Reactionary Policies (ICLR 2022) by Alex J. Chan, Alicia Curth, and Mihaela van der Schaar.
XanderJC/XanderJC.github.io
Personal website
XanderJC/AML_bayes_opt
Supporting code for the Advanced Machine Learning module, MPhil Machine Learning and Machine Intelligence
XanderJC/data_collection
XanderJC/llm-articulation
XanderJC/MCMC-Project
Code for my project comparing theoretical bounds with practical convergence diagnostics in MCMC.
XanderJC/my-cookiecutter
My cookiecutter template for ML projects
XanderJC/deepspeed_llama
Finetuning LLaMA with DeepSpeed
XanderJC/mphil-thesis
Supplementary code for my MPhil thesis.
XanderJC/rnn-handwriting-generation
Handwriting generation by RNN with TensorFlow, based on "Generating Sequences With Recurrent Neural Networks" by Alex Graves
XanderJC/RowingManager
XanderJC/trl
Train transformer language models with reinforcement learning.
XanderJC/TruthfulQA
TruthfulQA: Measuring How Models Imitate Human Falsehoods
XanderJC/XanderJC
Config files for my GitHub profile.