tlc4418
Research Engineer in AI and Machine Learning. MLMI MPhil graduate from the University of Cambridge, and Computing graduate from Imperial College London.
University of Cambridge, Cambridge, UK
Pinned Repositories
google-research
Google Research
UK-Biobank-Visualisation
A web interface to visualise and explore the UK Biobank
alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
JAXSeq
Train very large language models in Jax.
llm_optimization
A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
MEng-project
Imperial master's project codebase
neural-processes
Codebase for a replication study of Conditional Neural Processes
WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
tlc4418's Repositories
tlc4418/llm_optimization
A repo for RLHF training and BoN over LLMs, with support for reward model ensembles.
tlc4418/neural-processes
Codebase for a replication study of Conditional Neural Processes
tlc4418/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
tlc4418/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
tlc4418/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
tlc4418/JAXSeq
Train very large language models in Jax.
tlc4418/MEng-project
Imperial master's project codebase
tlc4418/WebShop
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents