Pinned Repositories
alignmen_iIWL
Robust recipes to align language models with human and AI preferences
better_supervisory_signal
CPSC547_YIREN
finetuning_dynamics
how_to_prepare_taskhead
iICL
IL_for_MAE
Learning_dynamics_LLM
maddpg-again
Neural_Iterated_Learning
Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).
Joshua-Ren's Repositories
Joshua-Ren/Neural_Iterated_Learning
Pytorch implementation of the paper 'Compositional language emerge in a neural iterated learning' (ICLR 2020).
Joshua-Ren/better_supervisory_signal
Joshua-Ren/Learning_dynamics_LLM
Joshua-Ren/how_to_prepare_taskhead
Joshua-Ren/iICL
Joshua-Ren/maddpg-again
Joshua-Ren/IL_for_MAE
Joshua-Ren/alignmen_iIWL
Robust recipes to align language models with human and AI preferences
Joshua-Ren/CPSC547_YIREN
Joshua-Ren/finetuning_dynamics
Joshua-Ren/GPT2_SCAN
Joshua-Ren/HomePage
Yunhe Wang's HomePage
Joshua-Ren/ICL_toy_my
Joshua-Ren/IL_for_SSL
Joshua-Ren/IPP-Template
Template for the Informatics Project Proposal course
Joshua-Ren/joshua-ren.github.io
Joshua-Ren/Knowledge_distill
Joshua-Ren/MNIST_learning_speed_toy
Joshua-Ren/ms_thesis
Joshua-Ren/my_bvae
my test version of beta-vae
Joshua-Ren/numeral_thesis
Small numeral emergent language game
Joshua-Ren/P4_GPS
Joshua-Ren/P4_Graph
Joshua-Ren/P6_TS
Joshua-Ren/ReinforcementLearningBookExamples
Example codes to implement the examples in Richard's book, Reinforcement Learning: An Introduction.
Joshua-Ren/SimLang
Simulating Language Course
Joshua-Ren/simplicity_bias_learning_dynamics
Joshua-Ren/SPIN_iIWL
Try SPIN and combine with IL.
Joshua-Ren/ssl_graph
PyTorch implementation of BGRL (https://arxiv.org/abs/2102.06514)
Joshua-Ren/tre_thesis
Tre metric and some fundamental communication games.