keishihara
I like to train deep neural nets to play games, drive cars (in CARLA), and generate text (LLMs). My previous account: @KeishiIshihara
Pinned Repositories
dotfiles
finetuning_llama3_hf
notion_integrations
oasst_editor
policy-gradients-pytorch
Simple Policy Gradient implementations in PyTorch for Reinforcement Learning.
WorldOnRails
(ICCV 2021, Oral) RL and distillation in CARLA using a factorized world model
keishihara's Repositories
keishihara/dotfiles
keishihara/finetuning_llama3_hf
keishihara/notion_integrations
keishihara/oasst_editor
keishihara/policy-gradients-pytorch
Simple Policy Gradient implementations in PyTorch for Reinforcement Learning.
keishihara/WorldOnRails
(ICCV 2021, Oral) RL and distillation in CARLA using a factorized world model