keishihara

I like to train deep neural nets to play games, drive cars (in Carla), and produce texts (LLMs). My previous account @KeishiIshihara

Pinned Repositories

dotfiles
Language:Shell0 1 00
finetuning_llama3_hf
Language:Jupyter Notebook00
notion_integrations
Language:Python00
oasst_editor
Language:Jupyter Notebook00
policy-gradients-pytorch
Simple Policy Gradient implementations in PyTorch for Reinforcement Learning.
Language:Python00
WorldOnRails
(ICCV 2021, Oral) RL and distillation in CARLA using a factorized world model
Language:Python0 0 00

keishihara's Repositories

keishihara/dotfiles
Language:Shell0 1 00
keishihara/finetuning_llama3_hf
Language:Jupyter Notebook00
keishihara/notion_integrations
Language:Python00
keishihara/oasst_editor
Language:Jupyter Notebook00
keishihara/policy-gradients-pytorch
Simple Policy Gradient implementations in PyTorch for Reinforcement Learning.
Language:Python00
keishihara/WorldOnRails
(ICCV 2021, Oral) RL and distillation in CARLA using a factorized world model
Language:Python0 0 00