Pinned Repositories
ScienceWorld
ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
adaptive-hanabi
SubGoal_Distillation_LLM
Code for paper Sub-goal Distillation: A Method to Improve Small Language Agents, accepted at CoLLAs 2024.
tgi-for-mila
A toolkit for running text-generation-inference on Mila and Compute Canada
C-MoBLeS
Clustering Subspace Generalization to Obtain Faster Reinforcement Learning
hashemzadeh.github.io
home
homepage
HomePage2
Offline-Online-RL
Code for OFFLINE-ONLINE REINFORCEMENT LEARNING: EXTENDING BATCH AND ONLINE RL
MHashemzadeh's Repositories
MHashemzadeh/Offline-Online-RL
Code for OFFLINE-ONLINE REINFORCEMENT LEARNING: EXTENDING BATCH AND ONLINE RL
MHashemzadeh/C-MoBLeS
Clustering Subspace Generalization to Obtain Faster Reinforcement Learning
MHashemzadeh/hashemzadeh.github.io
MHashemzadeh/home
MHashemzadeh/homepage
MHashemzadeh/HomePage2