Pinned Repositories
CGOptimizer
CMOptimizer
CriticalGradientOptimization
Critical Gradient Optimization.
EpiK-Eval
Benchmark to evaluate the capability of language models to consolidate and recall information from multiple training documents.
IIRC
IIRC: Incremental Implicitly Refined Classification
Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
LoCA
PatchUp
Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
RLHive
chandar-lab's Repositories
chandar-lab/RLHive
chandar-lab/AMPLIFY
chandar-lab/Recall2Imagine
Recall to Imagine, a model-based RL algorithm with superhuman memory. Oral (1.2%) @ ICLR 2024
chandar-lab/PatchUp
chandar-lab/IIRC
IIRC: Incremental Implicitly Refined Classification
chandar-lab/Lifelong-Hanabi
A Continual Multi-agent RL testbed based on Hanabi
chandar-lab/EfficientLLMs
chandar-lab/LoCA
chandar-lab/CMOptimizer
chandar-lab/EpiK-Eval
Benchmark to evaluate the capability of language models to consolidate and recall information from multiple training documents.
chandar-lab/CGOptimizer
chandar-lab/COE
chandar-lab/LoCA2
chandar-lab/SubGoal_Distillation_LLM
Code for paper Sub-goal Distillation: A Method to Improve Small Language Agents, accepted at CoLLAs 2024.
chandar-lab/CAIRO
We explain why fairness metrics don't correlate and propose CAIRO to make them correlate.
chandar-lab/crystal-design
Reinforcement Learning for Crystal Structure Design
chandar-lab/FASP
We study the effect of attention head pruning on fairness in large language models
chandar-lab/healthy-data-diet
Reduce gender bias in machine learning models.
chandar-lab/RL-Tuner-CP
chandar-lab/tgi-for-mila
A toolkit for running text-generation-inference on Mila and Compute Canada
chandar-lab/adaptive-hanabi
chandar-lab/INF8245e-assignments-public
chandar-lab/INF8250ae-assignments-2023
chandar-lab/INF8250e-assignments-public
chandar-lab/Lookbehind-SAM
Implementation of Lookbehind-SAM: k steps back, 1 step forward (ICML 2024)
chandar-lab/amp
chandar-lab/AMPLIFY-novel_architectures
A fork of AMPLIFY for testing of new architectures for protein language modeling.
chandar-lab/INF8245e-assignments-2023
chandar-lab/r2i.github.io
chandar-lab/RISC