BartekCupial

Pinned Repositories

autoascend
The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge
Language:Python1 0 00
baba-is-ai
Language:Python1 0 00
crafter
Benchmarking the Spectrum of Agent Capabilities
Language:Python10
dungeonsdata-neurips2022
Dataset Instructions and Tutorials for Submission to Neurips2022
Language:Jupyter Notebook1 1 00
fast_inference
Language:Python1 1 00
finetuning-RL-as-CL
Language:Jupyter Notebook30
how-to-use-plgrid
2 1 00
LLAMA-compression
Language:Jupyter Notebook2 1 00
llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
Language:Python1 0 00
sample-factory
High throughput synchronous and asynchronous reinforcement learning
Language:Python4 0 00

BartekCupial's Repositories

BartekCupial/sample-factory
High throughput synchronous and asynchronous reinforcement learning
Language:Python4 0 00
BartekCupial/finetuning-RL-as-CL
Language:Jupyter Notebook30
BartekCupial/how-to-use-plgrid
2 1 00
BartekCupial/LLAMA-compression
Language:Jupyter Notebook2 1 00
BartekCupial/autoascend
The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge
Language:Python1 0 00
BartekCupial/baba-is-ai
Language:Python1 0 00
BartekCupial/crafter
Benchmarking the Spectrum of Agent Capabilities
Language:Python10
BartekCupial/dungeonsdata-neurips2022
Dataset Instructions and Tutorials for Submission to Neurips2022
Language:Jupyter Notebook1 1 00
BartekCupial/fast_inference
Language:Python1 1 00
BartekCupial/llm_debate
Code release for "Debating with More Persuasive LLMs Leads to More Truthful Answers"
Language:Python1 0 00
BartekCupial/minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Language:Python1 0 00
BartekCupial/nle-code-wrapper
Language:Python1
BartekCupial/nle-dashboard
Language:HTML1 1 00
BartekCupial/nle-demo
Language:Python1
BartekCupial/nle-utils
Language:Python1
BartekCupial/sample-pretrain
Language:Python1 2 00
BartekCupial/CodeXGlue-defects
Language:Python0 1 00
BartekCupial/BartekCupial.github.io
Language:HTML1 0
BartekCupial/Finetune-RL-as-CL
BartekCupial/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python
BartekCupial/nle-language-wrapper
Nethack Learning Environment Wrapper for Language Interface
BartekCupial/publications_2024
IDEAS scientific achievements
0 0
BartekCupial/rl-starter-files
RL starter files in order to immediatly train, visualize and evaluate an agent without writing any line of code
Language:Python0 0
BartekCupial/torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
Language:Python0 0