Pinned Repositories
a-practical-introduction-to-reinforcement-learning
adhoc-teamwork-under-partial-observability
Code for ECAI 2023 paper "Making Friends in the Dark: Ad Hoc Teamwork under Partial Observability"
NumPy-Neural-Network-From-Scratch
Neural network implemented using nothing but NumPy (no autograd), showcased on multi-class classification with an OCR dataset.
PLASTIC-Algorithms
Python implementation of the PLASTIC Model and PLASTIC Policy algorithms for Ad Hoc teamwork
Pruning-and-Sparsemax-Methods-for-Hierarchical-Attention-Networks
Paper code for Pruning and Sparsemax Methods for Hierarchical Attention Networks
PyTorch-LSTM-based-LLM
Arabic to English Translation using Encoder-Decoder Sequence-to-Sequence Model
UGP
Paper code for Multi-task learning without Catastrophic Forgetting in Deep Reinforcement Learning (https://login.easychair.org/publications/paper/8RPq)
yaaf
Yet Another Agents Framework - An RL research-oriented framework for agent prototyping and evaluation
PyTorch-NLP
Basic Utilities for PyTorch Natural Language Processing (NLP)
pfrl
PFRL: a PyTorch-based deep reinforcement learning library
jmribeiro's Repositories
jmribeiro/yaaf
Yet Another Agents Framework - An RL research-oriented framework for agent prototyping and evaluation
jmribeiro/PLASTIC-Algorithms
Python implementation of the PLASTIC Model and PLASTIC Policy algorithms for Ad Hoc teamwork
jmribeiro/UGP
Paper code for Multi-task learning without Catastrophic Forgetting in Deep Reinforcement Learning (https://login.easychair.org/publications/paper/8RPq)
jmribeiro/Pruning-and-Sparsemax-Methods-for-Hierarchical-Attention-Networks
Paper code for Pruning and Sparsemax Methods for Hierarchical Attention Networks
jmribeiro/a-practical-introduction-to-reinforcement-learning
jmribeiro/adhoc-teamwork-under-partial-observability
Code for ECAI 2023 paper "Making Friends in the Dark: Ad Hoc Teamwork under Partial Observability"
jmribeiro/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
jmribeiro/NumPy-Neural-Network-From-Scratch
Neural network implemented using nothing but NumPy (no autograd), showcased on multi-class classification with an OCR dataset.
jmribeiro/pddpg-hfo
Half Field Offense in Robocup 2D Soccer with reinforcement learning
jmribeiro/PyTorch-LSTM-based-LLM
Arabic to English Translation using Encoder-Decoder Sequence-to-Sequence Model
jmribeiro/teamster-model-based-ad-hoc-teamwork
Source code for paper "TEAMSTER: Model-based reinforcement learning for ad hoc teamwork" (Artificial Intelligence Journal, 2023)
jmribeiro/TF-Agents
jmribeiro/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
jmribeiro/AndroidEventCollector
Forensic tool that provides data collection from rooted Android devices
jmribeiro/CLISP-Tetris-A-AI
Tetris A* Search w/ Heuristics coded in CLISP
jmribeiro/CPractice
jmribeiro/gridworlds
jmribeiro/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
jmribeiro/HOTSPOT-An-Ad-Hoc-Teamwork-Platform-for-Mixed-Human-Robot-Teams
Code for PLOS One paper "HOTSPOT: An Ad Hoc Teamwork Platform for Mixed Human-Robot Teams"
jmribeiro/HybridGA3C
Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.
jmribeiro/Java-TicTacToe-MiniMax-AlphaBeta-Prunning
Java TicTacToe MiniMax & Alpha-Beta Pruning
jmribeiro/Lifespan_Age_Transformation_Synthesis
Lifespan Age Transformation Synthesis code
jmribeiro/MyDrive-Java-FileSystem-and-Shell
Java FileSystem w/ Shell, developed by a Scrum team
jmribeiro/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
jmribeiro/PyTorch-NLP
Basic Utilities for PyTorch Natural Language Processing (NLP)
jmribeiro/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
jmribeiro/SLM-Lab
Modular Deep Reinforcement Learning framework in PyTorch.
jmribeiro/tfagents
TF-Agents is a library for Reinforcement Learning in TensorFlow
jmribeiro/torchbeast
A PyTorch Platform for Distributed RL
jmribeiro/UPATransports