jmribeiro

AI Research Scientist / Deep Learning Engineer at GAIPS

GAIPS, Lisbon

Pinned Repositories

a-practical-introduction-to-reinforcement-learning
Language: Python | 1 1 00
adhoc-teamwork-under-partial-observability
Code for ECAI 2023 paper "Making Friends in the Dark: Ad Hoc Teamwork under Partial Observability"
Language: Python | 1 1 00
NumPy-Neural-Network-From-Scratch
Neural network implemented using nothing but NumPy (no autograd), showcased for multi-class classification, with OCR dataset.
Language: Python | 1 0 00
PLASTIC-Algorithms
Python implementation of the PLASTIC Model and PLASTIC Policy algorithms for Ad Hoc teamwork
Language: Python | 7 1 20
Pruning-and-Sparsemax-Methods-for-Hierarchical-Attention-Networks
Paper code for Pruning and Sparsemax Methods for Hierarchical Attention Networks
Language: Python | 5 2 340
PyTorch-LSTM-based-LLM
Arabic to English Translation using Encoder-Decoder Sequence-to-Sequence Model
Language: Python | 1 0 01
UGP
Paper code for Multi-task learning without Catastrophic Forgetting in Deep Reinforcement Learning (https://login.easychair.org/publications/paper/8RPq)
Language: Python | 6 1 20
yaaf
Yet Another Agents Framework - An RL research-oriented framework for agent prototyping and evaluation
Language: Python | 18 3 13
PyTorch-NLP
Basic Utilities for PyTorch Natural Language Processing (NLP)
Language: Python | 2.2k 56 69257
pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language: Python | 1.2k 91 75157

jmribeiro's Repositories

jmribeiro/yaaf
Yet Another Agents Framework - An RL research-oriented framework for agent prototyping and evaluation
Language: Python | 18 3 13
jmribeiro/PLASTIC-Algorithms
Python implementation of the PLASTIC Model and PLASTIC Policy algorithms for Ad Hoc teamwork
Language: Python | 7 1 20
jmribeiro/UGP
Paper code for Multi-task learning without Catastrophic Forgetting in Deep Reinforcement Learning (https://login.easychair.org/publications/paper/8RPq)
Language: Python | 6 1 20
jmribeiro/Pruning-and-Sparsemax-Methods-for-Hierarchical-Attention-Networks
Paper code for Pruning and Sparsemax Methods for Hierarchical Attention Networks
Language: Python | 5 2 340
jmribeiro/a-practical-introduction-to-reinforcement-learning
Language: Python | 1 1 00
jmribeiro/adhoc-teamwork-under-partial-observability
Code for ECAI 2023 paper "Making Friends in the Dark: Ad Hoc Teamwork under Partial Observability"
Language: Python | 1 1 00
jmribeiro/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language: Python | 1 0 00
jmribeiro/NumPy-Neural-Network-From-Scratch
Neural network implemented using nothing but NumPy (no autograd), showcased for multi-class classification, with OCR dataset.
Language: Python | 1 0 00
jmribeiro/pddpg-hfo
Half Field Offense in Robocup 2D Soccer with reinforcement learning
Language: Python | 1 0 00
jmribeiro/PyTorch-LSTM-based-LLM
Arabic to English Translation using Encoder-Decoder Sequence-to-Sequence Model
Language: Python | 1 0 01
jmribeiro/teamster-model-based-ad-hoc-teamwork
Source code for paper "TEAMSTER: Model-based reinforcement learning for ad hoc teamwork" (Artificial Intelligence Journal, 2023)
Language: Python | 1 1 01
jmribeiro/TF-Agents
Language: Python | 1 0 00
jmribeiro/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
jmribeiro/AndroidEventCollector
Forensic tool that provides data collection from rooted Android devices
Language: Java | 0 0
jmribeiro/CLISP-Tetris-A-AI
Tetris A* Search w/ Heuristics coded in CLISP
Language: Common Lisp | 0 0
jmribeiro/CPractice
Language: C | 0 0
jmribeiro/gridworlds
Language: Python | 1 0
jmribeiro/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
Language: Jupyter Notebook | 0 0
jmribeiro/HOTSPOT-An-Ad-Hoc-Teamwork-Platform-for-Mixed-Human-Robot-Teams
Code for PLOS One paper "HOTSPOT: An Ad Hoc Teamwork Platform for Mixed Human-Robot Teams"
Language: Python | 1 0
jmribeiro/HybridGA3C
Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.
Language: Python | 1 0
jmribeiro/Java-TicTacToe-MiniMax-AlphaBeta-Prunning
Java TicTacToe MiniMax & Alpha-Beta Pruning
Language: Java | 0 0
jmribeiro/Lifespan_Age_Transformation_Synthesis
Lifespan Age Transformation Synthesis code
Language: Python | 0 0
jmribeiro/MyDrive-Java-FileSystem-and-Shell
Java FileSystem w/ Shell developed with SCRUM team
Language: Java | 0 0
jmribeiro/pfrl
PFRL: a PyTorch-based deep reinforcement learning library
Language: Python | 0 0
jmribeiro/PyTorch-NLP
Basic Utilities for PyTorch Natural Language Processing (NLP)
Language: Python | 0 0
jmribeiro/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Language: Python | 0 0
jmribeiro/SLM-Lab
Modular Deep Reinforcement Learning framework in PyTorch.
Language: Python | 0 0
jmribeiro/tfagents
TF-Agents is a library for Reinforcement Learning in TensorFlow
Language: Python | 2 0
jmribeiro/torchbeast
A PyTorch Platform for Distributed RL
Language: Python | 1 0
jmribeiro/UPATransports
Language: Java | 0 0