Pinned Repositories
AlwaysSafe
Code for the paper "AlwaysSafe: Reinforcement Learning Without Safety Constraint Violations During Training"
spi_pomdp
Code for the paper "Safe Policy Improvement for POMDPs via Finite-State Controllers"
bbc
utility to check and format bibtex files
crawler-rss-feed
Crawler for a rss feed.
gym-factored
SPIBB
Safe Policy Improvement with Baseline Bootstrapping
SPIBB-DQN
Code for SPIBB-DQN and Soft-SPIBB-DQN
symbolic-model-checker-planning
univr_offline_rl
Code for offline RL programming exercise
tdsimao's Repositories
tdsimao/gym-factored
tdsimao/univr_offline_rl
Code for offline RL programming exercise
tdsimao/bbc
utility to check and format bibtex files
tdsimao/ttheme
A modern LaTeX Beamer theme
tdsimao/SPIBB
Safe Policy Improvement with Baseline Bootstrapping
tdsimao/SPIBB-DQN
Code for SPIBB-DQN and Soft-SPIBB-DQN
tdsimao/al-folio-old
A beautiful, simple, clean, and responsive Jekyll theme for academics
tdsimao/albert_plugins
Official Albert plugins
tdsimao/ATM
Repository containing all code related to the ATM-approach for solving ACNO-MDPs
tdsimao/cs228-notes
Course notes for CS228: Probabilistic Graphical Models.
tdsimao/CSrankings
A web app for ranking computer science departments according to their research output in selective venues, and for finding active faculty across a wide range of areas.
tdsimao/gym
A toolkit for developing and comparing reinforcement learning algorithms.
tdsimao/gym-multigrid
Lightweight multi-agent gridworld Gym environment
tdsimao/matplotlib
matplotlib: plotting with Python
tdsimao/prettybib
Bibtex linter and fixer
tdsimao/python-bibtexparser
Bibtex parser for Python 3.3+
tdsimao/responsible-charger-aggregator
Project E
tdsimao/stable-job-shop
Reinforcement Learning applied to permutation job shop problems
tdsimao/study-julia
tdsimao/sum_db
Literature Review Archive
tdsimao/ThinkJulia.jl
Port of the book Think Python to the Julia programming language
tdsimao/tt
Projeto de Monografia de Graduação apresentada ao Departamento de Ciência da Computação para obtenção do título de Bacharel em “Ciência da Computação”
tdsimao/tud-beamertheme
TU Delft template
tdsimao/tueplots
Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other scientific publications.
tdsimao/univr_cmdp
Code for CMDP programming exercise
tdsimao/univr_crl
tdsimao/v52
PGM 2016 Proceedings
tdsimao/velha21
A game server with tic-tac-toe and blackjack
tdsimao/vi_demo
tdsimao/WCSAC
Code for the paper "WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning"