Pinned Repositories
agents
Efficient Batched Reinforcement Learning in TensorFlow
ai-economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
Arcade-Learning-Environment
The Arcade Learning Environment (ALE) -- a platform for AI research.
arrayfire
ArrayFire: a general purpose GPU library.
ARS
An implementation of the Augmented Random Search algorithm for The DeepMind Control Suite and Package
big
CMU RI BIG
BinaryNet.pytorch
Binarized Neural Network (BNN) for pytorch
dm-control-rl
Implementations of reinforcement learning algorithms on top of The DeepMind Control Suite and Package
NoiseInjection
Fork of DART: Noise Injection for Imitation Learning
xitari-Windows
A platform for AI research, in Windows
pedronahum's Repositories
pedronahum/ai-economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
pedronahum/arrayfire
ArrayFire: a general purpose GPU library.
pedronahum/big
CMU RI BIG
pedronahum/xitari-Windows
A platform for AI research, in Windows
pedronahum/bwapi
Brood War API
pedronahum/category-theory-for-dotnet-programmers
This repo contains all c++ / haskell samples from Bartosz Milewski's book (Category Theory for Programmers) converted to csharp and fsharp
pedronahum/conservative-uncertainty-estimation-random-priors
Source code for paper Conservative Uncertainty Estimation By Fitting Prior Networks (ICLR 2020)
pedronahum/CQL
Code for conservative Q-learning
pedronahum/DAN
A new architecture of semantic segmentation called Dense-Attention Networks.
pedronahum/diffkt
A framework for automatic differentiation in Kotlin
pedronahum/fsi-samples
A collection of open-source GPU accelerated Python tools and examples for quantitative analyst tasks and leverages RAPIDS AI project, Numba, cuDF, and Dask.
pedronahum/google-research
Google Research
pedronahum/hanabi_SAD
Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning
pedronahum/Hanabi_SPARTA
Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it
pedronahum/ite_v2
Intelligent Trial & Error Algorithm for Robot Adaptation
pedronahum/learn_solana
pedronahum/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
pedronahum/phemex-python-api
Phemex Market Data & Trading API in Python
pedronahum/proxy
Proxy: Next Generation Polymorphism in C++
pedronahum/PyBC
Bitcoin blockchain parser for Python 2 and 3. Includes handy examples.
pedronahum/rlkit
Collection of reinforcement learning algorithms
pedronahum/s2client-api
StarCraft II Client - C++ library supported on Windows, Linux and Mac designed for building scripted bots and research using the SC2API.
pedronahum/Scientific-Software-Design
Code examples from Rouson, Xia & Xu (Cambridge University Press, 2011)
pedronahum/starcraft_defogger
Forward Modeling for Partial Observation Strategy Games - A StarCraft Defogger
pedronahum/StarData
Starcraft AI Research Dataset
pedronahum/swift-models
Example models built using Swift for TensorFlow
pedronahum/TorchCraft
Connecting Torch to StarCraft
pedronahum/TrillSamples
Sample applications to demonstrate how to use the Trill library and API
pedronahum/varibad
pedronahum/weld
High-performance runtime for data analytics applications