andreasbinder's Stars
ermongroup/MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
FlorianWilhelm/bhm-at-scale
🪜 Bayesian Hierarchical Models at Scale
dalmia/David-Silver-Reinforcement-learning
Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.
openai/evolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
tushar-semwal/awesome-federated-computing
:books: :eyeglasses: A collection of research papers, codes, tutorials and blogs on Federated Computing/Learning.
google-research/fast-soft-sort
Fast Differentiable Sorting and Ranking
technicolor-research/sodeep
mkurovski/deep_rl_nanodegree
Project Solutions for my Deep Reinforcement Learning Nanodegree at Udacity
FlorianWilhelm/lrann
On the Effectiveness of Low-rank Approximations in Collaborative Filtering compared to Neural Networks
pyscaffold/pyscaffold
🛠 Python project template generator with batteries included
leriomaggio/reproducible-learn
Reproducible Machine and Deep Learning pipelines
omoindrot/tensorflow-triplet-loss
Implementation of triplet loss in TensorFlow
greerviau/GlorifiedCruiseControl
A proof of concept autonomous driving system that uses Behavioral Cloning to predict driving commands in real time from a video feed of the road.
jerrylin1121/BCO
Implementation of Behavioral Cloning from Observationmentation
haowei01/pytorch-examples
train models in pytorch, Learn to Rank, Collaborative Filter, Heterogeneous Treatment Effect, Uplift Modeling, etc
wildltr/ptranking
Learning to Rank in PyTorch
Stable-Baselines-Team/stable-baselines-tf2
[Experimental] TensorFlow 2 version of stable-baselines, temporary repository
tanzhenyu/baselines-tf2
openai baselines with tensorflow 2.0
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
jakewright/tutorials
Source code from Jake Wright's YouTube tutorials
dsbrown1331/bayesianrex
shiba24/learning2rank
Learning to rank with neuralnet - RankNet and ListNet
szdr/RankNet
Implementation of RankNet with chainer (python neural network library)
spring-cloud/spring-cloud-stream-binder-kafka
Spring Cloud Stream binders for Apache Kafka and Kafka Streams
milesial/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Alvary/snowplow
Cloud-native web, mobile and event analytics, running on AWS, GCP and on-premise with Kafka
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
nottombrown/rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback