andreasbinder

andreasbinder's Stars

ermongroup/MetaIRL
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Language:Python698
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Language:Python1.3k271
google-deepmind/deepmind-research
This repository contains implementations and illustrative code to accompany DeepMind publications
Language:Jupyter Notebook13.2k2.6k
FlorianWilhelm/bhm-at-scale
🪜 Bayesian Hierarchical Models at Scale
Language:Jupyter Notebook5017
dalmia/David-Silver-Reinforcement-learning
Notes for the Reinforcement Learning course by David Silver along with implementation of various algorithms.
Language:Jupyter Notebook786213
openai/evolution-strategies-starter
Code for the paper "Evolution Strategies as a Scalable Alternative to Reinforcement Learning"
Language:Python1.6k277
tushar-semwal/awesome-federated-computing
:books: :eyeglasses: A collection of research papers, codes, tutorials and blogs on Federated Computing/Learning.
46785
google-research/fast-soft-sort
Fast Differentiable Sorting and Ranking
Language:Python57347
technicolor-research/sodeep
Language:Python7113
mkurovski/deep_rl_nanodegree
Project Solutions for my Deep Reinforcement Learning Nanodegree at Udacity
Language:Jupyter Notebook41
FlorianWilhelm/lrann
On the Effectiveness of Low-rank Approximations in Collaborative Filtering compared to Neural Networks
Language:Jupyter Notebook71
pyscaffold/pyscaffold
🛠 Python project template generator with batteries included
Language:Python2.1k183
leriomaggio/reproducible-learn
Reproducible Machine and Deep Learning pipelines
Language:Python72
omoindrot/tensorflow-triplet-loss
Implementation of triplet loss in TensorFlow
Language:Python1.1k283
greerviau/GlorifiedCruiseControl
A proof of concept autonomous driving system that uses Behavioral Cloning to predict driving commands in real time from a video feed of the road.
Language:Python75
jerrylin1121/BCO
Implementation of Behavioral Cloning from Observationmentation
Language:Python163
haowei01/pytorch-examples
train models in pytorch, Learn to Rank, Collaborative Filter, Heterogeneous Treatment Effect, Uplift Modeling, etc
Language:Python17720
wildltr/ptranking
Learning to Rank in PyTorch
Language:Python47369
Stable-Baselines-Team/stable-baselines-tf2
[Experimental] TensorFlow 2 version of stable-baselines, temporary repository
Language:Python4510
tanzhenyu/baselines-tf2
openai baselines with tensorflow 2.0
Language:Python95
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k830
jakewright/tutorials
Source code from Jake Wright's YouTube tutorials
Language:Go589571
dsbrown1331/bayesianrex
Language:Python165
shiba24/learning2rank
Learning to rank with neuralnet - RankNet and ListNet
Language:Python481141
szdr/RankNet
Implementation of RankNet with chainer (python neural network library)
Language:Python85
spring-cloud/spring-cloud-stream-binder-kafka
Spring Cloud Stream binders for Apache Kafka and Kafka Streams
Language:Java331301
milesial/Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
Language:Python9.2k2.5k
Alvary/snowplow
Cloud-native web, mobile and event analytics, running on AWS, GCP and on-premise with Kafka
Language:Scala1
adambielski/siamese-triplet
Siamese and triplet networks with online pair/triplet mining in PyTorch
Language:Python3.1k633
nottombrown/rl-teacher
Code for Deep RL from Human Preferences [Christiano et al]. Plus a webapp for collecting human feedback
Language:Python55995