dmitrySorokin's Stars
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
catalyst-team/catalyst
Accelerated deep learning R&D
google-deepmind/alphatensor
levyitay/AddSecurityExceptionAndroid
FortsAndMills/RL-Theory-book
Reinforcement learning theory book about foundations of deep RL algorithms with proofs.
jcwleo/random-network-distillation-pytorch
Random Network Distillation in PyTorch
NVlabs/cule
CuLE: A CUDA port of the Atari Learning Environment (ALE)
booydar/babilong
BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.
sberbank-ai-lab/pytorch-lifestream
A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision
Behrouz-Babaki/MinSizeKmeans
A Python implementation of KMeans clustering with a minimum cluster size constraint (Bradley et al., 2000)
root-project/veccore
C++ Library for Portable SIMD Vectorization
alirezakazemipour/PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
lnpalmer/A2C
PyTorch implementation of Advantage Actor-Critic (A2C)
ijmbarr/parsing-pdfs
Extracting tabular information from PDFs using Python
bdhammel/python_newport_controller
Interfacing with Newport Motion Controllers using Python
jonapost/field_propagation
Module for integrating a track's trajectory in a field, whether magnetic, electric, combined electromagnetic, or also including gravity or other forces.
garabik/pdfshapeminer
Extract text from PDF using pdfminer and shapely
timCF/pymor
Simplified pymorphy2 command-line tool
DmitriyValetov/servers
Servers in Python
zemerov/robel
ROBEL: Robotics Benchmarks for Learning with low-cost robots