Pinned Repositories
reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
arboretum
Gradient Boosting powered by GPU(NVIDIA CUDA)
CS294-112-assignments
Assignments for CS294-112.
data-science-bowl-2017
Algorithms for improving lung cancer detection with deep learning
evolution-strategies-exploration
Contains implementation of: Tim Salimans Et al. “Evolution Strategies as a Scalable Alternative to Reinforcement Learning”. Arxiv.org. https://arxiv.org/pdf/1703.03864.pdf.
jovsatools
my personal toolbox 🧰
jtorch
An automatic differentiation engine for personal exploration and learning
rl-examples-sutton-and-barto-book
Python Example from the book "Reinforcement Learning: An Introduction"
speed-challenge-2017
Explore how well deep neural networks perform at predicting vehicle speed given just visual data (dashcam video) containing highway and suburban driving.
jovsa's Repositories
jovsa/jovsatools
my personal toolbox 🧰
jovsa/jtorch
An automatic differentiation engine for personal exploration and learning
jovsa/jovsa.github.io
jovsa/transformer-experiments
jovsa/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
jovsa/ANN-playground
A playground to learn and explore techniques about ANN
jovsa/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
jovsa/blender-beginner-modelling-chair
jovsa/blender-donut
jovsa/carbs
Cost aware hyperparameter tuning algorithm
jovsa/cut-the-knot-probability-riddles
jovsa/diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
jovsa/dreamerv3
Mastering Diverse Domains through World Models
jovsa/kaggle-connect-x
https://www.kaggle.com/c/connectx
jovsa/keymaker
where 🔑s get made
jovsa/learning-systems
jovsa/mctx
Monte Carlo tree search in JAX
jovsa/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
jovsa/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
jovsa/mmd
Code for magnetic mirror descent.
jovsa/mmd-dilated
An implementation of the QRE solver magnetic mirror descent with dilated entropy (MMD).
jovsa/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
jovsa/rebel
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
jovsa/safe-haven-reproduction
jovsa/scalify
jovsa/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
jovsa/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
jovsa/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
jovsa/triton
Development repository for the Triton language and compiler
jovsa/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)