jovsa

personal account

@redditMountain View, California

Pinned Repositories

reinforcement-learning
Implementation of Reinforcement Learning Algorithms. Python, OpenAI Gym, Tensorflow. Exercises and Solutions to accompany Sutton's Book and David Silver's course.
Language:Jupyter Notebook20.8k 866 1556.1k
arboretum
Gradient Boosting powered by GPU(NVIDIA CUDA)
Language:Cuda2 2 00
CS294-112-assignments
Assignments for CS294-112.
Language:Python3 2 00
data-science-bowl-2017
Algorithms for improving lung cancer detection with deep learning
Language:Jupyter Notebook13 3 16
evolution-strategies-exploration
Contains implementation of: Tim Salimans Et al. “Evolution Strategies as a Scalable Alternative to Reinforcement Learning”. Arxiv.org. https://arxiv.org/pdf/1703.03864.pdf.
Language:Jupyter Notebook22 2 012
jovsatools
my personal toolbox 🧰
Language:Jupyter Notebook3 1 10
jtorch
An automatic differentiation engine for personal exploration and learning
Language:Python2 2 20
rl-examples-sutton-and-barto-book
Python Example from the book "Reinforcement Learning: An Introduction"
Language:Python5 2 01
speed-challenge-2017
Explore how well deep neural networks perform at predicting vehicle speed given just visual data (dashcam video) containing highway and suburban driving.
Language:Jupyter Notebook43 3 411

jovsa's Repositories

jovsa/jtorch
An automatic differentiation engine for personal exploration and learning
Language:Python2 2 20
jovsa/jovsa.github.io
Language:SCSS1 1 6
jovsa/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language:Jupyter Notebook0 0
jovsa/ANN-playground
A playground to learn and explore techniques about ANN
Language:Jupyter Notebook2 0
jovsa/carbs
Cost aware hyperparameter tuning algorithm
Language:Jupyter Notebook0 0
jovsa/cut-the-knot-probability-riddles
1 0
jovsa/diplomacy_cicero
Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.
Language:Python0 0
jovsa/dreamerv3
Mastering Diverse Domains through World Models
Language:Python0 0
jovsa/dreamerv3-torch
Implementation of Dreamer v3 in pytorch.
jovsa/kaggle-connect-x
https://www.kaggle.com/c/connectx
Language:Python
jovsa/keymaker
where 🔑s get made
Language:Python2 01
jovsa/mctx
Monte Carlo tree search in JAX
jovsa/meltingpot
A suite of test scenarios for multi-agent reinforcement learning.
jovsa/micrograd
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Language:Jupyter Notebook0 0
jovsa/minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Language:Python
jovsa/mmd
Code for magnetic mirror descent.
jovsa/mmd-dilated
An implementation of the QRE solver magnetic mirror descent with dilated entropy (MMD).
Language:Python0 0
jovsa/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
jovsa/openr
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
jovsa/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python0 0
jovsa/rebel
An algorithm that generalizes the paradigm of self-play reinforcement learning and search to imperfect-information games.
Language:C++1 0
jovsa/safe-haven-reproduction
Language:Jupyter Notebook1 0
jovsa/stream-of-search
Repository for the paper Stream of Search: Learning to Search in Language
jovsa/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.
Language:Python0 0
jovsa/systems
systems is a set of tools for describing, running and visualizing systems diagrams.
Language:HTML
jovsa/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
Language:Jupyter Notebook0 0
jovsa/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language:Python
jovsa/triton
Development repository for the Triton language and compiler
Language:C++0 0
jovsa/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
Language:Python0 0
jovsa/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0