Pinned Repositories
Autonomous-Driving-via-RL
Applying RL methods for autonomous driving in Carla simulator.
Coding_Reinforcement_Learning
Implementation of basic RL steps and algorithms - Dynamic Programming approach, Monte-Carlo approach, DQN on Atari, Policy Gradient - Reinforce with baseline, Actor Critic (A2C)
Data-Structures-Algorithms-and-OOPS-in-Python
Just some practice codes
Image-Captioning-via-YOLOv5-EncoderDecoderwithAttention
Image Captioning using combination of object detection via YOLOv5 and Encoder Decoder LSTM model
interpretable-ml-book
Book about interpretable machine learning
Linear-Algebra
Linear Algebra Course Assignments at IISc, Bangalore
mbppol
This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm" accepted at NeurIPS 2022.
Neural-Architecture-Search-Project
Attempt of Efficient Neural Architecture Search for searching DQN approximation neural network architecture
PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
Sentiment-Analysis
Twitter Sentiment Analysis
akjayant's Repositories
akjayant/PPO_Lagrangian_PyTorch
Implementation of PPO Lagrangian in PyTorch
akjayant/Autonomous-Driving-via-RL
Applying RL methods for autonomous driving in Carla simulator.
akjayant/mbppol
This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algorithm" accepted at NeurIPS 2022.
akjayant/Image-Captioning-via-YOLOv5-EncoderDecoderwithAttention
Image Captioning using combination of object detection via YOLOv5 and Encoder Decoder LSTM model
akjayant/Coding_Reinforcement_Learning
Implementation of basic RL steps and algorithms - Dynamic Programming approach, Monte-Carlo approach, DQN on Atari, Policy Gradient - Reinforce with baseline, Actor Critic (A2C)
akjayant/Sentiment-Analysis
Twitter Sentiment Analysis
akjayant/interpretable-ml-book
Book about interpretable machine learning
akjayant/Neural-Architecture-Search-Project
Attempt of Efficient Neural Architecture Search for searching DQN approximation neural network architecture
akjayant/News-Article-Recommendation-via-Contextual-Bandits
Recommendation using LinUCB algorithm
akjayant/2020_CARLA_challenge
"Learning by Cheating" (CoRL 2019) submission for the 2020 CARLA Challenge
akjayant/acme
A library of reinforcement learning components and agents
akjayant/akjayant.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
akjayant/Analysis-of-Codeforces-Data
To analyze correlation between coding style and coding proficiency, and whether coding styles show regional variations.
akjayant/Carla-ppo
This repository hosts a customized PPO based agent for Carla. The goal of this project is to make it easier to interact with and experiment in Carla with reinforcement learning based agents -- this, by wrapping Carla in a gym like environment that can handle custom reward functions, custom debug output, etc.
akjayant/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features
akjayant/coach
Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms
akjayant/COVID19-Short-Term-Forecasting-and-Cluster-Analysis
Covid 19 cases Forecasting using ARIMA in India, US, Italy, Belgium & Patient data(anonymized) cluster analysis of India
akjayant/Custom_YOLOv5
Using Ultralytics YOLOv5 to train a custom dataset
akjayant/Data-Structures-Algorithms
akjayant/Deep-Learning-PyTorch
For recapping Feed Forward Neural Networks, CNNs, RNNs, LSTMs in PyTorch
akjayant/Deep-Reinforcement-Learning-Hands-On
Hands-on Deep Reinforcement Learning, published by Packt
akjayant/l2rpn-baselines
L2RPN Baselines a repository to host baselines for l2rpn competitions.
akjayant/LabMeetPapers
Papers discussed during lab meetings for Intelligent Systems Lab at IISc.
akjayant/ML_DL_Research_collab_base
This is a repository of where we are trying to have all the knowledge base collected.
akjayant/Recommendation-via-Latent-Reprsentations
Recommendation using non negative matrix factorization
akjayant/rlax
akjayant/safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
akjayant/shieldNN2020
akjayant/Spectral-Clustering
Spectral Clustering is a technique to cluster data which finds use in community detetction applications
akjayant/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.