EnnaSachdeva

Research Engineer at Honda Research Institute

Honda Research InstituteSan Jose, California

Pinned Repositories

2R-manipulator-force-control
This is a MATLAB simulation for force control of a 2R manipulator using feedback linearization.
Language:MATLAB9 2 02
Algorithms
Language:Python0 2 00
cvpr_dNRI
Code accompanying "Dynamic Neural Relational Inference" from CVPR 2020
Language:Python0 1 00
D_VAE
Language:Python0 3 00
MADyS
Code accompanying Multiagent Learning via Dynamic Skill Selection
Language:Python4 2 01
Multiagent_ERL_heterogeneous_rover_domain
Language:Python0 2 00
Non-holonomic-Trajectory-Planning-Using-the-Bernstein-Basis-Functions
Language:MATLAB1 2 00
Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards
Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single agent tasks. These methods have further been extended to multiagent domains in cooperative, competitive or mixed environments. This paper primarily focuses on multiagent cooperative settings which can be modeled for several real world problems such as coordination of autonomous vehicles and warehouse robots. However, these systems suffer from several challenges such as, structural credit assignment and partial observability. In this paper, we propose Recurrent Multiagent Deep Deterministic Policy Gradient (RMADDPG) algorithm which extends Multiagent Deep Determinisitic Policy Gradient algorithm - MADDPG \cite{lowe2017multi} by using a recurrent neural network for the actor policy. This helps to address partial observability by maintaining a sequence of past observations which networks learn to preserve in order to solve the POMDP. In addition, we use reward shaping through difference rewards to address structural credit assignment in a partially observed environment. We evaluate the performance of MADDPG and R-MADDPG with and without reward shaping in a Multiagent Particle Environment. We further show that reward shaped RMADDPG outperforms the baseline algorithm MADDPG in a partially observable environmental setting.
Language:Python48 4 110
Resume-Screener
A project for CodeDay Labs that screens resumes based on their fit for a Software Engineer New Grad position
Language:Jupyter Notebook1 1 00
Robotics_Informative-Path-Planning
Language:Python1 2 00

EnnaSachdeva's Repositories

EnnaSachdeva/Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards
Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single agent tasks. These methods have further been extended to multiagent domains in cooperative, competitive or mixed environments. This paper primarily focuses on multiagent cooperative settings which can be modeled for several real world problems such as coordination of autonomous vehicles and warehouse robots. However, these systems suffer from several challenges such as, structural credit assignment and partial observability. In this paper, we propose Recurrent Multiagent Deep Deterministic Policy Gradient (RMADDPG) algorithm which extends Multiagent Deep Determinisitic Policy Gradient algorithm - MADDPG \cite{lowe2017multi} by using a recurrent neural network for the actor policy. This helps to address partial observability by maintaining a sequence of past observations which networks learn to preserve in order to solve the POMDP. In addition, we use reward shaping through difference rewards to address structural credit assignment in a partially observed environment. We evaluate the performance of MADDPG and R-MADDPG with and without reward shaping in a Multiagent Particle Environment. We further show that reward shaped RMADDPG outperforms the baseline algorithm MADDPG in a partially observable environmental setting.
Language:Python48 4 110
EnnaSachdeva/MADyS
Code accompanying Multiagent Learning via Dynamic Skill Selection
Language:Python4 2 01
EnnaSachdeva/Non-holonomic-Trajectory-Planning-Using-the-Bernstein-Basis-Functions
Language:MATLAB1 2 00
EnnaSachdeva/Resume-Screener
A project for CodeDay Labs that screens resumes based on their fit for a Software Engineer New Grad position
Language:Jupyter Notebook1 1 00
EnnaSachdeva/Robotics_Informative-Path-Planning
Language:Python1 2 00
EnnaSachdeva/Algorithms
Language:Python0 2 00
EnnaSachdeva/cvpr_dNRI
Code accompanying "Dynamic Neural Relational Inference" from CVPR 2020
Language:Python0 1 00
EnnaSachdeva/D_VAE
Language:Python0 3 00
EnnaSachdeva/Multiagent_ERL_heterogeneous_rover_domain
Language:Python0 2 00
EnnaSachdeva/CIFAR-10-binary-classification
Language:Python2 0
EnnaSachdeva/Computer-Vision-Chroma-Keying
Language:C++2 0
EnnaSachdeva/convert_kitti_to_ros
A useful ROS tool for dealing with KITTI point cloud dataset.
Language:C++1 0
EnnaSachdeva/CppND-Route-Planning-Project
Language:C++2 0
EnnaSachdeva/Deep-Learning-Implemetations
Language:Python2 0
EnnaSachdeva/EKF-Localization
Language:MATLAB
EnnaSachdeva/ennasachdeva.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript2 01
EnnaSachdeva/IROS-2017-COCrIP-Optimization
Optimization for estimating friction coefficient of materials to be used for In-Pipe climbing robot COCRIP (published in IROS-2017) for vertical and bend pipes.
Language:MATLAB2 0
EnnaSachdeva/Leisure_time_stuff
Language:Python2 0
EnnaSachdeva/Machine_learning_algos
Machine learning algorithms
Language:Jupyter Notebook3 0
EnnaSachdeva/MAEDyS
Language:Python1 0
EnnaSachdeva/Papers-Summary
This repository includes the summary of various papers in Robotics and AI, I have read.
3 0
EnnaSachdeva/releasing-research-code
Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)
1 0
EnnaSachdeva/Robotics-Path-Planning
Language:MATLAB2 0
EnnaSachdeva/Rover-Domain
Language:Python2 0
EnnaSachdeva/scenario_runner
Traffic scenario definition and execution engine
Language:Python0 0
EnnaSachdeva/Scribe-Notes
Scribe notes on various topics relevant to robotics and AI.
2 0
EnnaSachdeva/subgoal-discovery
Learning from Trajectories via Subgoal Discovery
Language:Python1 0
EnnaSachdeva/Trajectory-of-a-robot
Language:MATLAB2 0
EnnaSachdeva/Udacity_Computer_Vision_Nanodegree
EnnaSachdeva/vatic
Efficiently Scaling Up Video Annotation with Crowdsourced Marketplaces. IJCV 2012
Language:HTML1 0