chinganc

Pinned Repositories

alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Language:Python1 0 00
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Language:Jupyter Notebook0 0 03
Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
Language:Python00
chinganc.github.io
Language:HTML0 2 00
CQL
Code for conservative Q-learning
Language:Python0 0 00
d4rl
A benchmark for offline reinforcement learning.
Language:Python0 0 00
librl
Language:Python2 0 03
lightATAC
Language:Python2 3 01
mamba
Language:Python3 1 00
Trace
Trace, the New AutoDiff for AI Systems and LLM Agents
Language:Python293 9 518

chinganc's Repositories

chinganc/mamba
Language:Python3 1 00
chinganc/librl
Language:Python2 0 03
chinganc/lightATAC
Language:Python2 3 01
chinganc/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
Language:Python1 0 00
chinganc/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Language:Jupyter Notebook0 0 03
chinganc/Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
Language:Python00
chinganc/chinganc.github.io
Language:HTML0 2 00
chinganc/CQL
Code for conservative Q-learning
Language:Python0 0 00
chinganc/d4rl
A benchmark for offline reinforcement learning.
Language:Python0 0 00
chinganc/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language:Python0 0 00
chinganc/garage
A toolkit for reproducible reinforcement learning research.
Language:Python0 0 00
chinganc/hand_dapg
Repository to accompany RSS 2018 paper on dexterous hand manipulation
Language:Python0 0 00
chinganc/metaworld
An open source robotics benchmark for meta- and multi-task reinforcement learning
Language:Python0 0 00
chinganc/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
Language:Python0 0
chinganc/IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
Language:Python0 0
chinganc/mj_envs
A collection of MuJoCo based environments.
Language:Python0 0
chinganc/mjrl
Reinforcement learning algorithms for MuJoCo tasks
Language:Python0 0
chinganc/object_completion
Language:Python1 0
chinganc/Organized-LLM-Agents
Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source codes for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".
Language:Python
chinganc/Parrot_Paraphraser
A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
Language:Python0 0
chinganc/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language:Python0 0
chinganc/robomimic
robomimic: A Modular Framework for Robot Learning from Demonstration
Language:Python0 0
chinganc/robosuite
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
Language:Python0 0
chinganc/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.