Pinned Repositories
alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
chinganc.github.io
CQL
Code for conservative Q-learning
d4rl
A benchmark for offline reinforcement learning.
librl
lightATAC
mamba
Trace
Trace, the New AutoDiff for AI Systems and LLM Agents
chinganc's Repositories
chinganc/mamba
chinganc/librl
chinganc/lightATAC
chinganc/alfworld
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning
chinganc/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
chinganc/Bullet-Safety-Gym
An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems.
chinganc/chinganc.github.io
chinganc/CQL
Code for conservative Q-learning
chinganc/d4rl
A benchmark for offline reinforcement learning.
chinganc/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
chinganc/garage
A toolkit for reproducible reinforcement learning research.
chinganc/hand_dapg
Repository to accompany RSS 2018 paper on dexterous hand manipulation
chinganc/metaworld
An open-source robotics benchmark for meta- and multi-task reinforcement learning
chinganc/DSRL
🔥 Datasets and env wrappers for offline safe reinforcement learning
chinganc/IQL-PyTorch
A PyTorch implementation of Implicit Q-Learning
chinganc/mj_envs
A collection of MuJoCo based environments.
chinganc/mjrl
Reinforcement learning algorithms for MuJoCo tasks
chinganc/object_completion
chinganc/Organized-LLM-Agents
Embodied and organized multi-LLM-agent teams supporting communication for >3 agents. Source code for the paper "Embodied LLM Agents Learn to Cooperate in Organized Teams".
chinganc/Parrot_Paraphraser
A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational engines. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
chinganc/ray
A fast and simple framework for building and running distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
chinganc/robomimic
robomimic: A Modular Framework for Robot Learning from Demonstration
chinganc/robosuite
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
chinganc/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.