missflash's Stars
oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
eungbean/Docker-for-AI-Researcher
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
lerrel/Grasp-Detector
Code to detect planar grasps
Wenxuan-Zhou/EPI
Code for Environment Probing Interaction Policies [ICLR 2019]
facebookresearch/pyrobot
PyRobot: An Open Source Robotics Research Platform
jhejna/hierarchical_morphology_transfer
Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer"
wilson1yan/rlpyt
Reinforcement Learning in PyTorch
zzyunzhi/vds
Code for Automatic Curriculum Learning through Value Disagreement
alexsax/robust-policies-via-midlevel-vision
wilson1yan/contrastive-forward-model
sarahisyoung/Visual-Imitation-Made-Easy
MishaLaskin/rad
RAD: Reinforcement Learning with Augmented Data
jhejna/morphology-opt
Code for the paper Task Agnostic Morphology Evolution.
nicklashansen/policy-adaptation-during-deployment
Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.
sjtuzq/Cycle_Dynamics
[ICLR2021, Oral] Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency
denisyarats/proto
Proto-RL: Reinforcement Learning with Prototypical Representations
facebookresearch/drqv2
DrQ-v2: Improved Data-Augmented Reinforcement Learning
bennevans/iida
jyopari/VINN
denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
NYU-robot-learning/DIME-Models
Models implemented on the Dexterous Arm
siddhanthaldar/ROT
Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport
notmahi/bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
jeffacce/play-to-policy
From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data
notmahi/clip-fields
Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields
siddhanthaldar/FISH
Code for Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations
SridharPandian/Holo-Dex
Official Implementation of Holo-Dex: Teaching Dexterity with Immersive Mixed Reality
datamllab/tods
TODS: An Automated Time-series Outlier Detection System
nlpai-lab/KULLM
☁️ KULLM (구름, "Cloud"): an LLM specialized for Korean, developed at Korea University