missflash

missflash's Stars

oobabooga/text-generation-webui
A Gradio web UI for Large Language Models with support for multiple inference backends.
Language:Python41.3k5.4k
eungbean/Docker-for-AI-Researcher
Language:Shell4616
microsoft/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Language:Python36k4.2k
lerrel/Grasp-Detector
Code to detect planar grasps
Language:Python4512
Wenxuan-Zhou/EPI
Code for Environment Probing Interaction Policies [ICLR 2019]
Language:Python291
facebookresearch/pyrobot
PyRobot: An Open Source Robotics Research Platform
Language:Python2.3k352
jhejna/hierarchical_morphology_transfer
Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer"
Language:Python175
wilson1yan/rlpyt
Reinforcement Learning in PyTorch
Language:Python266
zzyunzhi/vds
Code for Automatic Curriculum Learning through Value Disagreement
Language:Python3012
alexsax/robust-policies-via-midlevel-vision
Language:Python181
wilson1yan/contrastive-forward-model
Language:Python308
sarahisyoung/Visual-Imitation-Made-Easy
Language:Python182
MishaLaskin/rad
RAD: Reinforcement Learning with Augmented Data
Language:Jupyter Notebook40271
jhejna/morphology-opt
Code for the paper Task Agnostic Morphology Evolution.
Language:Python204
nicklashansen/policy-adaptation-during-deployment
Training code and evaluation benchmarks for the "Self-Supervised Policy Adaptation during Deployment" paper.
Language:Python11324
sjtuzq/Cycle_Dynamics
[ICLR2021, Oral] Learning Cross-Domain Correspondence for Control with Dynamics Cycle-Consistency
Language:Python468
denisyarats/proto
Proto-RL: Reinforcement Learning with Prototypical Representations
Language:Python8215
facebookresearch/drqv2
DrQ-v2: Improved Data-Augmented Reinforcement Learning
Language:Python36788
bennevans/iida
Language:Jupyter Notebook82
jyopari/VINN
Language:HTML4510
denisyarats/exorl
ExORL: Exploratory Data for Offline Reinforcement Learning
Language:Python1059
NYU-robot-learning/DIME-Models
Models implemented on the Dexterous Arm
Language:Python273
siddhanthaldar/ROT
Code for Watch and Match: Supercharging Imitation with Regularized Optimal Transport
Language:Python7513
notmahi/bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
Language:Python11317
jeffacce/play-to-policy
From Play to Policy: Conditional Behavior Generation from Uncurated Robot Data
Language:Python514
notmahi/clip-fields
Teaching robots to respond to open-vocab queries with CLIP and NeRF-like neural fields
Language:Python15918
siddhanthaldar/FISH
Code for Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations
Language:Python6511
SridharPandian/Holo-Dex
Official Implementation of Holo-Dex: Teaching Dexterity with Immersive Mixed Reality
Language:Python446
datamllab/tods
TODS: An Automated Time-series Outlier Detection System
Language:Python1.5k194
nlpai-lab/KULLM
☁️ 구름(KULLM): 고려대학교에서 개발한, 한국어에 특화된 LLM
57772