xwinks's Stars
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
simpler-env/SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
Blealtan/efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
facebookresearch/digit-interface
Python interface for the DIGIT tactile sensor
pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
xai-org/grok-1
Grok open release
JamesQFreeman/LoRA-ViT
Low rank adaptation for Vision Transformer
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
ml-jku/L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
MzeroMiko/VMamba
VMamba: Visual State Space Models; the code is based on Mamba
state-spaces/mamba
Mamba SSM architecture
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
octo-models/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
lucidrains/mixture-of-experts
A PyTorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
google-research/vmoe
robfiras/loco-mujoco
Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.
wuphilipp/gello_software
TToTMooN/paco-mtrl
antoine77340/S3D_HowTo100M
S3D Text-Video model trained on HowTo100M using MIL-NCE
tinnerhrhe/MTDiff
ikostrikov/rlpd
rail-berkeley/rlkit
Collection of reinforcement learning algorithms
UT-Austin-RPL/maple
Official codebase for Manipulation Primitive-augmented reinforcement Learning (MAPLE)
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
camel-ai/camel
🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any object in videos, either automatically or interactively. The primary algorithms used are the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation.
opendilab/awesome-model-based-RL
A curated list of awesome model-based RL resources (continually updated)