xwinks

I am a Ph.D. candidate at the Renmin University of China.

Beijing

xwinks's Stars

OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
Language:Python1.9k189
simpler-env/SimplerEnv
Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Google Robot, WidowX+Bridge)
Language:Jupyter Notebook22826
Blealtan/efficient-kan
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
Language:Python3.8k338
facebookresearch/digit-interface
Python interface for the DIGIT tactile sensor
Language:Python6420
pytorch/torchtune
A Native-PyTorch Library for LLM Fine-tuning
Language:Python3.9k340
OpenGVLab/LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Language:Python5.7k372
Alpha-VLLM/LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Language:Python2.7k170
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Language:Python13.1k1.1k
xai-org/grok-1
Grok open release
Language:Python49.4k8.3k
JamesQFreeman/LoRA-ViT
Low rank adaptation for Vision Transformer
Language:Python33814
XueFuzhao/awesome-mixture-of-experts
A collection of AWESOME things about mixture-of-experts
90568
ml-jku/L2M
Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)
Language:Python485
MzeroMiko/VMamba
VMamba: Visual State Space Models，code is based on mamba
Language:Python2k111
state-spaces/mamba
Mamba SSM architecture
Language:Python12.3k1k
google/flax
Flax is a neural network library for JAX that is designed for flexibility.
Language:Python5.9k623
octo-models/octo
Octo is a transformer-based robot policy trained on a diverse mix of 800k robot trajectories.
Language:Python738140
lucidrains/mixture-of-experts
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
Language:Python59747
google-research/vmoe
Language:Jupyter Notebook55451
robfiras/loco-mujoco
Imitation learning benchmark focusing on complex locomotion tasks using MuJoCo.
Language:Python51742
wuphilipp/gello_software
Language:Python9425
TToTMooN/paco-mtrl
Language:Python222
antoine77340/S3D_HowTo100M
S3D Text-Video model trained on HowTo100M using MIL-NCE
Language:Python18821
tinnerhrhe/MTDiff
Language:Python462
ikostrikov/rlpd
Language:Python19522
rail-berkeley/rlkit
Collection of reinforcement learning algorithms
Language:Python2.4k548
UT-Austin-RPL/maple
Official codebase for Manipulation Primitive-augmented reinforcement Learning (MAPLE)
Language:Python7213
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language:Python2.2k285
camel-ai/camel
🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org
Language:Python5.3k637
z-x-yang/Segment-and-Track-Anything
An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.
Language:Jupyter Notebook2.7k332
opendilab/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
85446

xwinks

xwinks's Stars

OpenRLHF/OpenRLHF

simpler-env/SimplerEnv

Blealtan/efficient-kan

facebookresearch/digit-interface

pytorch/torchtune

OpenGVLab/LLaMA-Adapter

Alpha-VLLM/LLaMA2-Accessory

QwenLM/Qwen

xai-org/grok-1

JamesQFreeman/LoRA-ViT

XueFuzhao/awesome-mixture-of-experts

ml-jku/L2M

MzeroMiko/VMamba

state-spaces/mamba

google/flax

octo-models/octo

lucidrains/mixture-of-experts

google-research/vmoe

robfiras/loco-mujoco

wuphilipp/gello_software

TToTMooN/paco-mtrl

antoine77340/S3D_HowTo100M

tinnerhrhe/MTDiff

ikostrikov/rlpd

rail-berkeley/rlkit

UT-Austin-RPL/maple

pytorch/rl

camel-ai/camel

z-x-yang/Segment-and-Track-Anything

opendilab/awesome-model-based-RL