thu-ml/RoboticsDiffusionTransformer
RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation
PythonMIT
Issues
- 0
thanks forHow much graphics memory does fine-tuning approximately require for a graphics card?
#6 opened by lanlankilkil - 3
Positional Encoding Questions
#38 opened by motor-x - 1
- 3
Fail to read dirty bit
#29 opened by wu-yutong-525 - 1
missing 'qpos' in bridgeV2 dataset
#35 opened by huangxu1991 - 2
- 1
Port to LeRobot Dataset v2.0?
#40 opened by ivelin - 1
Have you considered providing a Docker image?
#34 opened by soul667 - 1
The role of dirty bit
#36 opened by COST-97 - 1
Data cleaning - detecting failed episodes
#41 opened by Wonder1905 - 1
May i ask if it deploy on a new robot and fine-tuning it with only 1-5tracks?How is the generalization?
#33 opened by lanlankilkil - 1
Do you have any simulation to recommend?
#32 opened by gxy-TJU - 1
- 1
QKN and RMSN, which is more effective?
#39 opened by jshang-bdai - 3
- 5
- 2
- 11
Pretrain Dataset Question:
#25 opened by motor-x - 18
Issue with Mobile Aloha Inference in MuJoCo: Robot Wandering Without Performing Actions
#24 opened by yongzhengqi - 3
What are the problems/challenges for this robot manipulation solution: 3D reconstruction then ROS control the arm to take the thing?
#27 opened by guotong1988 - 3
- 9
Fine-Tuning in Relative Action Space
#14 opened by lakomchik - 2
Confused about normalization
#23 opened by alik-git - 4
Mobile ALOHA Dataset Conversion Issues
#16 opened by WangZhiXiong0 - 1
- 1
Preprocessed dataset
#17 opened by COST-97 - 5
RuntimeError: mat1 and mat2 must have the same dtype, but got BFloat16 and Float
#21 opened by liushuailxx - 8
Some doubts regarding fine-tuning
#20 opened by Henry-Ellis - 2
Confusions Regarding Finetuning with New Dataset
#22 opened by JinmingM - 0
RuntimeError: mat1 and mat2 must have the same dtype, but got BFloat16 and Float
#18 opened by liushuailxx - 2
- 1
Dataloder thread issues
#12 opened by budzianowski - 2
[Question about Input Configuration in RDT-1b] Why Concatenate Action Noise, Proprio Z, and Frequency C?
#9 opened by Mark-98 - 1
- 7
How to run inference of the model with a single image and no proprioception data?
#5 opened by alik-git - 2
thanks for your work. How much graphics memory does fine-tuning approximately require for a graphics card?
#7 opened by lanlankilkil - 11
Can a single graphics card with 24 GB of video memory load the t5-v1_1-xxl model? Do smaller language models still exist?
#3 opened by qiynxx - 1
Fine-tuned RDT checkpoint
#4 opened by lakomchik - 1
Nice work! How do you deal with different embodiments when sharing the same action space?
#1 opened by StarCycle