Pinned Repositories
Guoxu1233
Config files for my GitHub profile.
maddpg_mpe
MADDPG for collecting data
madiffusion_mpe
Madiff on self dataset
RLforUTracking
Deep Reinforcement Learning (RL) algorithms for underwater target tracking with Autonomous Underwater Vehicles (AUV)
Tianji
从零学习,制作懂人情世故的大语言模型
qwen-dpo
通义千问的DPO训练
Guoxu1233's Repositories
Guoxu1233/Guoxu1233
Config files for my GitHub profile.
Guoxu1233/maddpg_mpe
MADDPG for collecting data
Guoxu1233/madiffusion_mpe
Madiff on self dataset
Guoxu1233/RLforUTracking
Deep Reinforcement Learning (RL) algorithms for underwater target tracking with Autonomous Underwater Vehicles (AUV)
Guoxu1233/Tianji
从零学习,制作懂人情世故的大语言模型