Pinned Repositories
DeepSpeedExamples
Example models using DeepSpeed
learning
A collection of materials from Xiaowu's learning journey
MachineLearningInAction
magenta
Magenta: Music and Art Generation with Machine Intelligence
sun_idea_package
xiaowu.nlp
NLP course
sunxiaowu's Repositories
sunxiaowu/transpeeder_rlhf
Reinforcement learning training code supporting LLaMA, Qwen, DeepSeek, and similar architectures, including PPO, DPO, and other methods
sunxiaowu/DeepSpeedExamples
Example models using DeepSpeed
sunxiaowu/learning
A collection of materials from Xiaowu's learning journey
sunxiaowu/MachineLearningInAction
sunxiaowu/magenta
Magenta: Music and Art Generation with Machine Intelligence
sunxiaowu/sun_idea_package
sunxiaowu/xiaowu.nlp
NLP course