Pinned Repositories
dsir
DSIR large-scale data selection framework for language model training
Luxai-s2-Baseline
d3po
Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
DeepRL-Tutorials
Contains high-quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
make-pull-request
Use this as a learning repo for creating successful pull requests. Very basic tasks in Python, HTML, CSS, JavaScript, and Java.
Statistical-Learning-Method_Code
Hand-written implementations of all the algorithms in Li Hang's book "Statistical Learning Methods"
iTransformer
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight), https://openreview.net/forum?id=JePfAI8fah
Superfiltering
[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning
Schopenhauer-loves-Hegel's Repositories
Schopenhauer-loves-Hegel/d3po
Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Schopenhauer-loves-Hegel/DeepRL-Tutorials
Contains high-quality implementations of Deep Reinforcement Learning algorithms written in PyTorch
Schopenhauer-loves-Hegel/make-pull-request
Use this as a learning repo for creating successful pull requests. Very basic tasks in Python, HTML, CSS, JavaScript, and Java.
Schopenhauer-loves-Hegel/Statistical-Learning-Method_Code
Hand-written implementations of all the algorithms in Li Hang's book "Statistical Learning Methods"