Pinned Repositories
reward-bench
RewardBench: the first evaluation tool for reward models.
adapter-transfer
Experiments of Transfer Learning on adapter.
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Challenge2020-Homework
2020 NTU CSIE Camp Challenge Homework
CNL
Neural-network-based Mail Server Light (NMSL)
DLHLP-Prosody
DogeRM
The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
HEAR-2021-NeurIPS-Challenge---NTU-GURA
ML2022-Spring
**Official** 李宏毅 (Hung-yi Lee) 機器學習 Machine Learning 2022 Spring
hank0316's Repositories
hank0316/adapter-transfer
Experiments of Transfer Learning on adapter.
hank0316/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
hank0316/Challenge2020-Homework
2020 NTU CSIE Camp Challenge Homework
hank0316/CNL
Neural-network-based Mail Server Light (NMSL)
hank0316/DLHLP-Prosody
hank0316/fai_final_project
Add change hole cards feature
hank0316/Line-Bot
A Line Bot implementation.
hank0316/ML_HW4_Dataset
hank0316/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
hank0316/reward-bench
RewardBench: the first evaluation tool for reward models.
hank0316/web-programming