hank0316

MS student at National Taiwan University.

Pinned Repositories

reward-bench
RewardBench: the first evaluation tool for reward models.
Language: Python
437 5 6951
adapter-transfer
Experiments of Transfer Learning on adapter.
Language: Python
0 1 00
alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Language: Python
0 0 00
Challenge2020-Homework
2020 NTU CSIE Camp Challenge Homework
Language: Python
0 0 00
CNL
Neural-network-based Mail Server Light (NMSL)
Language: Python
0 1 00
DLHLP-Prosody
0 1 00
DogeRM
The code used in the paper "DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging"
Language: Python
2 2 00
s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Language: Python
2.3k 46 398487
HEAR-2021-NeurIPS-Challenge---NTU-GURA
Language: Python
12 1 04
ML2022-Spring
**Official** 李宏毅 (Hung-yi Lee) Machine Learning 2022 Spring
Language: Jupyter Notebook
2.1k 22 4496

hank0316's Repositories

hank0316/adapter-transfer
Experiments of Transfer Learning on adapter.
Language: Python
0 1 00
hank0316/alpaca_farm
A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.
Language: Python
0 0 00
hank0316/Challenge2020-Homework
2020 NTU CSIE Camp Challenge Homework
Language: Python
0 0 00
hank0316/CNL
Neural-network-based Mail Server Light (NMSL)
Language: Python
0 1 00
hank0316/DLHLP-Prosody
0 1 00
hank0316/fai_final_project
Add change hole cards feature
Language: Python
0 0 00
hank0316/Line-Bot
A Line Bot implementation.
Language: Python
0 1 00
hank0316/ML_HW4_Dataset
0 0 02
hank0316/s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit.
Language: Python
0 0 00
hank0316/reward-bench
RewardBench: the first evaluation tool for reward models.
Language: Python
0 0
hank0316/web-programming
Language: HTML
1 0