Scripts for experiments on "ALIGNMENT AS TOKEN DISTRIBUTION SHIFT"

Question

Scripts for experiments on "ALIGNMENT AS TOKEN DISTRIBUTION SHIFT"

Closed this issue 9 months ago · 5 comments

Hello!

Great work. Do you plan on releasing the code for the analysis you do in your paper in the subsection "ALIGNMENT AS TOKEN DISTRIBUTION SHIFT"? Thank You!

Answer 1 · 2023-12-20T05:00:10.000Z

Hey Sreyan,

Thanks for the question. Yes, I will clean the code and upload it here soon. Will ping you here once it is ready! :D

Best,
Yuchen

Answer 2 · 2023-12-21T02:55:19.000Z

Thank You! Looking forward!

Answer 3 · 2023-12-24T03:40:11.000Z

Hi @yuchenlin,

I am trying to replicate some of your analysis with different combinations of LLMs and training datasets, while you are getting your code ready. However, I am not sure how in the huggingface model.generate() function one can return a list of top-k tokens at each step of decoding. Can you please help me with this?

Thank You!

Answer 4 · 2023-12-24T08:41:39.000Z

Hey @Sreyan88,

I uploaded a (very rough) version of the code about token distribution analysis here: https://github.com/Re-Align/AlignTDS

Would u plz take a look? (I haven't got time to make better documentation and testing yet (will do soon), so please be prepared that there might be a few minor bugs).

Best,
Yuchen

Answer 5 · 2023-12-27T02:39:52.000Z

Hi @yuchenlin ,

Thank You so much for the scripts! Had a look and looks promising! Again, great work!