how to deal with multi-turn dialogue for octivius?
joez17 opened this issue · 0 comments
joez17 commented
It seems that in Octivius, lora-moe uses conversation[0]['value'] to obtain the soft_gate value.
There are 2 questions:
1 Where are the system message and modality embedding introduced into gate activation?
2. In the case of multi-turn dialogues, incorporating only the initial question for gate computation throughout the entire conversation seems illogical.
Could there be aspects I'm misunderstanding? Please help clarify my confusion. Thanks!