yuchenlin/LLM-Blender

[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the diverse strengths of multiple open-source LLMs. LLM-Blender cut the weaknesses through ranking and integrate the strengths through fusing generation to enhance the capability of LLMs.

PythonApache-2.0

Issues

Training the GenFuser
#24 opened 25 days ago by jonsaadfalcon
2
Make `llm-blender` available on PyPI
#23 opened a month ago by lewtun
2
Split dependencies so that core install is light for model inference
#22 opened a month ago by lewtun
6
GPU Memory Requirement on Retraining PairRM
#20 opened 2 months ago by harshyadav17
9
How to change models to train ranker?
#21 opened 2 months ago by ashmalvayani
1
How to act as reward model for RLHF
#16 opened 3 months ago by Andrewzh112
1
Training ranker on Unified-Feedback
#17 opened 3 months ago by yifei-he
2
undefined symbol: _ZNK3c1010TensorImpl36is_contiguous_nondefault_policy_implENS_12MemoryFormatE
#15 opened 3 months ago by vermabasu
1
Data Generation Code
#12 opened 4 months ago by tgyuan21
4
Replicate Experiments
#13 opened 4 months ago by RaccoonOnion
1
Support for MPS device
#14 opened 4 months ago by fangyuan-ksgk
1
Load PairRM ranker without Internet access failed
#11 opened 6 months ago by kai01ai
4
RuntimeError: in loading state_dict for CrossCompareReranker
#10 opened 6 months ago by NISH1001
3
name 'vllm' is not defined
#9 opened 6 months ago by siddiquelatif
3
Unable to reproduce the results in the paper
#8 opened 10 months ago by wang-debug
10
Issues with the split of dataset
#7 opened a year ago by Cascol-Chen
4
Issue with calculating >= Vic and OA
#6 opened a year ago by Cascol-Chen
14
Question about the fuser outputs
#5 opened a year ago by sai4july
4
Why the matrix M is not Symmetrical along the diagonal?
#4 opened a year ago by MAxx8371
6
Json parse error when downloading dataset from hf
#3 opened a year ago by KaiLv69
3
Issue with downloading dataset from HuggingFace
#2 opened a year ago by swarnaHub
3
Why all the models used in the experiments are released after ACL2023 submission deadline?
#1 opened a year ago by Yuanhy1997
4