chi2liu/mamba-gpt-3b
It is almost the best 3B model in the current open source industry, surpassing Dolly v2-3b, open lama-3b, and even outperforming the EleutherAI/pythia-12b model in terms of performance. Can refer to open_llm_leaderboard
Apache-2.0
Issues
- 0
What architecture is it?
#2 opened by chen-yingfa - 0
mamba-gpt-7b-v2 fine-tuning approach
#1 opened by chitangwa