/bangla-llama

A New Bangla Large Language Model (LLM) Based on Llama 2

Primary LanguageJupyter NotebookGNU General Public License v3.0GPL-3.0

Bangla-Llama: A Family of LLaMA-based LLMs focused on Bangla Language

Description

This repository contains the code and models for "Bangla-Llama", a project focused on enhancing the performance of language models for the Bangla language. It builds upon the open-source LLaMA model, introducing additional Bangla tokens and employing the LoRA methodology for efficient training. Please read the technical report for more details.

Table of Contents

Contact

For any queries regarding the codebase or research, please reach out to Abdullah Khan Zehady at brishtiteveja@gmail.com.