Tachiwin is an open-source project developing Large Language Models (LLMs) for indigenous languages of Mexico, focusing on Tutunakú linguistic resources.
- Llama 3.1 8B Instruct pretraining on indigenous language corpora
- Domain-specific fine-tuning for translation and linguistic tasks
- Model deployment and inference pipeline
- Python 3.10+
- PyTorch
- Transformers
- Unsloth
- Llama 3.1 8B Instruct weights
Fully functional app to demonstrate the translation capabilities offline or online
- Multilingual support (Tutunakú/Spanish/English)
- Low-resource language model development
- Open-source linguistic technology
Apache 2.0
- Luis J Camargo
- Fidencio Hernández Hernández