gpengzhi's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
meta-llama/llama
Inference code for Llama models
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
facebookresearch/fastText
Library for fast text representation and classification.
openai-translator/openai-translator
基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.
Hannibal046/Awesome-LLM
Awesome-LLM: a curated list of Large Language Model
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
QwenLM/Qwen
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
meta-llama/llama-recipes
Scripts for fine-tuning Meta Llama with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment. Demo apps to showcase Meta Llama for WhatsApp & Messenger.
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Instruction-Tuning-with-GPT-4/GPT-4-LLM
Instruction Tuning with GPT-4
baichuan-inc/Baichuan2
A series of large language models developed by Baichuan Intelligent Technology
OpenNMT/CTranslate2
Fast inference engine for Transformer models
baichuan-inc/Baichuan-13B
A 13B large language model developed by Baichuan Intelligent Technology
yuchenlin/rebiber
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
rsennrich/subword-nmt
Unsupervised Word Segmentation for Neural Machine Translation and Text Generation
microsoft/MASS
MASS: Masked Sequence to Sequence Pre-training for Language Generation
yaodongC/awesome-instruction-dataset
A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)
fe1ixxu/ALMA
State-of-the-art LLM-based translation models.
PeterTheOne/slideslive-slides-dl
slideslive slides downloading script
wmt-conference/wmt23-news-systems
gpengzhi/Bi-SimCut
Code for NAACL 2022 main conference paper "Bi-SimCut: A Simple Strategy for Boosting Neural Machine Translation"
gpengzhi/CrossConST-MT
Code for Findings of ACL 2023 paper "Improving Zero-shot Multilingual Neural Machine Translation by Leveraging Cross-lingual Consistency Regularization"
gpengzhi/CrossConST-SR
Code for EMNLP 2023 industry track paper "Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization"
gpengzhi/SimCR
Code for NAACL 2024 main conference paper "An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation"
gpengzhi/CrossConST-LLM
Code for arXiv paper "Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models"