NiuTrans Open Source
The NiuTrans Open Source (NOS) project, maintained by NLP Lab at Northeastern University and NiuTrans Research, aims at developing cutting edge NLP systems.
Shenyang, Liaoning, China
Pinned Repositories
ABigSurvey
A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).
ABigSurveyOfLLMs
A collection of 150+ surveys on LLMs
Classical-Modern
非常全的文言文(古文)-现代文平行语料
CNSurvey
一份中文综述文章列表(自然语言处理&机器学习)
MTBook
《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models
NASPapers
Paper lists of neural architecture search (NAS)
NiuTensor
NiuTensor is an open-source toolkit developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. It provides tensor utilities to create and train neural networks.
NiuTrans.NMT
A Fast Neural Machine Translation System developed in C++.
NiuTrans.SMT
NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. The NiuTrans system is fully developed in C++ language. So it runs fast and uses less memory. Currently it supports phrase-based, hierarchical phrase-based and syntax-based (string-to-tree, tree-to-string and tree-to-tree) models for research-oriented studies.
Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
NiuTrans Open Source's Repositories
NiuTrans/MTBook
《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models
NiuTrans/ABigSurvey
A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).
NiuTrans/Classical-Modern
非常全的文言文(古文)-现代文平行语料
NiuTrans/CNSurvey
一份中文综述文章列表(自然语言处理&机器学习)
NiuTrans/NiuTensor
NiuTensor is an open-source toolkit developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. It provides tensor utilities to create and train neural networks.
NiuTrans/ABigSurveyOfLLMs
A collection of 150+ surveys on LLMs
NiuTrans/NiuTrans.SMT
NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. The NiuTrans system is fully developed in C++ language. So it runs fast and uses less memory. Currently it supports phrase-based, hierarchical phrase-based and syntax-based (string-to-tree, tree-to-string and tree-to-tree) models for research-oriented studies.
NiuTrans/NiuTrans.NMT
A Fast Neural Machine Translation System developed in C++.
NiuTrans/NASPapers
Paper lists of neural architecture search (NAS)
NiuTrans/MT-paper-lists
MT paper lists (by conference)
NiuTrans/Vision-LLM-Alignment
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
NiuTrans/LanguageCodes
We present a list of languages with their codes, families, regions and etc. We also present a list of multi-lingual corpora (with urls).
NiuTrans/compiler-notes
NiuTrans/Introduction-to-Transformers
An introduction to basic concepts of Transformers and key techniques of their recent advances.
NiuTrans/MTVenues
A list of conferences and journals relevant to machine translation
NiuTrans/ForgettingCurve
A benchmark for testing memorization abilities of LMs
NiuTrans/Hands-on-GEMM
A tutorial on GEMM
NiuTrans/NiuLoad