jiangyingjunn's Stars
esbatmop/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
WisdomShell/codeshell-vscode
An intelligent coding assistant plugin for Visual Studio Code, developed based on CodeShell
lemon0830/XLMP
models for Soft Language Clustering for Multilingual Model Pre-training
lemon0830/TDT
Code for paper Task-guided Disentangled Tuning for Pretrained Language Models. Findings of ACL 2022 .
lemon0830/RAN
Code for Recurrent Attention for Neural Machine Translation. EMNLP 2021
lemon0830/dualNER
code for dualNER, emnlp 2022
lemon0830/promptCSE
code for promptCSE, emnlp 2022
nlpxucan/WizardLM
LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath
lemon0830/TIM
code for Teaching LM to Translate with Comparison
AGI-Edgerunners/LLM-Adapters
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
hpcaitech/ColossalAI
Making large AI models cheaper, faster and more accessible
CLUEbenchmark/CLUE
中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
libeineu/GPKD
The codebase of paper:Learning Light-Weight Translation Models from Deep Transformer, which is accepted by AAAI2021 conference.
NiuTrans/ABigSurvey
A collection of 1000+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML).
libeineu/Context-Aware
The implementation of "Does Multi-Encoder Help? A Case Study on Context-AwareNeural Machine Translation"
NiuTrans/MTBook
《机器翻译:基础与模型》肖桐 朱靖波 著 - Machine Translation: Foundations and Models
NiuTrans/MT-paper-lists
MT paper lists (by conference)
carpedm20/ENAS-pytorch
PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"
formulahendry/955.WLB
955 不加班的公司名单 - 工作 955,work–life balance (工作与生活的平衡)
tianzhipengfei/simple_script
Collect some simple scripts
quark0/darts
Differentiable architecture search for convolutional and recurrent networks
gopala-kr/meta-learning
meta-learning research
NiuTrans/NiuTensor
NiuTensor is an open-source toolkit developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. It provides tensor utilities to create and train neural networks.
NiuTrans/NiuTrans.SMT
NiuTrans.SMT is an open-source statistical machine translation system developed by a joint team from NLP Lab. at Northeastern University and the NiuTrans Team. The NiuTrans system is fully developed in C++ language. So it runs fast and uses less memory. Currently it supports phrase-based, hierarchical phrase-based and syntax-based (string-to-tree, tree-to-string and tree-to-tree) models for research-oriented studies.