Pinned Repositories
awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
AzureML-BERT
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
bort
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
CDial-GPT
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
CHINESE-MEDICINE-QUESTION-GENERATION
“万创杯”中医药天池大数据竞赛——中医文献问题生成挑战 决赛 第一名方案
chinese_chatbot_corpus
中文公开聊天语料库
CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
gpt2-ml-finetune-
根据gpt2-ml中文模型finetune自己的数据集
nlp_paper_study
研读顶会论文,复现论文相关代码
Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
wind91725's Repositories
wind91725/gpt2-ml-finetune-
根据gpt2-ml中文模型finetune自己的数据集
wind91725/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
wind91725/bort
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
wind91725/CHINESE-MEDICINE-QUESTION-GENERATION
“万创杯”中医药天池大数据竞赛——中医文献问题生成挑战 决赛 第一名方案
wind91725/CLUECorpus2020
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
wind91725/CPM-Generate
Chinese Pre-Trained Language Models (CPM-LM) Version-I
wind91725/CPM-LM-TF2
wind91725/CPM_LM_bert4keras
在bert4keras下加载CPM_LM模型
wind91725/DALLE-pytorch
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
wind91725/DeepIE
DeepIE: Deep Learning for Information Extraction
wind91725/delta
DELTA is a deep learning based natural language and speech processing platform.
wind91725/Dive-into-DL-PyTorch
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
wind91725/FewShotMultiLabel
Code for AAAI2021 paper: Few-Shot Learning for Multi-label Intent Detection.
wind91725/gpt-neo
An implementation of model parallel GPT2& GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.
wind91725/guwenbert
GuwenBERT: 古文预训练语言模型 a Pre-trained Language Model for Classical Chinese (Literary Chinese)
wind91725/InfoBERT
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu
wind91725/lightseq
LightSeq: A High Performance Library for Sequence Processing and Generation
wind91725/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
wind91725/nlp-gym
NLPGym - A toolkit to develop RL agents to solve NLP tasks.
wind91725/NLP-Interview-Notes
本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料,该资料目前包含 自然语言处理各领域的 面试题积累。
wind91725/nlp_tutorial
NLP入门指南
wind91725/NLPDataSet
记录本人整理的一些数据集
wind91725/nlprule
Rule-based grammatical error correction through parsing LanguageTool rules in Rust w/ bindings for Python.
wind91725/nndl.github.io
《神经网络与深度学习》 邱锡鹏著 Neural Network and Deep Learning
wind91725/pretrained-models
Open Language Pre-trained Model Zoo
wind91725/pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
wind91725/sentence-transformers
Sentence Embeddings with BERT & XLNet
wind91725/Summarization-Papers
Summarization Papers
wind91725/UER-py
Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
wind91725/UNIMO
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning