WangYuxuan93's Stars
microsoft/TaskMatrix
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
flairNLP/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
LianjiaTech/BELLE
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational large language model)
crownpku/Awesome-Chinese-NLP
A curated list of resources for Chinese natural language processing (NLP)
google-research/text-to-text-transfer-transformer
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
Oneflow-Inc/oneflow
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
zjunlp/DeepKE
[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
timoschick/pet
This repository contains the code for "Exploiting Cloze Questions for Few-Shot Text Classification and Natural Language Inference"
wikipedia2vec/wikipedia2vec
A tool for learning vector representations of words and entities from Wikipedia
GEM-benchmark/NL-Augmenter
NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations
studio-ousia/luke
LUKE -- Language Understanding with Knowledge-based Embeddings
google-research/xtreme
XTREME is a benchmark for the evaluation of the cross-lingual generalization ability of pre-trained multilingual models that covers 40 typologically diverse languages and includes nine tasks.
crownpku/Small-Chinese-Corpus
Some useful small Chinese corpus datasets
MLNLP-World/SimBiber
A tool used by the MLNLP community to help shorten reference lists. It simplifies BibTeX entries using official publication info.
lancopku/Chinese-Literature-NER-RE-Dataset
A Discourse-Level Named Entity Recognition and Relation Extraction Dataset for Chinese Literature Text
thunlp/Chinese_NRE
Source code for ACL 2019 paper "Chinese Relation Extraction with Multi-Grained Information and External Linguistic Knowledge"
luanyi/DyGIE
apple/ml-mkqa
We introduce MKQA, an open-domain question answering evaluation set comprising 10k question-answer pairs aligned across 26 typologically diverse languages (260k question-answer pairs in total). The goal of this dataset is to provide a challenging benchmark for question answering quality across a wide set of languages. For details, please refer to our paper, "MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering".
timoschick/fewglue
This repository contains the FewGLUE dataset for few-shot natural language understanding.
CLUEbenchmark/DataCLUE
DataCLUE: A data-centric NLP benchmark and toolkit
chujiezheng/ChID-Dataset
ChID: A Large-scale Chinese IDiom Dataset for Cloze Test
thunlp/MNRE
The code and data for ACL2017 paper "Neural Relation Extraction with Multi-lingual Attention"
mia-workshop/MIA-Shared-Task-2022
An official repository for MIA 2022 (NAACL 2022 Workshop) Shared Task on Cross-lingual Open-Retrieval Question Answering.
c-box/LANKA
Code for ACL2021 long paper: Knowledgeable or Educated Guess? Revisiting Language Models as Knowledge Bases
zhangdongxu/kbp37
relation classification dataset
jzbjyb/X-FACTR
boun-tabi/RELX
The RELX Dataset and Matching the Multilingual Blanks for Cross-Lingual Relation Classification, EMNLP-Findings 2020.
SUDA-LA/wist
[ACL'21] Data for "An In-depth Study on Internal Structure of Chinese Words".
WangYuxuan93/DepAttacker
The code for the ACL Findings paper "A Closer Look into the Robustness of Neural Dependency Parsers with Better Adversarial Examples"