Apollo2Mars's Stars
chinese-poetry/chinese-poetry
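The most comprehensive database of Chinese poetry 🧶 Covers nearly 14,000 poets of the Tang and Song dynasties, with close to 55,000 Tang poems and 260,000 Song poems, plus 21,050 ci poems by 1,564 ci poets of the Song period.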
pytorch/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
fangzesheng/free-api
A collection of free API services, acting as a porter of APIs
PaddlePaddle/PaddleNLP
👑 Easy-to-use and powerful NLP and LLM library with 🤗 Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including 🗂Text Classification, 🔍 Neural Search, ❓ Question Answering, ℹ️ Information Extraction, 📄 Document Intelligence, 💌 Sentiment Analysis etc.
manticoresoftware/manticoresearch
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
InsaneLife/ChineseNLPCorpus
Chinese natural language processing datasets, material for everyday experiments. Additions and pull requests are welcome.
ThilinaRajapakse/simpletransformers
Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
letiantian/TextRank4ZH
🌳 Automatically extracts keywords and summaries from Chinese text
huawei-noah/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
brightmart/roberta_zh
Pretrained RoBERTa models for Chinese
NLP-LOVE/Introduction-NLP
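Detailed notes on "Introduction to Natural Language Processing", the new book by the author of HanLP! A conscientious piece of work: rather than a dry list of formulas, the book explains algorithms and models in plain, accessible language. Starting from basic concepts, it introduces step by step the algorithmic principles and engineering implementations behind several popular problems: Chinese word segmentation, part-of-speech tagging, named entity recognition, information extraction, text clustering, text classification, and syntactic parsing.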
patil-suraj/question_generation
Neural question generation using transformers
eduosi/district
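**Data on provinces/autonomous regions/municipalities, cities/autonomous prefectures, and districts/counties/banners, including name, pinyin, pinyin initials, administrative code, and area code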
liucongg/NLPDataSet
A record of some datasets I have compiled
geek-ai/Texygen
A text generation benchmarking platform
ZhuiyiTechnology/simbert
A BERT for retrieval and generation
thunlp/FewRel
A Large-Scale Few-Shot Relation Extraction Dataset
NTMC-Community/MatchZoo-py
Facilitating the design, comparison and sharing of deep text matching models.
threelittlemonkeys/lstm-crf-pytorch
LSTM-CRF in PyTorch
THUNLP-MT/TG-Reading-List
A text generation reading list maintained by Tsinghua Natural Language Processing Group.
dongwookim-ml/python-topic-model
Implementation of various topic models
Hyperparticle/udify
A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology tags, lemmas, and dependency trees.
JasonForJoy/Leaderboards-for-Multi-Turn-Response-Selection
Leaderboards, Datasets and Papers for Multi-Turn Response Selection in Retrieval-Based Chatbots
google-deepmind/lamb
LAnguage Modelling Benchmarks
kedz/nnsum
An extractive neural network text summarization library for the EMNLP 2018 paper "Content Selection in Deep Learning Models of Summarization" (https://arxiv.org/abs/1810.12343).
SVAIGBA/TwASP
UniversalDependencies/UD_Chinese-GSDSimp
Conversion of UD_Chinese-GSD to simplified Chinese characters.
JayKumarr/OSDM
Code for the ACL conference paper "An Online Semantic-enhanced Dirichlet Model for Short Text Stream Clustering"