Chen-Wang-CUHK's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Anduin2017/HowToCook
程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
fxsjy/jieba
结巴中文分词
microsoft/JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
lancopku/pkuseg-python
pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation
goto456/stopwords
中文常用停用词表(哈工大停用词表、百度停用词表等)
princeton-nlp/SimCSE
[EMNLP 2021] SimCSE: Simple Contrastive Learning of Sentence Embeddings https://arxiv.org/abs/2104.08821
dongrixinyu/JioNLP
中文 NLP 预处理、解析工具包,准确、高效、易用 A Chinese NLP Preprocessing & Parsing Package www.jionlp.com
huawei-noah/Pretrained-Language-Model
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
LDNOOBW/List-of-Dirty-Naughty-Obscene-and-Otherwise-Bad-Words
List of Dirty, Naughty, Obscene, and Otherwise Bad Words
425776024/nlpcda
一键中文数据增强包 ; NLP数据增强、bert数据增强、EDA:pip install nlpcda
920232796/bert_seq2seq
pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。
BaiduSpider/BaiduSpider
BaiduSpider,一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
xcfcode/Summarization-Papers
Summarization Papers
THUDM/P-tuning
A novel method to tune language models. Codes and datasets for paper ``GPT understands, too''.
dongrixinyu/chinese_keyphrase_extractor
An off-the-shelf tool for Chinese Keyphrase Extraction 一个快速从中文里抽取关键短语的工具,仅占35M内存 www.jionlp.com
renmada/t5-pegasus-pytorch
RUCAIBox/Top-conference-paper-list
A collection of classified and organized top conference paper list.
harvardnlp/neural-template-gen
Diego999/py-rouge
Full Python implementation of the ROUGE metric, producing same results as in the official perl implementation.
NJUNLP/TOWE
Code and data for "Target-oriented Opinion Words Extraction with Target-fused Neural Sequence Labeling" (NAACL2019)
sosuperic/MeanSum
abrazinskas/Copycat-abstractive-opinion-summarizer
ACL 2020 Unsupervised Opinion Summarization as Copycat-Review Generation
IsakZhang/Generative-ABSA
zcgzcgzcg1/MediaSum
MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization
NKU-IIPLab/BMRC
stangelid/qt
BugOMan/summary_generator
Summarization with Pointer-Generator Networks
lipiji/Summarization-Papers
Summarization Papers