selfcs's Stars
Separius/awesome-sentence-embedding
A curated list of pretrained sentence and word embedding models
my8100/scrapydweb
Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.
Socialbird-AILab/BERT-Classification-Tutorial
Embedding/Chinese-Word-Vectors
100+ pretrained Chinese word vectors
mahmoudparsian/pyspark-tutorial
PySpark-Tutorial provides basic algorithms using PySpark
gcunhase/NLPMetrics
Python code for various NLP metrics
ctr4si/MMN
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks. In NAACL-HLT, 2019 (oral).
Smilexuhc/Data-Competition-TopSolution
Data competition top solutions: an organized collection of open-source top solutions from data competitions
0xMJ/AI-Security-Learning
Study materials on security data science and algorithms, collected for the author's own learning
budaLi/-Learning-materials-
Assorted study materials, including some Baidu Cloud video links and PDF files (collected and reposted)
doccano/doccano
Open source annotation tool for machine learning practitioners.
kk7nc/Text_Classification
Text Classification Algorithms: A Survey
blmoistawinde/HarvestText
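Text mining and preprocessing toolkit (text cleaning, new word discovery, sentiment analysis, entity recognition and linking, keyword extraction, knowledge extraction, syntactic parsing, etc.), using unsupervised or weakly supervised methods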
Roshanson/TextInfoExp
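Natural language processing experiments (sougou dataset): TF-IDF, text classification, clustering, word vectors, sentiment recognition, relation extraction, etc.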
mukund109/word-mesh
A context-preserving word cloud generator
Kyubyong/bert_ner
NER with BERT
asyml/texar
Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
terrifyzhao/bert-utils
Generate sentence vectors with BERT in one line of code; use BERT for text classification and text similarity computation
shibing624/pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and Qwen2.5 to error-correction scenarios and works out of the box.
lipiji/neural-summ-cnndm-pytorch
Neural abstractive summarization (seq2seq + copy (or pointer network) + coverage) in PyTorch on CNN/Daily Mail
sebastianruder/NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
ymfa/seq2seq-summarizer
Pointer-generator reinforced seq2seq summarization in PyTorch
ownthink/KnowledgeGraph
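Free download of the largest-ever knowledge graph dataset, with 140 million entries. A general-purpose knowledge graph that merges more than 25 million entities and contains entity-attribute relations on the order of hundreds of millions.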
jiqizhixin/ML-Tutorial-Experiment
Coding the Machine Learning Tutorial for Learning to Learn
DQinYuan/chinese_province_city_area_mapper
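A Python module for extracting provinces, cities, and districts from Simplified Chinese strings, with support for mapping, validation, and simple plotting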
leeguandong/Interview-code-practice-python
Interview questions