text-classification

There are 4117 repositories under text-classification topic.

  • HanLP

    hankcs/HanLP

    中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理

    Language:Python34.7k1.1k1.4k10.5k
  • explosion/spaCy

    💫 Industrial-strength Natural Language Processing (NLP) in Python

    Language:Python31.3k5625.7k4.5k
  • brightmart/nlp_chinese_corpus

    大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

  • brightmart/text_classification

    all kinds of text classification models and more with deep learning

    Language:Python7.9k2981242.6k
  • microsoft/nlp-recipes

    Natural Language Processing Best Practices & Examples

    Language:Python6.4k187211917
  • CLUEbenchmark/CLUEDatasetSearch

    搜索所有中文NLP数据集,附常用英文NLP数据集

    Language:Python4.3k6112622
  • gaussic/text-classification-cnn-rnn

    CNN-RNN中文文本分类,基于TensorFlow

    Language:Python4.2k1081581.5k
  • simpletransformers

    ThilinaRajapakse/simpletransformers

    Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI

    Language:Python4.2k641.1k726
  • spark-nlp

    JohnSnowLabs/spark-nlp

    State of the Art Natural Language Processing

    Language:Scala3.9k99896722
  • snipsco/snips-nlu

    Snips Python library to extract meaning from text

    Language:Python3.9k132263511
  • catalyst-team/catalyst

    Accelerated deep learning R&D

    Language:Python3.3k44355393
  • fastnlp/fastNLP

    fastNLP: A Modularized and Extensible NLP Framework. Currently still in incubation.

    Language:Python3.1k79218450
  • x4nth055/pythoncode-tutorials

    The Python Code Tutorials

    Language:Jupyter Notebook2.8k104572k
  • BrikerMan/Kashgari

    Kashgari is a production-level NLP Transfer learning framework built on top of tf.keras for text-labeling and text-classification, includes Word2Vec, BERT, and GPT2 Language Embedding.

    Language:Python2.4k64377437
  • embeddings-benchmark/mteb

    MTEB: Massive Text Embedding Benchmark

    Language:Jupyter Notebook2.4k19936354
  • HarderThenHarder/transformers_tasks

    ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

    Language:Jupyter Notebook2.3k1689397
  • EasyNLP

    alibaba/EasyNLP

    EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

    Language:Python2.1k36130255
  • xlang-ai/instructor-embedding

    [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

    Language:Python1.9k18112144
  • kk7nc/Text_Classification

    Text Classification Algorithms: A Survey

    Language:Python1.8k727543
  • yongzhuo/Keras-TextClassification

    中文长文本分类、短句子分类、多标签分类、两句子相似度(Chinese Text Classification of Keras NLP, multi-label classify, or sentence classify, long or short),字词句向量嵌入层(embeddings)和网络层(graph)构建基类,FastText,TextCNN,CharCNN,TextRNN, RCNN, DCNN, DPCNN, VDCNN, CRNN, Bert, Xlnet, Albert, Attention, DeepMoji, HAN, 胶囊网络-CapsuleNet, Transformer-encode, Seq2seq, SWEM, LEAM, TextGCN

    Language:Python1.8k3388404
  • text-analytics-with-python

    dipanjanS/text-analytics-with-python

    Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

    Language:Jupyter Notebook1.7k11814846
  • jasonwei20/eda_nlp

    Data augmentation for NLP, presented at EMNLP 2019

    Language:Python1.6k3539317
  • Delta-ML/delta

    DELTA is a deep learning based natural language and speech processing platform. LF AI & DATA Projects: https://lfaidata.foundation/projects/delta/

    Language:Python1.6k6475288
  • bfelbo/DeepMoji

    State-of-the-art deep learning model for analyzing sentiment, emotion, sarcasm etc.

    Language:Python1.5k5251314
  • yongzhuo/nlp_xiaojiang

    自然语言处理(nlp),小姜机器人(闲聊检索式chatbot),BERT句向量-相似度(Sentence Similarity),XLNET句向量-相似度(text xlnet embedding),文本分类(Text classification), 实体提取(ner,bert+bilstm+crf),数据增强(text augment, data enhance),同义句同义词生成,句子主干提取(mainpart),中文汉语短文本相似度,文本特征工程,keras-http-service调用

    Language:Python1.5k4015395
  • microsoft/NeuronBlocks

    NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego

    Language:Python1.5k6225195
  • refinery

    code-kern-ai/refinery

    The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.

    Language:Python1.4k1820571
  • yao8839836/text_gcn

    Graph Convolutional Networks for Text Classification. AAAI 2019

    Language:Python1.4k24141438
  • lyeoni/nlp-tutorial

    A list of NLP(Natural Language Processing) tutorials

    Language:Jupyter Notebook1.4k4815264
  • zhanlaoban/EDA_NLP_for_Chinese

    An implement of the paper of EDA for Chinese corpus.中文语料的EDA数据增强工具。NLP数据增强。论文阅读笔记。

    Language:Python1.4k1619240
  • 920232796/bert_seq2seq

    pytorch实现 Bert 做seq2seq任务,使用unilm方案,现在也可以做自动摘要,文本分类,情感分析,NER,词性标注等任务,支持t5模型,支持GPT2进行文章续写。

    Language:Python1.3k1267209
  • charlesXu86/Chatbot_CN

    基于金融-司法领域(兼有闲聊性质)的聊天机器人,其中的主要模块有信息抽取、NLU、NLG、知识图谱等,并且利用Django整合了前端展示,目前已经封装了nlp和kg的restful接口

  • Hello-SimpleAI/chatgpt-comparison-detection

    Human ChatGPT Comparison Corpus (HC3), Detectors, and more! 🔥

    Language:Python1.3k2626120
  • Tongjilibo/bert4torch

    An elegent pytorch implement of transformers

    Language:Python1.3k15153165
  • obsei

    obsei/obsei

    Obsei is a low code AI powered automation tool. It can be used in various business flows like social listening, AI based alerting, brand image analysis, comparative study and more .

    Language:Python1.3k30120166
  • explosion/spacy-llm

    🦙 Integrating LLMs into structured NLP pipelines

    Language:Python1.2k228394