topic-modeling
There are 1,890 repositories under the topic-modeling topic.
ddbourgin/numpy-ml
Machine learning, in numpy
piskvorky/gensim
Topic Modelling for Humans
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
baidu/Familia
A Toolkit for Industrial Topic Modeling
JasonKessler/scattertext
Beautiful visualizations of how language differs among document types.
ContextLab/hypertools
A Python toolbox for gaining geometric insights into high-dimensional data
nomic-ai/nomic
Interact, analyze and structure massive text, image, embedding, audio and video datasets
owlbarn/owl
Owl - OCaml Scientific Computing @ https://ocaml.xyz
MilaNLProc/contextualized-topic-models
A Python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
dselivanov/text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
MIND-Lab/OCTIS
OCTIS: Comparing Topic Models is Simple! A Python package to optimize and evaluate topic models (accepted at the EACL 2021 demo track)
bigartm/bigartm
Fast topic modeling platform
gregversteeg/corex_topic
Hierarchical unsupervised and semi-supervised topic models for sparse count data with CorEx
stepthom/text_mining_resources
Resources for learning about Text Mining and Natural Language Processing
bab2min/tomotopy
Python package of Tomoto, the Topic Modeling Tool
cpsievert/LDAvis
R package for web-based interactive topic model visualization.
dongrixinyu/chinese_keyphrase_extractor
An off-the-shelf tool for Chinese keyphrase extraction. It extracts key phrases from Chinese text quickly and uses only 35M of memory. www.jionlp.com
vi3k6i5/GuidedLDA
Semi-supervised guided topic model with custom GuidedLDA
stephenhky/PyShortTextCategorization
Various Algorithms for Short Text Mining
jmartinezheras/2018-MachineLearning-Lectures-ESA
Machine Learning Lectures at the European Space Agency (ESA) in 2018
ruidan/Unsupervised-Aspect-Extraction
Code for the ACL 2017 paper "An unsupervised neural attention model for aspect extraction"
primaryobjects/lda
LDA topic modeling for node.js
chtmp223/topicGPT
TopicGPT: A Prompt-Based Framework for Topic Modeling (NAACL'24)
yangliuy/LDAGibbsSampling
Open Source Package for Gibbs Sampling of LDA
cohere-ai/sandbox-topically
Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.
dice-group/Palmetto
Palmetto is a quality measuring tool for topics
BobXWu/TopMost
A Topic Modeling System Toolkit
MaartenGr/Concept
Concept Modeling: Topic Modeling on Images and Text
WZBSocialScienceCenter/tmtoolkit
Text Mining and Topic Modeling Toolkit for Python with parallel processing power
datquocnguyen/LFTM
Improving topic models LDA and DMM (one-topic-per-document model for short texts) with word embeddings (TACL 2015)
maxent-ai/converse
Conversational text Analysis using various NLP techniques
binoydutt/Resume-Job-Description-Matching
The purpose of this project was to defeat the applicant tracking systems that most organizations use to filter out resumes. To achieve this, I had to come up with a universal score that helps applicants understand how well their resume currently matches a job description. The project followed these steps: 1) Job descriptions were collected from the Glassdoor website using Selenium, as other scrapers failed. 2) PDF resumes were parsed using PDFMiner. 3) A vector representation of each job description was created: word2vec was used to embed words in a 300-dimensional vector space, with each document represented as a list of word vectors. 4) Each word was given a weight, taken from its TF-IDF score, to handle words that are specific to a few job descriptions. 5) Important skill-related words were given higher weights, and an overall mean for each job description was obtained from the product of each word vector and its TF-IDF score. 6) Cosine similarity was used to measure the similarity between the job description and the resume. 7) Various natural language processing techniques were identified to suggest improvements to the resume that could help increase the match score.
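The weighting and similarity steps in this description (a TF-IDF-weighted mean of word vectors, then cosine similarity) can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the function names, the three-dimensional toy embeddings standing in for 300-dimensional word2vec vectors, the sample token lists, and the smoothed IDF formula are all assumptions made for the example.

```python
import math
from collections import Counter

import numpy as np


def tfidf_weights(docs):
    """Return one {word: tf-idf} dict per tokenized document.

    Uses a smoothed IDF, ln((1 + N) / (1 + df)) + 1, which is one
    common variant and an assumption of this sketch.
    """
    n_docs = len(docs)
    doc_freq = Counter(word for doc in docs for word in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            word: (count / len(doc))
            * (math.log((1 + n_docs) / (1 + doc_freq[word])) + 1.0)
            for word, count in counts.items()
        })
    return weights


def weighted_mean_vector(doc_weights, embeddings, dim):
    """TF-IDF-weighted average of the word vectors of one document."""
    total = np.zeros(dim)
    weight_sum = 0.0
    for word, weight in doc_weights.items():
        if word in embeddings:  # skip out-of-vocabulary words
            total += weight * embeddings[word]
            weight_sum += weight
    return total / weight_sum if weight_sum > 0 else total


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (0.0 if either is zero)."""
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0


# Hypothetical toy embeddings; a real pipeline would load word2vec vectors.
embeddings = {
    "python": np.array([1.0, 0.0, 0.0]),
    "sql": np.array([0.8, 0.2, 0.0]),
    "cooking": np.array([0.0, 0.0, 1.0]),
    "baking": np.array([0.0, 0.1, 0.9]),
}

job = ["python", "sql", "python"]
resume_a = ["python", "sql"]
resume_b = ["cooking", "baking"]

weights = tfidf_weights([job, resume_a, resume_b])
vectors = [weighted_mean_vector(w, embeddings, dim=3) for w in weights]

score_a = cosine_similarity(vectors[0], vectors[1])
score_b = cosine_similarity(vectors[0], vectors[2])
print(f"resume_a: {score_a:.3f}, resume_b: {score_b:.3f}")
```

With these toy inputs, the resume that shares vocabulary with the job description should score higher than the unrelated one. The extra boost for skill-related words mentioned in step 5 is not modeled here; it could be added as a per-word multiplier on the TF-IDF weight.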
charlesdedampierre/BunkaTopics
🗺️ Data Cleaning and Textual Data Visualization 🗺️
osainz59/Ask2Transformers
A Framework for Textual Entailment based Zero Shot text classification
yuewang-cuhk/TAKG
The official implementation of ACL 2019 paper "Topic-Aware Neural Keyphrase Generation for Social Media Language"