ziganluo1997

lovebugs

ziganluo1997's Stars

StatguyUser/TextFeatureSelection
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Language:Python515
CLaraRR/autoencoder_practice
AutoEncoder implements by keras. Including AE, DAE, DAE_CNN, VAE, VAE_CNN, CVAE, Sparse AE, Stacked DAE.
Language:Python3910
kellenf/TSP_collection
TSP算法全复现：遗传(GA)、粒子群(PSO)、模拟退火(SA)、禁忌搜索(ST)、蚁群算法(ACO)、自自组织神经网络(SOM)
Language:Python707184
src-d/kmcuda
Large scale K-means and K-nn implementation on NVIDIA GPU / CUDA
Language:Jupyter Notebook797146
subhadarship/kmeans_pytorch
kmeans using PyTorch
Language:Jupyter Notebook47275
gugarosa/opytimizer
🐦 Opytimizer is a Python library consisting of meta-heuristic optimization algorithms.
Language:Python59940
ckeshava/lbpso-text-clustering
Language:Python1
xiaopeng-liao/DEC_pytorch
Language:Jupyter Notebook219
piiswrong/dec
Language:Jupyter Notebook456138
zhoushengisnoob/DeepClustering
Methods and Implements of Deep Clustering
2.8k415
betashort/DeepClusterings
Language:Jupyter Notebook2
Deepayan137/DeepClustering
A pytorch implementation of the paper Unsupervised Deep Embedding for Clustering Analysis.
Language:Python12723
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Language:Python133k26.5k
jaumpedro214/Textual-Clustering-Doc2Vec-KMeans
This repository contains a step-by-step guide to use Doc2Vec vetorization process with Gensim Library and execute Clustering with KMeans
Language:Jupyter Notebook1
amazon-science/sccl
Pytorch implementation of Supporting Clustering with Contrastive Learning, NAACL 2021
Language:Python28856
rajeshmore1/Self-Project-K-means-clustering-and-Hierarchical-clustering
Unsupervised Learning.For this clustering project on text, you will use a dataset named 20newsgroup. This is available in sklearn.datasets. You can use the code given below in the code cells to fetch the data. Next you need to run a TFIDFVectorizer on the sentences to obtain a document-word sparse matrix. Use this array as your X . Once you have got your array, you can apply different clustering techniques such as K-Means clustering and Hierarchical clustering to obtain meaningful clusters. Check if these clusters seem relevant and well separated. Finally you can use dimensionality reduction technqiues such as PCA or t-SNE(you can read about it and use it straight away) to come up with two dimensional visualization of these clusters.
Language:Jupyter Notebook3
Nagesh-Cheripally/Venmo-Transaction-Clustering
Text analytics, User spending profile analytics, Emoji analytics, text dictionary, user’s spending over her lifetime
Language:Jupyter Notebook3
imane97-NLP/Document-clustering-in-The-context-of-Big-Data
Text clustering is a process that involves Natural Language Processing (NLP) and the use of a clustering algorithm. This method of finding groups in unstructured texts can be applied in many different segments, such as feedback analysis, research segmentation, etc.
Language:Jupyter Notebook1
TarunAggrawl/Model-Based-Clustering
This is a improved short text clustering model from state of the art algorithm
Language:Jupyter Notebook1
kypexin/text-clustering
Short sentences K-Means clustering
Language:Jupyter Notebook2
sujitjean/Clsutering-__-Kmeans-Allagglomerative-hierarchical-clustering
step 1: Choose any vectorizer (data matrix) that you have worked in any of the assignments, and got the best AUC value. step 2: Choose any of the feature selection/reduction algorithms ex: selectkbest features, pretrained word vectors, model based feature selection etc and reduce the number of features to 5k features. step 3: Apply all three kmeans, Agglomerative clustering, DBSCAN K-Means Clustering: ● Find the best ‘k’ using the elbow-knee method (plot k vs inertia_) Agglomerative Clustering: ● Apply agglomerative algorithm and try a different number of clusters like 2,5 etc. ● As this is very computationally expensive, take 5k datapoints only to perform hierarchical clustering because they do take a considerable amount of time to run. DBSCAN Clustering: ● Find the best ‘eps’ using the elbow-knee method. ● Take 5k datapoints only. step 4: Summarize each cluster by manually observing few points from each cluster. step 5: You need to plot the word cloud with essay text for each cluster for each of algorithms mentioned in step 3.
Language:Jupyter Notebook23
deyanarajib/DM_Hybrid-Reduction-Dimension-on-Clustering-Text-of-English-Hadith-Translation
Text clustering on English Hadith Translation using Hybrid Feature Extraction
Language:Python2
deyanarajib/DM_Document-Clustering-by-Adding-Metadata-Using-the-COATES-Algorithm
Improve Text Clustering by Adding Side Information using COATES Algorithm
Language:Python1
kennedykwangari/Natural-Language-Processing-with-Python
This repository contains data sets and code snippets on how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python!.
Language:Jupyter Notebook31
pokarats/gsdmm
Gibbs Sampling Dirichlet Multinomial Model (GSDMM) for Short-Text Clustering
Language:Python81
kitaharatomoyo/ShortTextClustering
a complement of the paper"Short Text Clustering via Convolutional Neural Networks"
Language:Python62
Kagigz/french-text-clustering
K-means clustering of French text using the CamemBERT model.
Language:Jupyter Notebook102
fw192020/clairvoyant_clustering
Analyzing text and clustering companies with NLP (SpaCy and CorEx) and unsupervised machine learning (K-Means and PCA)
Language:Jupyter Notebook1
hasitha087/LSTMtextClassifier
LSTM based text classifier using GloVe vector embedding. Model training runs on Azure Machine Learning workspace as an Experiment using compute cluster
Language:Python1
heatherlogan/email-transfer-learning
Masters project using BERT + XLNET for text classification, sentiment analysis and clustering/topic modelling,
Language:Python2

ziganluo1997

ziganluo1997's Stars

StatguyUser/TextFeatureSelection

CLaraRR/autoencoder_practice

kellenf/TSP_collection

src-d/kmcuda

subhadarship/kmeans_pytorch

gugarosa/opytimizer

ckeshava/lbpso-text-clustering

xiaopeng-liao/DEC_pytorch

piiswrong/dec

zhoushengisnoob/DeepClustering

betashort/DeepClusterings

Deepayan137/DeepClustering

huggingface/transformers

jaumpedro214/Textual-Clustering-Doc2Vec-KMeans

amazon-science/sccl

rajeshmore1/Self-Project-K-means-clustering-and-Hierarchical-clustering

Nagesh-Cheripally/Venmo-Transaction-Clustering

imane97-NLP/Document-clustering-in-The-context-of-Big-Data

TarunAggrawl/Model-Based-Clustering

kypexin/text-clustering

sujitjean/Clsutering-__-Kmeans-Allagglomerative-hierarchical-clustering

deyanarajib/DM_Hybrid-Reduction-Dimension-on-Clustering-Text-of-English-Hadith-Translation

deyanarajib/DM_Document-Clustering-by-Adding-Metadata-Using-the-COATES-Algorithm

kennedykwangari/Natural-Language-Processing-with-Python

pokarats/gsdmm

kitaharatomoyo/ShortTextClustering

Kagigz/french-text-clustering

fw192020/clairvoyant_clustering

hasitha087/LSTMtextClassifier

heatherlogan/email-transfer-learning