gensim

There are 634 repositories under gensim topic.

  • gensim

    piskvorky/gensim

    Topic Modelling for Humans

    Language:Python15.6k4301.8k4.4k
  • text-analytics-with-python

    dipanjanS/text-analytics-with-python

    Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

    Language:Jupyter Notebook1.6k11914839
  • plasticityai/magnitude

    A fast, efficient universal vector embedding utility package.

    Language:Python1.6k3784119
  • explosion/sense2vec

    🦆 Contextually-keyed word vectors

    Language:Python1.6k49114238
  • nlp-in-practice

    kavgan/nlp-in-practice

    Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

    Language:Jupyter Notebook1.1k518787
  • piskvorky/gensim-data

    Data repository for pretrained NLP models and NLP corpora.

    Language:Python9763943131
  • oborchers/Fast_Sentence_Embeddings

    Compute Sentence Embeddings Fast!

    Language:Jupyter Notebook618125583
  • zake7749/word2vec-tutorial

    中文詞向量訓練教學

    Language:Python516194166
  • ThoughtRiver/lmdb-embeddings

    Fast word vectors with little memory usage in Python

    Language:Python41419330
  • bakrianoo/aravec

    AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.

    Language:Jupyter Notebook390321578
  • 5hirish/adam_qas

    ADAM - A Question Answering System. Inspired from IBM Watson

    Language:Python3573026106
  • AICoE/log-anomaly-detector

    Log Anomaly Detection - Machine learning to detect abnormal events logs

    Language:Jupyter Notebook31721178129
  • 30lm32/ml-projects

    ML based projects such as Spam Classification, Time Series Analysis, Text Classification using Random Forest, Deep Learning, Bayesian, Xgboost in Python

  • GEMSEC

    benedekrozemberczki/GEMSEC

    The TensorFlow reference implementation of 'GEMSEC: Graph Embedding with Self Clustering' (ASONAM 2019).

    Language:Python252151450
  • davidberenstein1957/concise-concepts

    This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entity scoring.

    Language:Python24072815
  • devmount/GermanWordEmbeddings

    Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tensorflow.

    Language:Jupyter Notebook234121550
  • akoksal/Turkish-Word2Vec

    Pre-trained Word2Vec Model for Turkish

    Language:Python21116632
  • Splitter

    benedekrozemberczki/Splitter

    A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).

    Language:Python21111844
  • giacbrd/ShallowLearn

    An experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.

    Language:Python198182630
  • webvectors

    akutuzov/webvectors

    Web-ify your word2vec: framework to serve distributional semantic models online

    Language:Python197123749
  • role2vec

    benedekrozemberczki/role2vec

    A scalable Gensim implementation of "Learning Role-based Graph Embeddings" (IJCAI 2018).

    Language:Python16512332
  • platisd/duplicate-code-detection-tool

    A simple Python3 tool to detect similarities between files within a repository

    Language:Python16341730
  • PrashantRanjan09/WordEmbeddings-Elmo-Fasttext-Word2Vec

    Using pre trained word embeddings (Fasttext, Word2Vec)

    Language:Python1585731
  • MUSAE

    benedekrozemberczki/MUSAE

    The reference implementation of "Multi-scale Attributed Node Embedding". (Journal of Complex Networks 2021)

    Language:Python1555822
  • Stock-Prediction

    alisonmitchell/Stock-Prediction

    Technical and sentiment analysis to predict the stock market with machine learning models based on historical time series data and news article sentiment collected using APIs and web scraping.

    Language:Jupyter Notebook1335431
  • nlp_workshop_odsc_europe20

    dipanjanS/nlp_workshop_odsc_europe20

    Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and Topic Models.

    Language:Jupyter Notebook13310065
  • diff2vec

    benedekrozemberczki/diff2vec

    Reference implementation of Diffusion2Vec (Complenet 2018) built on Gensim and NetworkX.

    Language:Python1256220
  • eellak/nlpbuddy

    A text analysis application for performing common NLP tasks through a web dashboard interface and an API

    Language:HTML124211528
  • ibrahimsharaf/doc2vec

    :notebook: Long(er) text representation and classification using Doc2Vec embeddings

    Language:Python10691243
  • walklets

    benedekrozemberczki/walklets

    A lightweight implementation of Walklets from "Don't Walk Skip! Online Learning of Multi-scale Network Embeddings" (ASONAM 2017).

    Language:Python1029222
  • roboreport/doc2vec-api

    document embedding and machine learning script for beginners

    Language:Python9226036
  • aniass/Product-Categorization-NLP

    Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).

    Language:Jupyter Notebook853026
  • philipperemy/japanese-words-to-vectors

    Word2vec (word to vectors) approach for Japanese language using Gensim and Mecab.

    Language:Python844319
  • apachecn/gensim-doc-zh

    gensim 中文文档

    Language:JavaScript838026
  • johndpope/hcn

    Hybrid Code Networks https://arxiv.org/abs/1702.03274

    Language:Python817021
  • ansegura7/NLP

    Free hands-on course with the implementation (in Python) and description of several Natural Language Processing (NLP) algorithms and techniques, on several modern platforms and libraries.

    Language:HTML793015