drob-xx's Stars
psf/black
The uncompromising Python code formatter
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
duckdb/duckdb
DuckDB is an analytical in-process SQL database management system
Delgan/loguru
Python logging made (stupidly) simple
apache/arrow
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
librosa/librosa
Python library for audio and music analysis
PyCQA/isort
A Python utility / library to sort imports.
MaartenGr/BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
tyiannak/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
JustAnotherArchivist/snscrape
A social networking service scraper in Python
nmslib/nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
aws-samples/aws-machine-learning-university-accelerated-nlp
Machine Learning University: Accelerated Natural Language Processing Class
MilaNLProc/contextualized-topic-models
A python package to run contextualized topic modeling. CTMs combine contextualized embeddings (e.g., BERT) with topic models to get coherent topics. Published at EACL and ACL 2021 (Bianchi et al.).
adjidieng/ETM
Topic Modeling in Embedding Spaces
bminixhofer/wtpsplit
Code for Where's the Point? Self-Supervised Multilingual Punctuation-Agnostic Sentence Segmentation
cohere-ai/notebooks
Code examples and jupyter notebooks for the Cohere Platform
zll17/Neural_Topic_Models
Implementation of topic models based on neural network approaches.
wilsonjr/humap
Hierarchical Uniform Manifold Approximation and Projection
cohere-ai/sandbox-topically
Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.
mim-solutions/bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation allows fine-tuning.
ieriii/spacy-annotator
Spacy NER annotator using ipywidgets
yumeng5/TopClus
[WWW 2022] Topic Discovery via Latent Space Clustering of Pretrained Language Model Representations
GGiecold-zz/Cluster_Ensembles
A package for combining multiple partitions into a consolidated clustering. The combinatorial optimization problem of obtaining such a consensus clustering is reformulated in terms of approximation algorithms for graph or hyper-graph partitioning.
gagolews/genieclust
Genie: Fast and Robust Hierarchical Clustering with Noise Point Detection - in Python and R
znhy1024/HEARD
MilaNLProc/bertlang
A web interface to understand language-specific BERT-models
GU-DataLab/gdtm
A Python Package containing wrappers for topic models, including TND, NLDA, GTM, and temporal topic-noise models.
dkorenci/topic_coverage
Code of the experiments from the article "A Topic Coverage Approach to Evaluation of Topic Models"