Pinned Repositories
data-science-interviews
Data science interview questions and answers
bpe-dropout
Class allows you to create BPE tokens with dropout or not. Implements Sentencepeace lib with easy fit predict way.
cherk
cian
Parser of cian.ru site and modeling classification to devide photos of the resource into telegramm channel.
data-science-interviews
Data science interview questions and answers
newsbot
News Bot allows you to choose and personalize news in one flow.
stratify
Code allows you to strtify dataframe in different convenient ways. ReadMe shows how.
tfidf
TF-IDF is a method, which allows get matrix tokens for your list of text. It creates n * m matrix, where n - quantity of texts and m - quantity of unique words - tokens in texts vocabulary. IMPORTANT: This .py is my own variation of tfidf and doesnt duplicate existing tfidf versions. So it could
tinkoff-api
tinkoff-api
word2vec
Word2Vec is an algorythm of word representation in embeddings. This repo contains a code about word2vec only.
pingmehard's Repositories
pingmehard/bpe-dropout
Class allows you to create BPE tokens with dropout or not. Implements Sentencepeace lib with easy fit predict way.
pingmehard/cherk
pingmehard/cian
Parser of cian.ru site and modeling classification to devide photos of the resource into telegramm channel.
pingmehard/data-science-interviews
Data science interview questions and answers
pingmehard/newsbot
News Bot allows you to choose and personalize news in one flow.
pingmehard/stratify
Code allows you to strtify dataframe in different convenient ways. ReadMe shows how.
pingmehard/tfidf
TF-IDF is a method, which allows get matrix tokens for your list of text. It creates n * m matrix, where n - quantity of texts and m - quantity of unique words - tokens in texts vocabulary. IMPORTANT: This .py is my own variation of tfidf and doesnt duplicate existing tfidf versions. So it could
pingmehard/tinkoff-api
tinkoff-api
pingmehard/word2vec
Word2Vec is an algorythm of word representation in embeddings. This repo contains a code about word2vec only.