Pinned Repositories
ALBERT
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
chinese-poetry
最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
Chinese_segment_augment
python3实现互信息和左右熵的新词发现
credtweak
Password attack and defense using deep neural networks
DeepTweets
Fine tuned GPT-2 with tweets from Black Twitter in attempt to auto generate funny tweets. Idea inspired by Lex Fridman
Don-AI-ld-Trump
gpt2 Colab Notebook and Dataset + Model to Generate Fake Trump Tweets
Fake-Trump-Tweet
A model to generate Fake Trump Tweets, which says "RealDonaldTrump" a lot!!!
fake_trump_tweet
Tweet Classifiers using Multinomial Naive Bayes and LSTM + Tweet Generators using Markov Chain and LSTM = the Most Trumpy (fake) Tweets
fucking-algorithm
手把手撕LeetCode题目,扒各种算法套路的裤子。English version supported! Crack LeetCode, not only how, but also why.
GloVe
GloVe model for distributed word representation
zhusy09's Repositories
zhusy09/ALBERT
ALBERT: A Lite BERT for Self-supervised Learning of Language Representations
zhusy09/chinese-poetry
最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
zhusy09/Chinese_segment_augment
python3实现互信息和左右熵的新词发现
zhusy09/credtweak
Password attack and defense using deep neural networks
zhusy09/DeepTweets
Fine tuned GPT-2 with tweets from Black Twitter in attempt to auto generate funny tweets. Idea inspired by Lex Fridman
zhusy09/Don-AI-ld-Trump
gpt2 Colab Notebook and Dataset + Model to Generate Fake Trump Tweets
zhusy09/Fake-Trump-Tweet
A model to generate Fake Trump Tweets, which says "RealDonaldTrump" a lot!!!
zhusy09/fake_trump_tweet
Tweet Classifiers using Multinomial Naive Bayes and LSTM + Tweet Generators using Markov Chain and LSTM = the Most Trumpy (fake) Tweets
zhusy09/fucking-algorithm
手把手撕LeetCode题目,扒各种算法套路的裤子。English version supported! Crack LeetCode, not only how, but also why.
zhusy09/GloVe
GloVe model for distributed word representation
zhusy09/gpt-2-simple
Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
zhusy09/kenlm
KenLM: Faster and Smaller Language Model Queries
zhusy09/LangueOne
练习题︱基于今日头条开源数据的文本挖掘
zhusy09/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
zhusy09/make-lstm-great-again
Donald Trump's tweets generator
zhusy09/New-Word-Detection
新词发现算法(NewWordDetection)
zhusy09/New-Word-Discovery
新词发现 基于词频、凝聚系数和左右邻接信息熵
zhusy09/New_word_discovery
中文新词发现
zhusy09/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
zhusy09/PassGAN
A Deep Learning Approach for Password Guessing (https://arxiv.org/abs/1709.00440)
zhusy09/PassGAN-1
PassGAN source code for Python 3 & TensorFlow 1.13 with a pre-trained model. https://arxiv.org/abs/1709.00440
zhusy09/py-kenlm-model
python | 高效使用统计语言模型kenlm:新词发现、分词、智能纠错等
zhusy09/pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
zhusy09/semantic-guesser
Training and testing of linguistic passwords models.
zhusy09/Swifter.Json
A powerful, easy-to-use and fastest json serializer and deserializer on .Net platforms.
zhusy09/tensorflow-1.4-billion-password-analysis
Deep Learning model to analyze a large corpus of clear text passwords.
zhusy09/trump-tweet-archive
trump twitter archive
zhusy09/Trump-Tweet-Generator
A project to finetune a GPT-2 338M model on US President Donald Trump's twitter feed, along with a basic Flask website to display some generated outputs from the model.
zhusy09/vnote
A note-taking application that knows programmers and Markdown better.
zhusy09/word-discovery
速度更快、效果更好的中文新词发现