Pinned Repositories
cnn-text-classification-tf
Convolutional Neural Network for Text Classification in TensorFlow with word2vec embeddings (switch to branch dev-mmihaltz!)
hunlp-pipeline
Hungarian NLP pipeline for tokenization, POS tagging and stemming using open source tools
huwn
Hungarian WordNet / Magyar WordNet
huwn.rdf
Hungarian WordNet in RDF format for the Linked Open Data cloud
pysettrie
Python 3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the supersets/subsets of a given set from a collection of sets. Also includes a trie-based mapping container where the keys are sets.
trendminer-hunlp
Hungarian NLP pipeline for social media text analysis (TrendMiner project)
trendminer-hutools
Various tools used by TrendMiner/hu (Facebook data download, Java NooJ import/export format conversion)
word2vec-GoogleNews-vectors
word2vec Google News model
mmihaltz's Repositories
mmihaltz/word2vec-GoogleNews-vectors
word2vec Google News model
mmihaltz/pysettrie
Python 3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the supersets/subsets of a given set from a collection of sets. Also includes a trie-based mapping container where the keys are sets.
mmihaltz/huwn
Hungarian WordNet / Magyar WordNet
mmihaltz/trendminer-hunlp
Hungarian NLP pipeline for social media text analysis (TrendMiner project)
mmihaltz/hunlp-pipeline
Hungarian NLP pipeline for tokenization, POS tagging and stemming using open source tools
mmihaltz/huwn.rdf
Hungarian WordNet in RDF format for the Linked Open Data cloud
mmihaltz/trendminer-hutools
Various tools used by TrendMiner/hu (Facebook data download, Java NooJ import/export format conversion)
mmihaltz/cnn-text-classification-tf
Convolutional Neural Network for Text Classification in TensorFlow with word2vec embeddings (switch to branch dev-mmihaltz!)
mmihaltz/biralat
Review template for BME theses
mmihaltz/dedupe
A Python library for accurate and scalable fuzzy matching, record deduplication and entity resolution.
mmihaltz/huwn-util
Miscellaneous utilities for Hungarian WordNet data files
mmihaltz/keep-a-changelog
If you build software, keep a changelog.
mmihaltz/libWNXML
C++ API for querying Hungarian WordNet XML files
mmihaltz/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
mmihaltz/opener-setup
Script to download and install OpeNER and all dependencies in one go (Ubuntu)
mmihaltz/pytimeout
Python module for enforcing timeouts on Python code
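
The pysettrie entry above describes storing a collection of sets in a trie and querying it for subsets and supersets of a given set. The following is a minimal sketch of that idea, not the package's actual API: the class name `SetTrieSketch` and its methods are hypothetical, and it assumes the set elements are mutually comparable so each set can be stored as a path of its sorted elements.

```python
class SetTrieSketch:
    """Hypothetical minimal set-trie: each stored set is a path of its sorted elements."""

    def __init__(self):
        self.children = {}   # element -> child node
        self.is_end = False  # True if a stored set ends at this node

    def add(self, items):
        """Insert a set by walking/creating the path of its sorted elements."""
        node = self
        for item in sorted(set(items)):
            node = node.children.setdefault(item, SetTrieSketch())
        node.is_end = True

    def subsets(self, query):
        """Return all stored sets that are subsets of `query`."""
        found = []
        self._subsets(sorted(set(query)), 0, [], found)
        return found

    def _subsets(self, query, start, path, found):
        if self.is_end:
            found.append(set(path))
        # Only follow edges labelled with remaining query elements.
        for i in range(start, len(query)):
            child = self.children.get(query[i])
            if child is not None:
                child._subsets(query, i + 1, path + [query[i]], found)

    def supersets(self, query):
        """Return all stored sets that are supersets of `query`."""
        found = []
        self._supersets(sorted(set(query)), 0, [], found)
        return found

    def _supersets(self, query, idx, path, found):
        if idx == len(query):
            # Every query element has been matched; all sets below qualify.
            self._collect(path, found)
            return
        for item, child in self.children.items():
            if item < query[idx]:
                # Extra element not in the query; keep looking for query[idx].
                child._supersets(query, idx, path + [item], found)
            elif item == query[idx]:
                child._supersets(query, idx + 1, path + [item], found)
            # item > query[idx]: paths are sorted, so query[idx] cannot appear deeper.

    def _collect(self, path, found):
        if self.is_end:
            found.append(set(path))
        for item, child in self.children.items():
            child._collect(path + [item], found)


trie = SetTrieSketch()
for s in ({1, 3}, {1, 3, 5}, {2, 4}, {1, 2, 4}):
    trie.add(s)

subsets_found = trie.subsets({1, 3, 4})
supersets_found = trie.supersets({2, 4})
```

Because paths are sorted, both queries can prune branches that cannot contain a match, which is where the trie layout saves work over scanning every stored set.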