Pinned Repositories
ctr-prediction
Predict CTR for the Criteo-sponsored Kaggle competition.
daft-builder
Wrapper library on daft that provides a builder interface for rendering probabilistic graphical models (PGMs).
dblp
Parse the dblp data into a structured format for experimentation.
gensim
Topic Modelling for Humans
motif-classify
Classify time series data using motifs discovered from Sequitur processing of SAX discretized data.
nsf-award-data
Documentation and scripts which parse the NSF historical award data and store it as a complete dataset in a RDB.
ntsgp
Educational Recommender Systems
recsys
Recommender system experiments.
macks22's Repositories
macks22/dblp
Parse the dblp data into a structured format for experimentation.
macks22/ntsgp
Educational Recommender Systems
macks22/motif-classify
Classify time series data using motifs discovered from Sequitur processing of SAX discretized data.
macks22/daft-builder
Wrapper library on daft that provides a builder interface for rendering probabilistic graphical models (PGMs).
macks22/gensim
Topic Modelling for Humans
macks22/mixture_models
Implementations of various supervised and unsupervised mixture models
macks22/scikit-learn
scikit-learn: machine learning in Python
macks22/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
macks22/birth-data
CDC Vital Statistics birth datasets
macks22/elasticsearch-playground
Scripts to build a test index with test data.
macks22/ersgen
educational recommender systems data generation and analysis
macks22/fastFM
fastFM: A Library for Factorization Machines
macks22/GI
Sequitur and RePair grammar induction algorithms implementation
macks22/grammarviz2_src
GrammarViz 2.0 public release:
macks22/gym
A toolkit for developing and comparing reinforcement learning algorithms.
macks22/intuitive-ml
macks22/ipython-tikzmagic
IPython magics for generating figures with TikZ
macks22/kafka-connect-elasticsearch
macks22/kaggle_Microsoft_Malware
code for kaggle competition Microsoft malware classification
macks22/keep-a-changelog
If you build software, keep a changelog.
macks22/Limp
Lisp IDE
macks22/luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
macks22/mapreduce
macks22/ner-experiments
Experiments with Named Entity Recognition (NER) for learning purposes.
macks22/prefix-tree
Simple in-memory, pure Python prefix tree for autocompletion search
macks22/pymc3
PyMC version 3
macks22/scipy
Scipy library main repository
macks22/sphinx-example
A mini-tutorial / cheatsheet / link-collection to get you started documenting Python code using Sphinx.
macks22/text-editing
Various tools that make my life easier when writing/editing text documents.
macks22/TopicModeling
Topic Modeling on Apache Spark