Pinned Repositories
AmazonTopics
Practice topic models techniques on Amazon reviews dataset
aws-jupyter
Command-line tool for setting up Jupyter Notebook on AWS
bathymetry-analysis
chrome-to-saved.io
Script to migrate Chrome bookmarks to saved.io
election-tweets-crawler
A streaming tweets crawler for collecting tweets related to 2016 US presidential election.
markdowku
Forked the "markdowku" plugin of Dokuwiki.
PySpark-Scripts
Some useful scripts to work with PySpark [http://spark-project.org]
sparrow
Sparrow is a boosting algorithm implementation that is optimized for training on very large datasets and/or in the limited memory settings.
tmsn
general modules for implementing TMSN for various learning algorithms
vimrc
my vimrc
arapat's Repositories
arapat/sparrow
Sparrow is a boosting algorithm implementation that is optimized for training on very large datasets and/or in the limited memory settings.
arapat/markdowku
Forked the "markdowku" plugin of Dokuwiki.
arapat/bathymetry-analysis
arapat/vimrc
my vimrc
arapat/spark-notebook
Launch Apache Spark and Jupyter Notebook on Amazon Web Services.
arapat/aws-jupyter
Command-line tool for setting up Jupyter Notebook on AWS
arapat/bathymetry
Train boosted trees on the bathymetry data
arapat/sparkboost
arapat/sparrow-doc
The documentation for Sparrow and TMSN
arapat/election-tweets-crawler
A streaming tweets crawler for collecting tweets related to 2016 US presidential election.
arapat/tmsn
general modules for implementing TMSN for various learning algorithms
arapat/bathymetry-talk
arapat/boosting_tree_benchmarks
arapat/datasette
A tool for exploring and publishing data
arapat/defnse-talk
Jupyter notebook that were used to create the plots for my defense slides
arapat/early-stop-synthetic
arapat/edX-nbgrader
arapat/jboost
Mirror of JBoost from https://sourceforge.net/projects/jboost/
arapat/LightGBM
A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.
arapat/machine_learning_for_good
Machine learning fundamentals lesson in interactive notebooks
arapat/metricslib
A Rust library for validating the model performance.
arapat/online-methods-for-twitter
a project for analyzing large amounts of twitter data.
arapat/simple-spark-jobserver
A simple server app to receive jobs and submit to Spark, used for CSE 255 in SP16.
arapat/simpleurlshortener
Build a Simple URL Shortener
arapat/spark-notebook-emr
Launch Apache Spark and Jupyter Notebook on Amazon Web Services.
arapat/sparrow-experiments
Analyze logs generated by Sparrow
arapat/sparrow-scripts
arapat/sparrow-writeup
Manuscript for SysML 2018
arapat/swift-coreml-diffusers
Swift app demonstrating Core ML Stable Diffusion
arapat/yarn-ec2
Quickly start YARN cluster on EC2