Pinned Repositories
flink
Apache Flink
graphstorm
Enterprise graph machine learning framework for billion-scale graphs for ML scientists and data scientists.
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
cuda-sgd-sese-project
Implementation of mini-batch Stochastic Gradient Descent (SGD) in CUDA using Thrust and cuBLAS
thvasilo.github.io
Personal website/blog
uncertain-trees-reproducible
Online random forests with prediction uncertainty
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
thvasilo's Repositories
thvasilo/thvasilo.github.io
Personal website/blog
thvasilo/uncertain-trees-reproducible
Online random forests with prediction uncertainty
thvasilo/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
thvasilo/phd-thesis-tex
thvasilo/uncertain-trees-experiments
Experiments scripts for online trees with uncertainty
thvasilo/adaqs
AdaQS: Adaptive QuickScorer for Sparse Data and Regression Trees with Default Directions
thvasilo/amazon-sagemaker-developer-guide
The open source version of the Amazon SageMaker docs
thvasilo/annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
thvasilo/autogluon
AutoGluon: AutoML Toolkit for Deep Learning
thvasilo/block-gbt-poster
thvasilo/cpython
The Python programming language
thvasilo/dgl
Python package built to ease deep learning on graph, on top of existing DL frameworks.
thvasilo/dmlc-core
A common bricks library for building scalable and portable distributed machine learning.
thvasilo/dotfiles
A collection of dotfiles I tend to use
thvasilo/garcia-multiple-testing
A copy of the codebase provided by Salvador García and Francisco Herrera for their JMLR paper "An Extension on "Statistical Comparisons of Classifiers over Multiple Data Sets" for all Pairwise Comparisons"
thvasilo/graphlaxy
thvasilo/graphstorm
Enterprise graph machine learning framework for billion-scale graphs for ML scientists and data scientists.
thvasilo/hummingbird
Hummingbird compiles trained ML models into tensor computation for faster inference.
thvasilo/LightGBM
A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
thvasilo/moa
MOA is an open source framework for Big Data stream mining. It includes a collection of machine learning algorithms (classification, regression, clustering, outlier detection, concept drift detection and recommender systems) and tools for evaluation.
thvasilo/mondrianforest
Code for Mondrian Forests (for classification and regression)
thvasilo/phraug2
A new version of phraug, which is a set of simple Python scripts for pre-processing large files
thvasilo/ps-lite
A lightweight parameter server interface
thvasilo/rabit
Reliable Allreduce and Broadcast Interface for distributed machine learning
thvasilo/sagemaker-spark-container
The SageMaker Spark Container is a Docker image used to run data processing workloads with the Spark framework on Amazon SageMaker.
thvasilo/scikit-garden
A garden for scikit-learn compatible trees
thvasilo/sketches-core-cpp
C++ implementation
thvasilo/SysML-reading-list
Systems for ML/AI & ML/AI for Systems paper reading list: A curated reading list of computer science research for work at the intersection of machine learning and systems. PR are welcome.
thvasilo/voila
Voilà turns Jupyter notebooks into standalone web applications
thvasilo/yarn-ec2
Quickly start YARN cluster on EC2