eliasah
Lead Data Scientist interested in recommender systems and beyond. I'm also a Scala & Spark evangelist. @awesome-spark @kiliba-codebase
Kiliba, Paris
Pinned Repositories
spark
Apache Spark - A unified analytics engine for large-scale data processing
awesome-spark
A curated list of awesome Apache Spark packages and resources.
learn-by-examples
Real-world Spark pipeline examples
spark-gotchas
Spark Gotchas. A subjective compilation of Apache Spark tips and tricks
elasticsearch-definitive-guide
The Definitive Guide to Elasticsearch
airship-convert
A proof of concept converter for Apache Zeppelin notebooks.
elasticsearch-hadoop
Elasticsearch real-time search and analytics natively integrated with Hadoop
ES-Elastica-Project
PHP Elasticsearch Indexer using Elastica (this repo is more of an example of how to use Elastica)
eliasah's Repositories
eliasah/airship-convert
A proof of concept converter for Apache Zeppelin notebooks.
eliasah/learn-by-examples
Real-world Spark pipeline examples
eliasah/MSDChallenge
Million Song Dataset challenge using Item-Based CF
eliasah/awesome-spark
A curated list of awesome Apache Spark packages and resources.
eliasah/machine-learning-for-software-engineers
A complete daily plan for studying to become a machine learning engineer.
eliasah/spark-in-a-box
Template-based Dockerfile generator for Apache Spark applications.
eliasah/triplet_recommendations_keras
An example of doing MovieLens recommendations using triplet loss in Keras
eliasah/awesome-spark.github.io
eliasah/aws-big-data-blog
eliasah/benchm-ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
eliasah/CausE
Code for the RecSys 2018 paper entitled Causal Embeddings for Recommendation.
eliasah/datasets
eliasah/dbscan-on-spark
An implementation of DBSCAN running on top of Apache Spark
eliasah/elasticsearch-vector-scoring
Score documents with pure dot product / cosine similarity with ES
eliasah/entity-embedding-rossmann
eliasah/hands_on_kafka
eliasah/handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
eliasah/implicit
Fast Python Collaborative Filtering for Implicit Feedback Datasets
eliasah/incubator-toree
Mirror of Apache Toree (Incubating)
eliasah/MDF-2018
eliasah/photon-ml
A scalable machine learning library on Apache Spark
eliasah/spark
Mirror of Apache Spark
eliasah/spark-bigquery-connector
The connector uses the Spark SQL Data Source API to read data from Google BigQuery.
eliasah/spark-gotchas
eliasah/spark-iforest
Isolation Forest on Apache Spark
eliasah/sparklyr
R interface for Apache Spark
eliasah/tensorframes
TensorFlow wrapper for DataFrames on Apache Spark
eliasah/ThinkStats2
Text and supporting code for Think Stats, 2nd Edition
eliasah/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow
eliasah/zeppelin
Mirror of Apache Zeppelin