Pinned Repositories
AdvancedSQLPuzzles
Welcome to my GitHub repository. I hope you enjoy solving these puzzles as much as I have enjoyed creating them.
community-clusters
We look to apply big data techniques to perform large-scale graph mining on web crawl data. We utilize the Common Crawl web dataset, which is an open dataset hosted on Amazon Web Services' cloud platform. This is an extremely large dataset that consists of petabytes of web crawl data.
dive-in-ml
equity-portfolio-prediction
etl-series
gdx-sqlite
A cross-platform extension for database handling in libGDX
kafka-jython
This repo complements the blog post on Interfacing Jython with Kafka 0.8.x. It can also be used as a bare minimum to interface Kafka with Jython.
sarama
Sarama is a Go library for Apache Kafka 0.8 and 0.9
simple-crawler
A super simple web crawler framework written in Python.
transfer-learning-bigdl
Jupyter notebook showing transfer learning using BigDL and Analytics Zoo
mrafayaleem's Repositories
mrafayaleem/gdx-sqlite
A cross-platform extension for database handling in libGDX
mrafayaleem/equity-portfolio-prediction
mrafayaleem/etl-series
mrafayaleem/transfer-learning-bigdl
Jupyter notebook showing transfer learning using BigDL and Analytics Zoo
mrafayaleem/dive-in-ml
mrafayaleem/AdvancedSQLPuzzles
Welcome to my GitHub repository. I hope you enjoy solving these puzzles as much as I have enjoyed creating them.
mrafayaleem/data-engineer-roadmap
Roadmap to becoming a data engineer in 2020
mrafayaleem/academic_advisory
Collected opinions and advice for academic programs focused on data science skills.
mrafayaleem/ai-by-hand-excel
mrafayaleem/annotated_encoder_decoder
The Annotated Encoder Decoder with Attention
mrafayaleem/Financial-Models-Numerical-Methods
Collection of notebooks about quantitative finance, with interactive python code.
mrafayaleem/formula1-analysis
mrafayaleem/fullstack.ai
End-to-end machine learning project showing key aspects of developing and deploying an ML-driven application
mrafayaleem/goodreads_etl_pipeline
An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.
mrafayaleem/kubernetes-workshop
⚙️ A gentle introduction to Kubernetes with more than just the basics. 🌟 Give it a star if you like it.
mrafayaleem/learn-python
📚 Playground and cheatsheet for learning Python. Collection of Python scripts that are split by topics and contain code examples with explanations.
mrafayaleem/machine-learning-interview
Minimum Viable Study Plan for Machine Learning Interviews from FAAG, Snapchat, LinkedIn.
mrafayaleem/mml-book.github.io
Companion webpage to the book "Mathematics For Machine Learning"
mrafayaleem/mrafayaleem.github.io
Code for my personal blog
mrafayaleem/portable-data-stack-dagster
A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset
mrafayaleem/pure-bash-bible
📖 A collection of pure bash alternatives to external processes.
mrafayaleem/redis-rdb-tools
Parse Redis dump.rdb files, Analyze Memory, and Export Data to JSON
mrafayaleem/resilience4j
Resilience4j is a fault tolerance library designed for Java 8 and functional programming
mrafayaleem/reverse-interview
Questions to ask the company during your interview
mrafayaleem/soda-core
Data reliability tools for SQL- and Spark-accessible data
mrafayaleem/sql-snippets
A curated collection of helpful SQL queries and functions, maintained by Count.
mrafayaleem/streamz
Real-time stream processing for python
mrafayaleem/system-design-funnel-analysis
mrafayaleem/TastyIgniter
🔥 Powerful, yet easy to use, open-source online ordering, table reservation and management system for restaurants
mrafayaleem/tf-idf-luigi-pipeline