chongjeeseng's Stars
twitter/the-algorithm
Source code for Twitter's Recommendation Algorithm
Ephemeral-Ahsan/Complex-SQL-Practice-for-Interview
I have tried to solve some complex SQL interview questions that had been asked in several company. Collected this question from Ankit Bansal. He made a organized playlist along with the solution. I have tried to figure out them first and then checked whether my query is correct or no. Thanks in advance.
jonathan-bower/DataScienceResources
Open Source Data Science Resources.
rushter/data-science-blogs
A curated list of data science blogs
ml-tooling/best-of-ml-python
🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
SigmaQuan/Better-Python-59-Ways
Code Sample of Book "Effective Python: 59 Specific Ways to Write Better Pyton" by Brett Slatkin
cuge1995/awesome-time-series
list of papers, code, and other resources
ResidentMario/missingno
Missing data visualization module for Python.
fabiopipitone/elasticsearch-tocsv
Simple python tool to easily extract massive amounts of data from Elasticsearch into a csv file
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
matterport/Mask_RCNN
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
iam-mhaseeb/Skytrax-Data-Warehouse
A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for cloud data warehouse and Metabase to serve the needs of data visualizations such as analytical dashboards.
kearnz/autoimpute
Python package for Imputation Methods
jphall663/awesome-machine-learning-interpretability
A curated list of awesome responsible machine learning resources.
chrisluedtke/data-science-journal
Personal repository of data science demonstrations and references
zamzambadruzaman/big-data-engineering-indonesia
A curated list of big data engineering tools, resources and communities.
jghoman/awesome-apache-airflow
Curated list of resources about Apache Airflow
iterative/dvc
🦉 ML Experiments and Data Management with Git
sryza/aas
Code to accompany Advanced Analytics with Spark from O'Reilly Media
datasciencescoop/Data-Science--Cheat-Sheet
Cheat Sheets
MingChen0919/learning-apache-spark
Notes on Apache Spark (pyspark)
elsyifa/Classification-Pyspark
This repository of classification template using pyspark.
MarcKaminski/spark-FeatureSelection
Featureselection methods as Spark MLlib Pipelines
alteryx/predict-customer-churn
A general-purpose framework for solving problems with machine learning applied to predicting customer churn
WillKoehrsen/automated-feature-engineering
Automated feature engineering in Python with Featuretools
benedekrozemberczki/awesome-community-detection
A curated list of community detection research papers with implementations.
benedekrozemberczki/awesome-graph-classification
A collection of important graph embedding, classification and representation learning papers with implementations.
firmai/industry-machine-learning
A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
uwescience/datasci_course_materials
Public repository for course materials for the Data Science at Scale Specialization at Coursera