zleex's Stars
learn-co-students/Hypothesis_testing
Bergvca/string_grouper
Super Fast String Matching in Python
Bergvca/pyspark_dist_explore
Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights into your Spark DataFrames.
ericmjl/Network-Analysis-Made-Simple
An introduction to network analysis and applied graph theory using Python and NetworkX
MUSA-620-Fall-2019/philadelphia-shootings-app
A Panel-based dashboard showing recent shootings in Philadelphia using Altair, Folium, and Hvplot
finlytics-hub/credit_risk_model
A comprehensive credit risk model and scorecard using data from Lending Club
guillermo-navas-palencia/optbinning
Optimal binning: monotonic binning with constraints. Supports batch & stream optimal binning. Scorecard modelling and counterfactual explanations.
jstephenj14/Cost-Sensitive-Churn-Modelling
Converting transaction data to a cost-aware churn model using approximated customer lifetime value and efficient test/control windows.
jstephenj14/leetcode-sql-unlocked
Through the command line, the user can easily access all ~125 LeetCode SQL/Database questions, even locked ones, and automatically generate the tables in db-fiddle.com.
jstephenj14/Monotonic-WOE-Binning-Algorithm
Python package that optimizes information value, weight-of-evidence monotonicity and representativeness of features for credit scorecard models (pip install monotonic-binning)
Selvameenakshi/Credit-card-default-Regression
Predict the credit limit for customers
rachel872/oreilly-intro-to-predictive-clv
Repo that contains the supporting material for O'Reilly Webinar "An Intro to Predictive Modeling for Customer Lifetime Value"
k-bosko/CLV_prediction
Predicting Customer Lifetime Value
explosion/spacy-course
👩‍🏫 Advanced NLP with spaCy: A free online course
iterative/dvc
🦉 Data Versioning and ML Experiments
chiphuyen/stanford-tensorflow-tutorials
This repository contains code examples for the Stanford course: TensorFlow for Deep Learning Research.
chiphuyen/just-pandas-things
An ongoing list of pandas quirks
chiphuyen/coding-exercises
My implementation of useful data structures, algorithms, as well as my solutions to programming puzzles.
chiphuyen/sotawhat
Returns the latest research results by crawling arXiv papers and summarizing abstracts. Helps you stay afloat with so many new papers every day.
YLTsai0609/pyspark_101
Yu Long's notes about Spark and PySpark
feng-li/Distributed-Statistical-Computing
Teaching Materials for Distributed Statistical Computing (teaching materials for big-data distributed computing)
kevinschaich/pyspark-cheatsheet
🐍 Quick reference guide to common patterns & functions in PySpark.
MingChen0919/learning-apache-spark
Notes on Apache Spark (pyspark)
susanli2016/PySpark-and-MLlib
Getting started with PySpark and MLlib
AlexIoannides/pyspark-example-project
Implementing best practices for PySpark ETL jobs and applications.
databricks/LearningSparkV2
This is the GitHub repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
aakinlalu/ThinkBayes
Code repository for Think Bayes.
aakinlalu/d3-book
Code examples for “Interactive Data Visualization for the Web”
aakinlalu/100-Days-Of-ML-Code
100 Days of ML Coding
aakinlalu/awesome-project-ideas
Curated list of Machine Learning, NLP, Vision, Recommender Systems Project Ideas