pourya-ir's Stars
yangshun/tech-interview-handbook
💯 Curated coding interview preparation materials for busy software engineers
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
recommenders-team/recommenders
Best Practices on Recommendation Systems
marcotcr/lime
Lime: Explaining the predictions of any machine learning classifier
modin-project/modin
Modin: Scale your Pandas workflows by changing a single line of code
VowpalWabbit/vowpal_wabbit
Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.
blue-yonder/tsfresh
Automatic extraction of relevant features from time series:
alteryx/featuretools
An open source python library for automated feature engineering
python-visualization/folium
Python Data. Leaflet.js Maps.
interpretml/interpret
Fit interpretable models. Explain blackbox machine learning.
microsoft/SynapseML
Simple and Distributed Machine Learning
gboeing/osmnx
OSMnx is a Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
holoviz/datashader
Quickly and accurately render even the largest data.
Yelp/mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services
WillKoehrsen/feature-selector
Feature selector is a tool for dimensionality reduction of machine learning datasets
databricks/spark-deep-learning
Deep Learning Pipelines for Apache Spark
bckenstler/CLR
andrea-cuttone/geoplotlib
python toolbox for visualizing geographical data and making maps
d6t/d6tflow
Python library for building highly effective data science workflows
gmplot/gmplot
Plot data on Google Maps, the easy way.
kundajelab/deeplift
Public facing deeplift repo
h2oai/driverlessai-recipes
Recipes for Driverless AI
flowlight0/talkingdata-adtracking-fraud-detection
My solution for TalkingData AdTracking Fraud Detection Challenge (https://www.kaggle.com/c/talkingdata-adtracking-fraud-detection/)
charlieg/A-Smattering-of-NLP-in-Python
A very brief introduction to Natural Language Processing programming in Python
Apress/machine-learning-with-pyspark
Source Code for 'Machine Learning with PySpark' by Pramod Singh
lmassaron/kaggledays-2019-gbdt
Kaggle Days Paris - Competitive GBDT Specification and Optimization Workshop
PacktPublishing/PySpark-Cookbook
PySpark Cookbook, published by Packt
PrincipalComponent-zz/AXA_Telematics
Contains the 2nd prize winning solution in AXA's "Driver Telematics Driver Analysis" competition
cmuth001/Data-Engineer-Nano-Degree
Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.
JasonKessler/GlobalAI2018
Slides and Code for Natural Language Visualization with Scattertext