Pinned Repositories
xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library for Python, R, Java, Scala, C++ and more. Runs on a single machine, Hadoop, Spark, Dask, Flink and DataFlow
bayesian-hypothesis-testing
Bayesian Hypothesis Testing using Monte Carlo Simulations
data-scientist-demystified
Explanation of the Data Scientist job function
doppel-speller
An ML+NLP solution for linking misspelled titles with the true titles
fast-arg-top-k
Get the indices of the top K values in an array
fastman
mhaseebtariq.github.io
Digital Resume
pyspark-helpers
Useful helper functions for PySpark dataframe operations
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
memory_profiler
Monitor memory usage of Python code
mhaseebtariq's Repositories
mhaseebtariq/doppel-speller
An ML+NLP solution for linking misspelled titles with the true titles
mhaseebtariq/fast-arg-top-k
Get the indices of the top K values in an array
mhaseebtariq/bayesian-hypothesis-testing
Bayesian Hypothesis Testing using Monte Carlo Simulations
mhaseebtariq/data-scientist-demystified
Explanation of the Data Scientist job function
mhaseebtariq/fastman
mhaseebtariq/mhaseebtariq.github.io
Digital Resume
mhaseebtariq/pyspark-helpers
Useful helper functions for PySpark dataframe operations