Pinned Repositories
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
airflow-on-kubernetes
Bare minimal Airflow on Kubernetes (Local, EKS, AKS)
azure-airflow
cluster-policy-sdk
cluster-policy-sdk
databricks-end-to-end-streaming-main
dbldatagen-master
Delta-Live-Tables-main
dlt-pii-firewall-main
drunken-data-quality
Spark package for checking data quality
great-expectation
great expectation
anayyar82's Repositories
anayyar82/cluster-policy-sdk
cluster-policy-sdk
anayyar82/databricks-end-to-end-streaming-main
anayyar82/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
anayyar82/airflow-on-kubernetes
Bare minimal Airflow on Kubernetes (Local, EKS, AKS)
anayyar82/azure-airflow
anayyar82/dbldatagen-master
anayyar82/Delta-Live-Tables-main
anayyar82/dlt-pii-firewall-main
anayyar82/drunken-data-quality
Spark package for checking data quality
anayyar82/great-expectation
great expectation
anayyar82/gtc2017-numba
Numba tutorial for GTC 2017 conference
anayyar82/hyperas
Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization
anayyar82/ide-best-practices
Best practices for working with Databricks from an IDE
anayyar82/lakehousedemo
anayyar82/Mapreduce-
anayyar82/Mapreduce-Custom-Input-format
Ebcidic to Ascii
anayyar82/mlflow
Open source platform for the machine learning lifecycle
anayyar82/mlflow-cicd-ankur
anayyar82/mlflowdemo
anayyar82/mlops
mlops slack
anayyar82/my-mlops-project
my-mlops-project
anayyar82/notebook
anayyar82/simple-keras-rest-api
A simple Keras REST API using Flask
anayyar82/spark
Mirror of Apache Spark
anayyar82/spark-pandas
Koala: Pandas APIs on Apache Spark
anayyar82/Twitter-Sentiment-Analysis-using-Apache-Spark-
Accessed the Twitter API for live streaming tweets. Performed Feature Extraction and transformation from the JSON format of tweets using machine learning package of python pyspark.mllib. Experimented with three classifiers -Naïve Bayes, Logistic Regression and Decision Tree Learning and performed k-fold cross validation to determine the best.
anayyar82/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow