DipeshV
A Data Engineer at heart with an eye of an Architect - DevOps | DataOps | MLOps
FortumStockholm
Pinned Repositories
cloudera-certified-hadoop-developer
data-visualization-matplotlib
This repository contains code for the code along session for the concept Data Visualization
data-wrangling-pandas-code-along
This repository contains code for the code along session for the concept Data Wrangling with Pandas
databricks-api
Simple client for databricks rest api
datatalks-workshop
dsmp-pre-work
Code repository for the Pre Work program at GreyAtom
eda
EDA of automobile data
feature-selection-logistic-regression
Feature Selection and Logistic Regression on Spam dataset
ga-learner-dsmp-repo
A collection of projects as part of the Data Science Masters Program at GreyAtom EduTech Pvt Ltd
getting-started-python-code-along
This repository contains code for the code along session for the concept Getting Started with Python
DipeshV's Repositories
DipeshV/cloudera-certified-hadoop-developer
DipeshV/data-visualization-matplotlib
This repository contains code for the code along session for the concept Data Visualization
DipeshV/data-wrangling-pandas-code-along
This repository contains code for the code along session for the concept Data Wrangling with Pandas
DipeshV/databricks-api
Simple client for databricks rest api
DipeshV/datatalks-workshop
DipeshV/dsmp-pre-work
Code repository for the Pre Work program at GreyAtom
DipeshV/eda
EDA of automobile data
DipeshV/feature-selection-logistic-regression
Feature Selection and Logistic Regression on Spam dataset
DipeshV/ga-learner-dsmp-repo
A collection of projects as part of the Data Science Masters Program at GreyAtom EduTech Pvt Ltd
DipeshV/getting-started-python-code-along
This repository contains code for the code along session for the concept Getting Started with Python
DipeshV/handling-program-flow-in-python-code-along
This repository contains code for the code along session for Handling program Flow in Python Code
DipeshV/hive-scd-examples
How to manage Slowly Changing Dimensions with Apache Hive
DipeshV/How_to_Create-DataLake-Spark
DipeshV/java-design-patterns
This repo contains examples of Java Design Patterns
DipeshV/manipulating-data-with-numpy-code-along
This repository contains code for the code along session for the concept Manipulating Data with NumP
DipeshV/mlflow-docker
Production ready docker-compose configuration for ML Flow with Mysql and Minio S3
DipeshV/mutual-fund-returns
Predict the mutual fund returns in terms of bond spread
DipeshV/olympic-hero
Olympic Hero
DipeshV/pyspark_demo
DipeshV/s3-sqs-connector
A library for reading data from Amzon S3 with optimised listing using Amazon SQS using Spark SQL Streaming ( or Structured streaming).
DipeshV/sbt-release
A release plugin for sbt
DipeshV/spark-dynamodb
Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.
DipeshV/spark-integration-tests
Integration tests for Spark
DipeshV/spark-property-tests
Write property based tests easily on spark dataframes
DipeshV/spark-scala-k8-app
A sample on showing how to deploy the Spark Scala code on Kubernetes using spark-ink8s-operation
DipeshV/spark-xml
XML data source for Spark SQL and DataFrames
DipeshV/tiny
tiny