Pinned Repositories
create-and-run-spark-job
Create n-node cluster and Run spark job on Docker
CreditCard_Fraud_Detection
Spark MLLib Application for Credit Card Fraud Detection - Structured Streaming
Docker_WordCount_Spark
Sample spark program to run in docker setup
KafkaFile_BatchProcessing
Project to demo file processing w/ Avro schema in Scala using gradle
MultiCurrencyPiggyBankCalculator
This is a small project for y kid who has a vast collection of coins of different currencies in her Piggy Bank
SearchEngine_ES_Flask
This project is a quick deomo of building search engine using Elasticsearch
Spark_Cassandra_Example
Sample code to demo spark cassandra connector (Spark v. 2.x; Cassandra v. 3.x)
Spark_Cassandra_Python
Repo to demo basic WordCount in PySpark using PyCharm
Spark_Streaming_Examples
This repo contains spark structured streaming examples in Scala
StockTwit_SentimentAnalysis
Repo to perform Sentiment Analysis on StockTwits
pavanpkulkarni's Repositories
pavanpkulkarni/create-and-run-spark-job
Create n-node cluster and Run spark job on Docker
pavanpkulkarni/Spark_Streaming_Examples
This repo contains spark structured streaming examples in Scala
pavanpkulkarni/SearchEngine_ES_Flask
This project is a quick deomo of building search engine using Elasticsearch
pavanpkulkarni/Spark_Cassandra_Python
Repo to demo basic WordCount in PySpark using PyCharm
pavanpkulkarni/CreditCard_Fraud_Detection
Spark MLLib Application for Credit Card Fraud Detection - Structured Streaming
pavanpkulkarni/docker-spark-image
This repo contains docker image for Spark 2.2.1 cluster
pavanpkulkarni/Spark_Mongo_Example
This repo contains mongo spark sample code in Scala
pavanpkulkarni/Docker_WordCount_Spark
Sample spark program to run in docker setup
pavanpkulkarni/KafkaFile_BatchProcessing
Project to demo file processing w/ Avro schema in Scala using gradle
pavanpkulkarni/MultiCurrencyPiggyBankCalculator
This is a small project for y kid who has a vast collection of coins of different currencies in her Piggy Bank
pavanpkulkarni/Spark_Cassandra_Example
Sample code to demo spark cassandra connector (Spark v. 2.x; Cassandra v. 3.x)
pavanpkulkarni/bhai
Explore this fun language --> bhailang
pavanpkulkarni/blog
This repo holds all the blog content
pavanpkulkarni/data-engineer-learning-path
Databricks Spark Materials As Retired from Databricks Academy
pavanpkulkarni/hello-github-actions
pavanpkulkarni/MongoDB_Python
Repo to demo insert and delete operations for MongoDB in Python
pavanpkulkarni/pavanpkulkarni
Config files for my GitHub profile.
pavanpkulkarni/pavanpkulkarni.github.io
Test wen hosting on github
pavanpkulkarni/PySpark_WordCount
Repo to demo basic WordCount in PySpark using PyCharm
pavanpkulkarni/PythonDocker
A basic repo to build and deploy simple Flask App on Docker
pavanpkulkarni/Read_Write_HDFS_Spark_WordCount
Read_Write_HDFS_Spark_WordCount
pavanpkulkarni/ScalaTestWorkspace
This repo contains solutions to Coding Challenge
pavanpkulkarni/Spark_WordCount_Gradle
This repo spark wordcount code using Gradle build tool
pavanpkulkarni/SparkDocker
pavanpkulkarni/sparkMeasure
This is the development repository for sparkMeasure, a tool and library designed for efficient analysis and troubleshooting of Apache Spark jobs. It focuses on easing the collection and examination of Spark metrics, making it a practical choice for both developers and data engineers.
pavanpkulkarni/spring-batch
spring-batch example projects
pavanpkulkarni/TestGitCommands
This repo is get handon on Git Commands
pavanpkulkarni/The-Documentation-Compendium
📢 Various README templates & tips on writing high-quality documentation that people want to read.
pavanpkulkarni/Topic_Classification
This repo contains python project to show topic classification using LDA for Topic Modeling and then applying Word2Vec for classifying the topics.
pavanpkulkarni/training-kit
Open source cheat sheets for Git and GitHub