vim89
Software & Data Engineer Lives in Mumbai, India Mumbai-Pune-Goa-Mumbai Love cars, I enjoy driving my @Ford 1.5 TDCi Watch collector:Own over 3 dozen
Staff Data Engineer @WalmartMumbai, India
Pinned Repositories
shc
The Apache Spark - Apache HBase Connector is a library to support Spark accessing HBase table as external data source or sink.
ConferenceManager
Better OOAM & Design
Data-Structures-and-Algorithms
Data Structures and Algorithms
dataengineering-savvy
datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Deedy-Resume
A one page , two asymmetric column resume template in XeTeX that caters to an undergraduate Computer Science student
Design-Patterns
Design patterns provide a reusable solution to commonly occurring software problems.
gcp-datalake
Read various types of files from Google storage, Maps data to Google Bigtable & Performs bulk load into Google Bigtable
MapReduceExamples
MapReduce various examples & Algorithm & Hadoop Batch Processing using MapReduce
spark-spring-boot
Apache spark on/with support of Spring boot
vim89's Repositories
vim89/datapipelines-essentials-python
Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
vim89/spark-spring-boot
Apache spark on/with support of Spring boot
vim89/Data-Structures-and-Algorithms
Data Structures and Algorithms
vim89/dataengineering-savvy
vim89/abortable-bytes
a tiny scala/kyo POC that makes cloud uploads cancel-safe. Stream large files to cloud & clean-up on cancel
vim89/cask
Cask: a Scala HTTP micro-framework. Cask makes it easy to set up a website, backend server, or REST API using Scala
vim89/compile-time-data-contracts
Enforcing enforces compile‑time data contracts to data-pipelines leveraging Scala's power
vim89/covid19
vim89/Data-Modeling-with-Postgres
Project to define a fact and dimension star schema optimized for queries created using the flight booking database using a PostgreSQL Relational database.
vim89/datafusion
Apache DataFusion SQL Query Engine
vim89/flink
The official repository for the Rock the JVM Flink course
vim89/flowforge
Let's be honest - most data pipeline frameworks treat types as suggestions. Config files are strings. Schemas are "validated" at runtime. Data quality is an afterthought. So, let's do differently
vim89/GoBooks
List of Golang books
vim89/homebrew-tools
A custom Homebrew tap
vim89/ink-free
Crisp, minimal personal website and blog theme Hugo, designed for a collaborative, open-source, privacy conscious blog.
vim89/llm4s
Agentic and LLM Programming in Scala
vim89/llm4s.g8
giter8 templates for LLM4S
vim89/minimal-mistakes
:triangular_ruler: Jekyll theme for building a personal site, blog, project documentation, or portfolio.
vim89/project-layout
Standard Go Project Layout
vim89/pyspark-xml-parsing
Derive new attributes from XML when you have XPATH transformations. Accelerate boring stuff in #Pyspark & #Python. Also check out how to handle multi line XML
vim89/rockthejvm-site
vim89/scalasql
Scala ORM to query SQL databases from Scala via concise, type-safe, and familiar case classes and collection operations. Connects to Postgres, MySql, H2, and Sqlite out of the box
vim89/spark-streaming
The official repository for the Rock the JVM Spark Streaming course
vim89/spark-testing-base
Base classes to use when writing tests with Spark
vim89/spark-xml
XML data source for Spark SQL and DataFrames
vim89/szork
Voice-controlled Zork-style adventure demo using LLM4S: speech, image, tools, and agentic gameplay.
vim89/toon4s
vim89/valmuri
A batteries‑included, functional-first toolkit for building full‑stack JVM apps in @scala
vim89/vim89
vim89/zio-spark
A functional wrapper around Spark to make it works with ZIO