Pinned Repositories
ds-algo
Data Structures and Algorithms for Coding Interviews (Java & Scala)
e-commerce-marketing-pipeline
Data Pipeline examples using Oozie, Spark and Hive on Cloudera VM and AWS EC2 (branch aws-ec2)
geo-search-spark
Apache Spark v2.0.0 application written in Scala to map given latitude longitude values to nearest latitude longitude values in a given set using broadcasted indexes of available geo coordinates.
hive-migration
Migrating Hive Tables from one Hadoop Cluster to another and across versions
spark-indexed-dedup
Using Hash table based indexes for optimising joins in Apache Spark
spark-site-catalyst
Spark Data Source package to read data warehouse exports from Site Catalyst written for Apache Spark v1.6 and earlier and compatible with Spark 2.0 and above.
spark-skew-join-examples
Simple examples on techniques for handling skewed data in Spark 2.0
spark2-etl-examples
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
scio
A Scala API for Apache Beam and Google Cloud Dataflow.
anish749's Repositories
anish749/spark2-etl-examples
A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0
anish749/e-commerce-marketing-pipeline
Data Pipeline examples using Oozie, Spark and Hive on Cloudera VM and AWS EC2 (branch aws-ec2)
anish749/ds-algo
Data Structures and Algorithms for Coding Interviews (Java & Scala)
anish749/spark-skew-join-examples
Simple examples on techniques for handling skewed data in Spark 2.0
anish749/spark-indexed-dedup
Using Hash table based indexes for optimising joins in Apache Spark
anish749/better-subprocess
Examples on how to use wrap sub process in bash with correct exit code handling
anish749/spark-skew-join
Fragment Replicate Join for Spark Data frames using Count Min Sketches for estimating skews
anish749/algebird
Abstract Algebra for Scala
anish749/anish749.github.io
My GitHub Pages site
anish749/cloud-run-iap-terraform-demo
Deploy an IAP-secured application to Cloud Run using Terraform (e.g. an admin portal or internal company app)
anish749/dbeam
DBeam extracts SQL tables using JDBC and Apache Beam
anish749/encore
Encore is the end-to-end Backend Development Platform that lets you escape cloud complexity.
anish749/github-pages-stripe-checkout
Example of a client-only (no server) donation payment page that can be hosted on GitHub using Stripe Checkout.
anish749/jupyter-scala
Lightweight Scala kernel for Jupyter / IPython 3
anish749/langchain
🦜🔗 Build context-aware reasoning applications
anish749/magnolia
Simple, fast, debuggable generic derivation for typeclasses
anish749/magnolify
A collection of Magnolia add-on modules
anish749/median-motif-search
Median Motif Search using Hadoop Map Reduce
anish749/nyc-tlc-minh
anish749/redis-operator
A golang based redis operator that will make/oversee Redis standalone/cluster mode setup on top of the Kubernetes.
anish749/requests-scala
A Scala port of the popular Python Requests HTTP client: flexible, intuitive, and straightforward to use.
anish749/scala-lint-default-substitution
Scalafix Linting Rule to find default value substitution
anish749/scala-lint-pattern-match-with-cons-cell
Scalafix Linting for Pattern Matching with cons cells (Scala 2.12)
anish749/scala-workshop
Scala Workshop @ SpEC 2019
anish749/scio
A Scala API for Apache Beam and Google Cloud Dataflow.
anish749/spark
Apache Spark - A unified analytics engine for large-scale data processing
anish749/spark-dynamodb
Plug-and-play implementation of an Apache Spark custom data source for AWS DynamoDB.
anish749/spark-regex-delimited-csv
Spark Data Source for csv files with Regex as a delimiter
anish749/styx
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
anish749/univocity-parsers
uniVocity-parsers is a suite of extremely fast and reliable parsers for Java. It provides a consistent interface for handling different file formats, and a solid framework for the development of new parsers.