gauravkumar37
Scaling Big Data products for 14 years. Senior Staff Engineer at Disney+ Hotstar; previously at MakeMyTrip and IBM. Passionate about data, Rust, Linux, and AI.
India
Pinned Repositories
druid
Apache Druid: a high performance real-time analytics database.
spark
Apache Spark - A unified analytics engine for large-scale data processing
aichat
Using ChatGPT/GPT-3.5/GPT-4 in the terminal.
arangodb
ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
clipboard-rdc
Easily share files via Clipboard of Remote Desktop Connections (RDC)
gradle-versioning-plugin
A Gradle plugin implementing https://semver.org/
hadoop-twitter
Hadoop & Big Data magic applied on tweets
hive2-jdbc
Hive JDBC connection examples including simple and kerberos authentication methods.
styli.sh
A CLI tool for easy wallpaper management and image fetching
ydata-profiling
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
gauravkumar37's Repositories
gauravkumar37/clipboard-rdc
Easily share files via Clipboard of Remote Desktop Connections (RDC)
gauravkumar37/hadoop-twitter
Hadoop & Big Data magic applied on tweets
gauravkumar37/gradle-versioning-plugin
A Gradle plugin implementing https://semver.org/
gauravkumar37/styli.sh
A CLI tool for easy wallpaper management and image fetching
gauravkumar37/aichat
Using ChatGPT/GPT-3.5/GPT-4 in the terminal.
gauravkumar37/arangodb
ArangoDB is a native multi-model database with flexible data models for documents, graphs, and key-values. Build high performance applications using a convenient SQL-like query language or JavaScript extensions.
gauravkumar37/camus
LinkedIn's Kafka to HDFS pipeline.
gauravkumar37/django-thumbs
Easy powerful thumbnails for Django: http://code.google.com/p/django-thumbs mirror ('original' branch) and fork ('master' branch).
gauravkumar37/druid
Column oriented distributed data store ideal for powering interactive applications
gauravkumar37/feast
Feature Store for Machine Learning
gauravkumar37/gunicorn
gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
gauravkumar37/hadoop-book
Example source code accompanying O'Reilly's "Hadoop: The Definitive Guide" by Tom White
gauravkumar37/hive2-jdbc
Hive JDBC connection examples including simple and kerberos authentication methods.
gauravkumar37/pandas-profiling
Create HTML profiling reports from pandas DataFrame objects
gauravkumar37/socialzoo
A real-time analytics platform for aggregating social activities.
gauravkumar37/spark
Mirror of Apache Spark
gauravkumar37/webogram
Telegram UNOFFICIAL web application, GPL v3
gauravkumar37/axum-prometheus
🔎 Prometheus metrics middleware for Axum
gauravkumar37/mise
dev tools, env vars, task runner
gauravkumar37/rifgen