shauryashaurya
20+ years of cloud, big data, analytics, machine learning, consulting and tech leadership.
Bombay, India
Pinned Repositories
airbyte
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
bRAG
Braggosaurus Rex, the fragrant one, dragon of prague, keeper of the ragtag dragoons...
CooleRE
coolRE (cooler) is a set of regular expression engines written in Python: a toy engine for learning, then a backtracking engine, and finally an NFA-DFA based engine.
inside-a-data-engine
What's inside a data engine? Let's build one from scratch, for fun (and profit).
kandinsky
Kandinsky - analysis of color in photographic images through clustering and other algorithms.
learn-data-munging
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars, etc.
shauryashaurya.github.io
Static pages for shauryashaurya's web presence
The-Meat-and-Potatoes-of-MLOps
The Meat and Potatoes of MLOps - essentials that make the practice unique compared to XXX-Ops (Dev, Data, Others).
tutorial-x.509certificates-mongo
Tutorial for building self-signed X.509 certificates on Windows 10 and using them with MongoDB
shauryashaurya's Repositories
shauryashaurya/The-Meat-and-Potatoes-of-MLOps
The Meat and Potatoes of MLOps - essentials that make the practice unique compared to XXX-Ops (Dev, Data, Others).
shauryashaurya/Learning-How-Machines-Learn
Practical notes and references on common machine learning algorithms. Let's Go!
shauryashaurya/apache-pinot
Apache Pinot - A realtime distributed OLAP datastore
shauryashaurya/chrishayuk-embeddings
shauryashaurya/ClickHouse
ClickHouse® is a free analytics DBMS for big data
shauryashaurya/ComfyUI
The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.
shauryashaurya/EcZachly-data-engineer-handbook
This is a repo with links to everything you'd ever want to learn about data engineering
shauryashaurya/gluten
Gluten: Plugin to Double SparkSQL's Performance
shauryashaurya/gpt4all
gpt4all: run open-source LLMs anywhere
shauryashaurya/huggingface-peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
shauryashaurya/hyperswitch
An open source payments switch written in Rust to make payments fast, reliable and affordable
shauryashaurya/labmlai-annotated_deep_learning_paper_implementations
🧠 Minimal implementations/tutorials of deep learning papers with side-by-side notes
shauryashaurya/lena-voita-yandex-nlp-course
YSDA course in Natural Language Processing by Lena Voita
shauryashaurya/milesial-Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
shauryashaurya/mlabonne-llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
shauryashaurya/observable-plot
A concise API for exploratory data visualization implementing a layered grammar of graphics
shauryashaurya/OpenColorIO
A color management framework for visual effects and animation.
shauryashaurya/scrapscript
A functional, content-addressable programming language.
shauryashaurya/sql-mysteries
Inspired by @veltman's command-line mystery, use SQL to research clues and find out whodunit!
shauryashaurya/substrait
A cross platform way to express data transformation, relational algebra, standardized record expression and plans.
shauryashaurya/tensorflow-probability
Probabilistic reasoning and statistical analysis in TensorFlow
shauryashaurya/ThinkPython
Jupyter notebooks and other resources for Think Python by Allen Downey, published by O'Reilly Media.
shauryashaurya/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
shauryashaurya/TNIA-gpu-image-analysis-deep-learning
A short course exploring GPU image analysis and deep learning
shauryashaurya/uBlock
uBlock Origin - An efficient blocker for Chromium and Firefox. Fast and lean.
shauryashaurya/upscayl-custom-models
A repository for extra custom models for Upscayl.
shauryashaurya/vega
A visualization grammar.
shauryashaurya/vega-compassql
CompassQL Query Language for visualization recommendation.
shauryashaurya/vega-lite
A concise grammar of interactive graphics, built on Vega.
shauryashaurya/warp
A Python framework for high performance GPU simulation and graphics