shauryashaurya
20+ years of cloud, big data, analytics, machine learning, consulting and tech leadership.
Bombay, India
Pinned Repositories
airbyte
Data integration platform for ELT pipelines from APIs, databases & files to warehouses & lakes.
arrow
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
bRAG
Braggosaurus Rex, the fragrant one, dragon of prague, keeper of the ragtag dragoons...
CooleRE
coolRE (cooler) is a set of regular expression engines written in Python - implementing a toy engine for learning, then one based on backtracking and finally a NFA-DFA based engine.
inside-a-data-engine
What's inside a data engine? Let's build one from scratch, for fun (and profit).
kandinsky
Kandinsky - analysis of color in photographic images through clustering and other algorithms.
learn-data-munging
Notes on Data Engineering with Pandas, PySpark, Dask, Ray, Arrow DataFusion, Polars etc.
shauryashaurya.github.io
Static pages for shauryashaurya's web presence
The-Meat-and-Potatoes-of-MLOps
The Meat and Potatoes of MLOps - essentials that make the practice unique compared to XXX-Ops (Dev, Data, Others).
tutorial-x.509certificates-mongo
Tutorial for building self signed X.509 certificates on Windows 10 and using them with MongoDB
shauryashaurya's Repositories
shauryashaurya/kandinsky
Kandinsky - analysis of color in photographic images through clustering and other algorithms.
shauryashaurya/arrow-ballista
Apache Arrow Ballista Distributed Query Engine
shauryashaurya/arrow-datafusion-python
Apache Arrow DataFusion Python Bindings
shauryashaurya/superset
Apache Superset is a Data Visualization and Data Exploration Platform
shauryashaurya/The-Meat-and-Potatoes-of-MLOps
The Meat and Potatoes of MLOps - essentials that make the practice unique compared to XXX-Ops (Dev, Data, Others).
shauryashaurya/bRAG
Braggosaurus Rex, the fragrant one, dragon of prague, keeper of the ragtag dragoons...
shauryashaurya/inside-a-data-engine
What's inside a data engine? Let's build one from scratch, for fun (and profit).
shauryashaurya/arrow-datafusion
Apache Arrow DataFusion SQL Query Engine
shauryashaurya/chroma
the AI-native open-source embedding database
shauryashaurya/cinepi-raw
raw cinema dng recorder application based on libcamera-apps.
shauryashaurya/ClickHouse
ClickHouse® is a free analytics DBMS for big data
shauryashaurya/dair-ai-ML-Papers-Explained
Explanation to key concepts in ML
shauryashaurya/dair-ai-Prompt-Engineering-Guide
🐙 Guides, papers, lecture, notebooks and resources for prompt engineering
shauryashaurya/gluten
Gluten: Plugin to Double SparkSQL's Performance
shauryashaurya/huggingface-transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
shauryashaurya/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
shauryashaurya/karpathy-LLM101n
LLM101n: Let's build a Storyteller
shauryashaurya/lkpy
Python recommendation toolkit
shauryashaurya/llama.cpp
Port of Facebook's LLaMA model in C/C++
shauryashaurya/NirDiamant_RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
shauryashaurya/ollama
Get up and running with Llama 2, Mistral, and other large language models.
shauryashaurya/Pints-AI-1.5-Pints
A compact LLM pretrained in 9 days by using high quality data
shauryashaurya/quivr
Your GenAI Second Brain 🧠 A personal productivity assistant (RAG) ⚡️🤖 Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented generation.
shauryashaurya/rapidsai-cudf
cuDF - GPU DataFrame Library
shauryashaurya/scipy
SciPy library main repository
shauryashaurya/Scrapegraph-ai
Python scraper based on AI
shauryashaurya/statsmodels
Statsmodels: statistical modeling and econometrics in Python
shauryashaurya/subtitleedit
the subtitle editor :)
shauryashaurya/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
shauryashaurya/upscayl
🆙 Upscayl - Free and Open Source AI Image Upscaler for Linux, MacOS and Windows built with Linux-First philosophy.