Pinned Repositories
dbt-core-rh
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
ingestd-tpcdi
Data Integration via Confluent Kafka
lens
Lenses, Folds, and Traversals - Join us on freenode #haskell-lens
programming_notes
ravihindocha.github.io
tpc-di_benchmark
Benchmark for Airflow with BigQuery as the Data Warehouse using TPC - DI
raavioli's Repositories
raavioli/dbt-core-rh
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
raavioli/ingestd-tpcdi
Data Integration via Confluent Kafka
raavioli/lens
Lenses, Folds, and Traversals - Join us on freenode #haskell-lens
raavioli/ravihindocha.github.io
raavioli/tpc-di_benchmark
Benchmark for Airflow with BigQuery as the Data Warehouse using TPC - DI
raavioli/amundsen
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
raavioli/aqueduct
The control center for ML in the cloud
raavioli/awesome
😎 Awesome lists about all kinds of interesting topics
raavioli/awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
raavioli/build-your-own-x
Master programming by recreating your favorite technologies from scratch.
raavioli/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
raavioli/Computer-Science-Education-Resources
A place for programming language instructors to share educational materials
raavioli/datacontract-specification
The Data Contract Specification Repository
raavioli/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
raavioli/free-for-dev
A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev
raavioli/free-programming-books
:books: Freely available programming books
raavioli/h2o-llmstudio
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs
raavioli/hudi
Upserts, Deletes And Incremental Processing on Big Data.
raavioli/iceberg
Apache Iceberg
raavioli/jitsu
Jitsu is an open-source Segment alternative. Fully-scriptable data ingestion engine for modern data teams. Set-up a real-time data pipeline in minutes, not days
raavioli/mage-ai
🧙 The modern replacement for Airflow. Build, run, and manage data pipelines for integrating and transforming data.
raavioli/mlflow
Open source platform for the machine learning lifecycle
raavioli/onnx
Open standard for machine learning interoperability
raavioli/presto
The official home of the Presto distributed SQL query engine for big data
raavioli/python-mastery
Advanced Python Mastery (course by @dabeaz)
raavioli/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
raavioli/spark
Apache Spark - A unified analytics engine for large-scale data processing
raavioli/spec
The AsyncAPI specification allows you to create machine-readable definitions of your asynchronous APIs.
raavioli/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
raavioli/zenml
ZenML 🙏: Build portable, production-ready MLOps pipelines. https://zenml.io.