Pinned Repositories
30problems
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
connectors
Connectors for Delta Lake
Daft
Distributed DataFrames for Python designed for the cloud, powered by Rust
DatabricksContent
Examples surrounding Databricks.
dataframe-rules-engine
Extensible Rules Engine for custom Dataframe / Dataset validation
dbx
CLI tool for advanced Databricks jobs management.
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
deltaray
Delta reader for the Ray open-source toolkit for building ML applications
docker-spark
Apache Spark Docker image
VenkatH's Repositories
VenkatH/30problems
VenkatH/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
VenkatH/connectors
Connectors for Delta Lake
VenkatH/Daft
Distributed DataFrames for Python designed for the cloud, powered by Rust
VenkatH/DatabricksContent
Examples surrounding Databricks.
VenkatH/dataframe-rules-engine
Extensible Rules Engine for custom Dataframe / Dataset validation
VenkatH/dbx
CLI tool for advanced Databricks jobs management.
VenkatH/delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
VenkatH/deltaray
Delta reader for the Ray open-source toolkit for building ML applications
VenkatH/docker-spark
Apache Spark Docker image
VenkatH/HiBench
HiBench is a big data benchmark suite.
VenkatH/kafka-delta-ingest
A highly efficient daemon for streaming data from Kafka into Delta Lake
VenkatH/machine-learning-notebook-series
Jupyter notebook series for machine learning and deep learning.
VenkatH/OAP
Optimized Analytics Package for Spark* Platform
VenkatH/Papers-Literature-ML-DL-RL-AI
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
VenkatH/phoenix
Mirror of Apache Phoenix
VenkatH/phoenix-connectors
Apache Phoenix Connectors
VenkatH/PSTL
Parallel Streaming Transformation Loader
VenkatH/python-and-rust-tools
Introduction to Command-line tools with Python and Rust
VenkatH/rust-mlops-template
A work in progress to build out solutions in Rust for MLOps
VenkatH/spark-adaptive
VenkatH/spark-expectations
A Python library to support running data quality rules while the Spark job is running ⚡
VenkatH/spark-kafka-vertica
A template for connecting Vertica and Kafka using Spark (Batch mode and Structured Streaming)
VenkatH/spark-notes
VenkatH/spark-perf
Performance tests for Apache Spark
VenkatH/StreamingBench
A streaming benchmark supporting Flink and Spark
VenkatH/Sumac
Argument parsing in Scala