nadenf's Stars
elastic/elasticsearch
Free and Open Source, Distributed, RESTful Search Engine
cleanlab/cleanlab
The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
Angel-ML/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
j-easy/easy-rules
The simple, stupid rules engine for Java
polynote/polynote
A better notebook for Scala (and more)
zio/zio
ZIO — A type-safe, composable library for async and concurrent programming in Scala
pinterest/querybook
Querybook is a Big Data Querying UI, combining collocated table metadata and a simple notebook interface.
elementary-data/elementary
The dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service with premium features.
json4s/json4s
JSON library
twosigma/flint
A Time Series Library for Apache Spark
rhaidiz/broxy
An HTTP/HTTPS intercept proxy written in Go.
elpy1/ssh-over-ssm
SSH over AWS SSM. No bastions or public-facing instances. SSH user management through IAM. No requirement to store SSH keys locally or on server.
openebs/mayastor
Dynamically provision Stateful Persistent Replicated Cluster-wide Fabric Volumes & Filesystems for Kubernetes that is provisioned from an optimized NVME SPDK backend data storage stack.
doyoubi/undermoon
Mordern Redis Cluster solution for easy operation.
zio/zio-json
Fast, secure JSON library with tight ZIO integration.
Kubeinit/kubeinit
Ansible automation to have a KUBErnetes cluster INITialized as soon as possible...
avast/scala-server-toolkit
Functional programming toolkit for building server applications in Scala.
qubole/rubix
Cache File System optimized for columnar formats and object stores
djspiewak/sbt-github-packages
A simple sbt plugin for publishing to GitHub Packages, in the style of sbt-sonatype and sbt-bintray
leobenkel/ZparkIO
Boiler plate framework to use Spark and ZIO together.
harana/search
Search everything, instantly.
dimajix/flowman
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.
springml/spark-salesforce
Spark data source for Salesforce
potix2/spark-google-spreadsheets
Google Spreadsheets datasource for SparkSQL and DataFrames
wouterken/htmltoadf
An HTML to Atlassian Document Format (ADF) converter, written in Rust
ascendix/salesforce-jdbc
gerardnico/calcite
Calcite Demo
hhu-bsinfo/hadroNIO
Transparent acceleration for Java NIO applications via UCX
gcrowder/programming-language-classifier
A naive bayes classifier for programming languages built using python and scikit-learn.
rahulsingh303/druid-helm
Druid helm chart