r39132's Stars
apache/superset
Apache Superset is a Data Visualization and Data Exploration Platform
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
mlflow/mlflow
Open source platform for the machine learning lifecycle
facebook/prophet
Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
PRQL/prql
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
apache/cassandra
Apache Cassandra®
snorkel-team/snorkel
A system for quickly generating training data with weak supervision
uber/aresdb
A GPU-powered real-time analytics storage and query engine.
voldemort/voldemort
An open source clone of Amazon's Dynamo.
wiredtiger/wiredtiger
WiredTiger's source tree
Netflix/curator
ZooKeeper client wrapper and rich ZooKeeper framework
twitter/bijection
Reversible conversions between types
sebdah/dynamic-dynamodb
Dynamic DynamoDB provides auto scaling for AWS DynamoDB
cashapp/pranadb
justmarkham/pycon-2016-tutorial
Machine Learning with Text in scikit-learn
etsy/boundary-layer
Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform
paypal/gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
jingwei/krati
A hash-based high-performance data store
cutting/trevni
A column file format
paypal/NNAnalytics
NameNodeAnalytics is a self-help utility for scouting and maintaining the namespace of an HDFS instance.
paypal/yurita
Anomaly detection framework @ PayPal
sriramsrao/sailfish
Mirror of Apache Hadoop common
coursera/aegisthus
A Bulk Data Pipeline out of Cassandra
r39132/airflow
Airflow is a system to programmatically author, schedule, and monitor data pipelines.