sebastiankress's Stars
wildfly/wildfly
WildFly Application Server
apache/spark
Apache Spark - A unified analytics engine for large-scale data processing
delta-io/delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
TouK/plumber
plumber helps you tame NiFi flow
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
jfrazee/awesome-nifi
A list of useful Apache NiFi resources, processor bundles and tools
israelio/Apache-nifi-links
microsoft/SynapseML
Simple and Distributed Machine Learning
databricks/learning-spark
Example code from Learning Spark book
josephmisiti/awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
scikit-learn/scikit-learn
scikit-learn: machine learning in Python
pandas-dev/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
dmlc/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow