jayvanzyl
Full-stack technologist focusing on computational social science.
ecosystem.AiUnited States, Palo Alto
jayvanzyl's Stars
hpgrahsl/kafka-connect-mongodb
**Unofficial / Community** Kafka Connect MongoDB Sink Connector -> integrated 2019 into the official MongoDB Kafka Connector here: https://www.mongodb.com/kafka-connector
mongodb/mongo-spark
The MongoDB Spark Connector
Redocly/redoc
📘 OpenAPI/Swagger-generated API Reference Documentation
dongqianwei/presto-localcsv
a presto plugin supporting read csv files in local filesystem.
jmrozanec/trino-teradata-connector
Presto-Teradata connector
prestodb/presto-hive-apache
Shaded version of Apache Hive for Presto
prestodb/presto-python-client
Python DB-API client for Presto
prestodb/presto-admin
A tool to install, configure and manage Presto installations
trinodb/trino
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
apache/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
astronomer/dag-factory
Dynamically generate Apache Airflow DAGs from YAML configuration files
rawgraphs/rawgraphs-app
A web interface to create custom vector-based visualizations on top of RAWGraphs core
uber/grafana-dash-gen
grafana dash dash dash gen
apache/gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
alteryx/featuretools
An open source python library for automated feature engineering
hopshadoop/hops
Hops Hadoop is a distribution of Apache Hadoop with distributed metadata.
prestodb/presto
The official home of the Presto distributed SQL query engine for big data
kubeflow/kubeflow
Machine Learning Toolkit for Kubernetes
uber/marmaray
Generic Data Ingestion & Dispersal Library for Hadoop
ludwig-ai/ludwig
Low-code framework for building custom LLMs, neural networks, and other AI models
grafana/grafana
The open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
Hvass-Labs/TensorFlow-Tutorials
TensorFlow Tutorials with YouTube Videos
EdwardRaff/JSAT
Java Statistical Analysis Tool, a Java library for Machine Learning
h2oai/h2o-flow
Web based interactive computing environment for H2O
h2oai/h2o-droplets
Templates for projects based on top of H2O.
mozilla/DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
d3/d3
Bring data to life with SVG, Canvas and HTML. :bar_chart::chart_with_upwards_trend::tada:
almende/vis
⚠️ This project is not maintained anymore! Please go to https://github.com/visjs
marcotcr/lime
Lime: Explaining the predictions of any machine learning classifier
h2oai/h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.