Dataminded

Belgium

Pinned Repositories

blog-tpcds-dbt-duckdb
This repository contains the TPC-DS queries, together with the code required to run this benchmark for dbt and DuckDB.
Language: HCL 17 5 00
conveyor-roadmap
This is the public roadmap for Conveyor.
1 3 710
conveyor-samples
Samples on how to use Conveyor.
Language: Jupyter Notebook 3 4 00
conveyor-templates
Cookiecutter templates used by Conveyor.
Language: Python 1 4 21
demo-elections2024-website
Language: TypeScript 12 3 53
demo-llm-hackathon
Language: Jupyter Notebook 5 1 02
lighthouse
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Language: Scala 61 29 310
python-and-spark-for-data-analysis
A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course given by Patrick Varilly to one of our clients in December 2015
Language: Jupyter Notebook 11 7 08
spark_on_azure_batch_demo
Language: Python 6 2 02
webinar-containers
Language: HCL 14 3 03

Dataminded's Repositories

datamindedbe/lighthouse
Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines and apply best practices.
Language: Scala 61 29 310
datamindedbe/blog-tpcds-dbt-duckdb
This repository contains the TPC-DS queries, together with the code required to run this benchmark for dbt and DuckDB.
Language: HCL 17 5 00
datamindedbe/demo-elections2024-website
Language: TypeScript 12 3 53
datamindedbe/demo-llm-hackathon
Language: Jupyter Notebook 5 1 02
datamindedbe/incubator-sync-upgrade
Language: Python 5 3 30
datamindedbe/blog-platform-quack-quack-ka-ching
The duck escapes with the credits.
3 2 0
datamindedbe/conveyor-samples
Samples on how to use Conveyor.
Language: Jupyter Notebook 3 4 00
datamindedbe/iceberg-ingestion
Public repository containing sample code for how to improve ETL ingestion processes with Apache Iceberg
Language: Python 3 15 02
datamindedbe/homebrew-conveyor-formulas
Brew tap repository for Conveyor
Language: Python 2 4 10
datamindedbe/terraform-provider-conveyor
2 3 0
datamindedbe/academy_git
Language: Python 1 2 03
datamindedbe/academy_linux
Language: Shell 1 2 04
datamindedbe/conveyor-templates
Cookiecutter templates used by Conveyor.
Language: Python 1 4 21
datamindedbe/dbt-testing-hackathon
Language: Python 1 3 0
datamindedbe/playground-duckdb-wasm
Language: JavaScript 1 2 0
datamindedbe/academy-capstone
5 039
datamindedbe/aws-glue-data-catalog-client-for-apache-hive-metastore
The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog as a central repository to store structural and operational metadata for their data. AWS Glue provides out-of-box integration with Amazon EMR that enables customers to use the AWS Glue Data Catalog as an external Hive Metastore. This is an open-source implementation of the Apache Hive Metastore client on Amazon EMR clusters that uses the AWS Glue Data Catalog as an external Hive Metastore. It serves as a reference implementation for building a Hive Metastore-compatible client that connects to the AWS Glue Data Catalog. It may be ported to other Hive Metastore-compatible platforms such as other Hadoop and Apache Spark distributions.
Language: Java 2 0
datamindedbe/dbt-conveyor-snowflake
The Conveyor Snowflake adapter is a thin shell around the Snowflake adapter that lets users in Conveyor IDEs authenticate with Snowflake to run dbt projects.
Language: Python 2 0
datamindedbe/dbt-playground
Try out dbt in a Gitpod environment in one click, with a Postgres database pre-configured
3 0
datamindedbe/ecr-mirror
Mirror public repositories to internal ECR repos
Language: Python 2 0
datamindedbe/eks-spark-benchmark
Performance optimization for Spark running on Kubernetes
Language: Scala 2 0
datamindedbe/git-credential-oauth
A Git credential helper that securely authenticates to GitHub, GitLab and Bitbucket using OAuth.
Language: Go 1 0
datamindedbe/iris
Artifacts related to a training on running stream processing pipelines
Language: Kotlin 2 0
datamindedbe/kubernetes_academy_course
Language: Dockerfile 4 03
datamindedbe/playground-engine-query
Language: Rust 3 0
datamindedbe/snowflake-gitpod
Language: Jupyter Notebook 4 5
datamindedbe/spark-sql-perf
Language: Scala 2 0
datamindedbe/terraform-aws-eks
Terraform module to create an Elastic Kubernetes (EKS) cluster and associated resources 🇺🇦
Language: HCL 2 0
datamindedbe/terraform-provider-dmcloud
3 0
datamindedbe/webinar-cross-dag-Airflow
Language: Python 2 0