baolsen
10+ years experience as a data engineer. Primarily designing and developing big data solutions both on-premise and in AWS cloud, with a focus on data lakes.
@cloudandthingsSouth Africa
baolsen's Stars
python/cpython
The Python programming language
rclone/rclone
"rsync for cloud storage" - Google Drive, S3, Dropbox, Backblaze B2, One Drive, Swift, Hubic, Wasabi, Google Cloud Storage, Azure Blob, Azure Files, Yandex Files
GoogleContainerTools/kaniko
Build Container Images In Kubernetes
infracost/infracost
Cloud cost estimates for Terraform in pull requests💰📉 Shift FinOps Left!
hashicorp/terraform-provider-aws
The AWS Provider enables Terraform to manage AWS resources.
open-metadata/OpenMetadata
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
treeverse/lakeFS
lakeFS - Data version control for your data lake | Git for data
open-telemetry/opentelemetry-specification
Specifications for OpenTelemetry
canonical/cloud-init
Official upstream for the cloud-init: cloud instance initialization
astanin/python-tabulate
Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
infracost/vscode-infracost
See cost estimates for Terraform right in your editor💰📉
r1chardj0n3s/parse
Parse strings using a specification based on the Python format() syntax.
techno-tim/launchpad
A collection of quick starters for ansible, kubernetes, docker, linux, windows, and more. Great for HomeLabs!
amannn/action-semantic-pull-request
A GitHub Action that ensures that your PR title matches the Conventional Commits spec.
terraform-aws-modules/terraform-aws-lambda
Terraform module, which takes care of a lot of AWS Lambda/serverless tasks (build dependencies, packages, updates, deployments) in countless combinations 🇺🇦
unitystation/unitystation
The original unitystation
resgateio/resgate
A Realtime API Gateway used with NATS to build REST, real time, and RPC APIs, where all your clients are synchronized seamlessly.
aws-samples/aws-cudos-framework-deployment
Command Line Interface tool for Cloud Intelligence Dashboards deployment
data-dot-all/dataall
A modern data marketplace that makes collaboration among diverse users (like business, analysts and engineers) easier, increasing efficiency and agility in data projects on AWS.
GoogleCloudPlatform/terraform-python-testing-helper
Simple Python test helper for Terraform.
ktrueda/parquet-tools
easy install parquet-tools
AbsaOSS/cobrix
A COBOL parser and Mainframe/EBCDIC data source for Apache Spark
trussworks/terraform-aws-logs
Creates and configures an S3 bucket for storing AWS logs.
cloud-custodian/pytest-terraform
pytest terraform plugin with fixtures and offline replay support
trussworks/terraform-aws-cloudtrail
Creates and configures AWS CloudTrail
andrewjroth/requests-auth-aws-sigv4
Use AWS signature version 4 Authentication with the python requests module
nozaq/terraform-aws-lambda-auto-package
A terraform module to define a lambda function which source files are automatically built and packaged for lambda deployment.
cloudandthings/terraform-aws-costnotifier
cloudandthings/cat-lakefs-demo
AbsaOSS/spark-metadata-tool
Tool to fix _spark_metadata from Structured Streaming queries