da-tubi
Data Engineer @ TubiTV, a Scala enthusiast, for personal open source projects, I'm @darcy-shen.
Tubi
da-tubi's Stars
dremio/dremio-oss
Dremio - the missing link in modern data
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
ultralytics/yolov5
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
GoogleCloudPlatform/gsutil
A command line tool for interacting with cloud storage services.
MarquezProject/marquez
Collect, aggregate, and visualize a data ecosystem's metadata
OpenLineage/OpenLineage
An Open Standard for lineage metadata collection
Netflix/metacat
datahub-project/datahub
The Metadata Platform for your Data and AI Stack
dbt-labs/dbt-core
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
shaypal5/lazyimport
lazyimport lets you import python modules lazily.
mnmelo/lazy_import
A module for lazy loading of Python modules
deephacks/awesome-jvm
A curated list of awesome loosely performance related JVM stuff. Inspired by awesome-python.
awesomedata/awesome-public-datasets
A topic-centric list of HQ open datasets.
andialbrecht/sqlparse
A non-validating SQL parser module for Python
alculquicondor/psqlparse
A python module that gives access to PostgreSQL's query parser, for turning SQL into a parse tree.
mariomka/regex-benchmark
It's just a simple regex benchmark of different programming languages.
gpanther/regex-libraries-benchmarks
A JMH benchmark for different Java regular expressions libraries
fsspec/s3fs
S3 Filesystem
jrderuiter/airflow-fs
Composable filesystem hooks and operators for Apache Airflow.
ms32035/airflow-dag-dependencies
Visualize dependencies between Airflow DAGs
Flowminder/pytest-airflow
pytest support for airflow
oldratlee/oldratlee
whoami / my profile
pureconfig/pureconfig
A boilerplate-free library for loading configuration files
databricks/spark-redshift
Redshift data source for Apache Spark
com-lihaoyi/mill
Mill is a fast JVM build tool that supports Java, Scala and Kotlin. 2-4x faster than Gradle and 4-10x faster than Maven for common workflows, Mill aims to make your project’s build process performant, maintainable, and flexible
sparsetech/toml-scala
TOML parser with codec derivation for the Scala platform
47degrees/github4s
A GitHub API wrapper written in Scala
debezium/debezium
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
antlr/antlr4
ANTLR (ANother Tool for Language Recognition) is a powerful parser generator for reading, processing, executing, or translating structured text or binary files.
ThoughtWorksInc/sbt-example
Run Scaladoc as unit tests