Pinned Repositories
AI-Journey-Code
AI Journey code.
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
delta-rs
A native Rust library for Delta Lake, with bindings into Python and Ruby.
duckdb
DuckDB is an in-process SQL OLAP Database Management System
kubectl-trace
Schedule bpftrace programs on your kubernetes cluster using the kubectl
kubernetes
Production-Grade Container Scheduling and Management
ParquetDemo
Apache Parquet Reader/Writer in Java.
url-shortener
URL Shortener service built with serverless framework on AWS, API Gateway + Lambda + DynamoDB.
guihaojin's Repositories
guihaojin/airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
guihaojin/arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.
guihaojin/cdk-workshop
AWS CDK workshop
guihaojin/credstash
A little utility for managing credentials in the cloud
guihaojin/delta-rs
A native Rust library for Delta Lake, with bindings into Python and Ruby.
guihaojin/duckdb
DuckDB is an in-process SQL OLAP Database Management System
guihaojin/embedded-jetty-jsp
Example of Embedded Jetty with JSP support
guihaojin/firecracker
Secure and fast microVMs for serverless computing.
guihaojin/full-stack-fastapi-postgresql
Full stack, modern web application generator. Using FastAPI, PostgreSQL as database, Docker, automatic HTTPS and more.
guihaojin/GuiceInAction
Guice DI in practice.
guihaojin/iceberg
Apache Iceberg
guihaojin/incubator-livy
Mirror of Apache livy (Incubating)
guihaojin/jetty.project
Eclipse Jetty® - Web Container & Clients - supports HTTP/2, HTTP/1.1, HTTP/1.0, websocket, servlets, and more
guihaojin/kafka
Mirror of Apache Kafka
guihaojin/kubectl-trace
Schedule bpftrace programs on your kubernetes cluster using the kubectl
guihaojin/kubernetes
Production-Grade Container Scheduling and Management
guihaojin/learning
Learning code.
guihaojin/loft
Namespace & Virtual Cluster Manager for Kubernetes - Lightweight Virtual Clusters, Self-Service Provisioning for Engineers and 70% Cost Savings with Sleep Mode
guihaojin/OpenMetadata
Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.
guihaojin/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
guihaojin/skywalking
APM, Application Performance Monitoring System
guihaojin/smithy
Smithy is a protocol-agnostic interface definition language and set of tools for generating clients, servers, and documentation for any programming language.
guihaojin/spark
Apache Spark
guihaojin/TDengine
An open-source big data platform designed and optimized for the Internet of Things (IoT).
guihaojin/tomcat
Apache Tomcat
guihaojin/arrow-datafusion
Apache Arrow DataFusion and Ballista query engines
guihaojin/ascend-sdk-examples
Ascend SDK sample code.
guihaojin/dagster
An orchestration platform for the development, production, and observation of data assets.
guihaojin/omni-cli
Omni CLI pypi.
guihaojin/spark-snowflake
Snowflake Data Source for Apache Spark.